Distributed sampling storage for statistical analysis of massive sensor data

Hiroshi Sato, Hisashi Kurasawa, Takeru Inoue, Motonori Nakamura, Hajime Matsumura, Keiichi Koyanagi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Cyber-physical systems interconnect the cyber world with the physical world in which sensors are massively networked to monitor the physical world. Various services are expected to be able to use sensor data reflecting the physical world with information technology. Given this expectation, it is important to simultaneously provide timely access to massive data and reduce storage costs. We propose a data storage scheme for storing and querying massive sensor data. This scheme is scalable by adopting a distributed architecture, fault-tolerant even without costly data replication, and enables users to efficiently select multi-scale random data samples for statistical analysis. We implemented a prototype system based on our scheme and evaluated its sampling performance. The results show that the prototype system exhibits lower latency than a conventional distributed storage system.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages233-243
Number of pages11
Volume7465 LNCS
DOIs
Publication statusPublished - 2012
EventInternational Cross-Domain Conference and Workshop on Availability, Reliability, and Security, CD-ARES 2012 - Prague
Duration: 2012 Aug 202012 Aug 24

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7465 LNCS
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

OtherInternational Cross-Domain Conference and Workshop on Availability, Reliability, and Security, CD-ARES 2012
CityPrague
Period12/8/2012/8/24

Fingerprint

Statistical Analysis
Statistical methods
Sampling
Sensor
Sensors
Prototype
Data Replication
Distributed Architecture
Information technology
Data Storage
Storage System
Interconnect
Information Technology
Fault-tolerant
Latency
Data storage equipment
Distributed Systems
Monitor
Costs
Cyber Physical System

Keywords

  • data accuracy
  • random sampling
  • relaxed durability

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Sato, H., Kurasawa, H., Inoue, T., Nakamura, M., Matsumura, H., & Koyanagi, K. (2012). Distributed sampling storage for statistical analysis of massive sensor data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7465 LNCS, pp. 233-243). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7465 LNCS). https://doi.org/10.1007/978-3-642-32498-7_18

Distributed sampling storage for statistical analysis of massive sensor data. / Sato, Hiroshi; Kurasawa, Hisashi; Inoue, Takeru; Nakamura, Motonori; Matsumura, Hajime; Koyanagi, Keiichi.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 7465 LNCS 2012. p. 233-243 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7465 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Sato, H, Kurasawa, H, Inoue, T, Nakamura, M, Matsumura, H & Koyanagi, K 2012, Distributed sampling storage for statistical analysis of massive sensor data. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 7465 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 7465 LNCS, pp. 233-243, International Cross-Domain Conference and Workshop on Availability, Reliability, and Security, CD-ARES 2012, Prague, 12/8/20. https://doi.org/10.1007/978-3-642-32498-7_18
Sato H, Kurasawa H, Inoue T, Nakamura M, Matsumura H, Koyanagi K. Distributed sampling storage for statistical analysis of massive sensor data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 7465 LNCS. 2012. p. 233-243. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-642-32498-7_18
Sato, Hiroshi ; Kurasawa, Hisashi ; Inoue, Takeru ; Nakamura, Motonori ; Matsumura, Hajime ; Koyanagi, Keiichi. / Distributed sampling storage for statistical analysis of massive sensor data. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 7465 LNCS 2012. pp. 233-243 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{6724e32583cd4e1b8bb54cfc002066e0,
title = "Distributed sampling storage for statistical analysis of massive sensor data",
abstract = "Cyber-physical systems interconnect the cyber world with the physical world in which sensors are massively networked to monitor the physical world. Various services are expected to be able to use sensor data reflecting the physical world with information technology. Given this expectation, it is important to simultaneously provide timely access to massive data and reduce storage costs. We propose a data storage scheme for storing and querying massive sensor data. This scheme is scalable by adopting a distributed architecture, fault-tolerant even without costly data replication, and enables users to efficiently select multi-scale random data samples for statistical analysis. We implemented a prototype system based on our scheme and evaluated its sampling performance. The results show that the prototype system exhibits lower latency than a conventional distributed storage system.",
keywords = "data accuracy, random sampling, relaxed durability",
author = "Hiroshi Sato and Hisashi Kurasawa and Takeru Inoue and Motonori Nakamura and Hajime Matsumura and Keiichi Koyanagi",
year = "2012",
doi = "10.1007/978-3-642-32498-7_18",
language = "English",
isbn = "9783642324970",
volume = "7465 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "233--243",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - Distributed sampling storage for statistical analysis of massive sensor data

AU - Sato, Hiroshi

AU - Kurasawa, Hisashi

AU - Inoue, Takeru

AU - Nakamura, Motonori

AU - Matsumura, Hajime

AU - Koyanagi, Keiichi

PY - 2012

Y1 - 2012

N2 - Cyber-physical systems interconnect the cyber world with the physical world in which sensors are massively networked to monitor the physical world. Various services are expected to be able to use sensor data reflecting the physical world with information technology. Given this expectation, it is important to simultaneously provide timely access to massive data and reduce storage costs. We propose a data storage scheme for storing and querying massive sensor data. This scheme is scalable by adopting a distributed architecture, fault-tolerant even without costly data replication, and enables users to efficiently select multi-scale random data samples for statistical analysis. We implemented a prototype system based on our scheme and evaluated its sampling performance. The results show that the prototype system exhibits lower latency than a conventional distributed storage system.

AB - Cyber-physical systems interconnect the cyber world with the physical world in which sensors are massively networked to monitor the physical world. Various services are expected to be able to use sensor data reflecting the physical world with information technology. Given this expectation, it is important to simultaneously provide timely access to massive data and reduce storage costs. We propose a data storage scheme for storing and querying massive sensor data. This scheme is scalable by adopting a distributed architecture, fault-tolerant even without costly data replication, and enables users to efficiently select multi-scale random data samples for statistical analysis. We implemented a prototype system based on our scheme and evaluated its sampling performance. The results show that the prototype system exhibits lower latency than a conventional distributed storage system.

KW - data accuracy

KW - random sampling

KW - relaxed durability

UR - http://www.scopus.com/inward/record.url?scp=84865658914&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84865658914&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-32498-7_18

DO - 10.1007/978-3-642-32498-7_18

M3 - Conference contribution

SN - 9783642324970

VL - 7465 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 233

EP - 243

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -