Skip to main content

Data Quality in Mobile Sensing Datasets for Pervasive Healthcare

  • Chapter
  • First Online:
Handbook of Large-Scale Distributed Computing in Smart Healthcare

Abstract

Mobile sensing is becoming a popular approach for inferring patterns of activity and behavior to determine how they affect health and wellbeing. This data-driven approach has the potential to become a major tool in the field of epidemiology, aimed at determining the causes of disease in populations, as well as motivating behavior change. These sensing technologies are generating large datasets that demand significant processing and data management resources. Studies in mobile sensing for healthcare have motivated the creation of large, complex datasets with information opportunistically gathered from distributed sensors in mobile devices. In this chapter, we discuss some of the architectural challenges regarding data gathering in this distributed data-intensive environment such as the healthcare industry, as well as issues regarding the organization and sharing of the large amounts of data collected. Some of these issues include the heterogeneity of the devices, diversity of sensors used, and the need for data provenance when integrating datasets from diverse studies. We highlight that assessing data quality is of paramount importance for conducting longitudinal studies and building on historical knowledge as new data become available. Finally, we identify future research topics in the growing field of mobile sensing and its application to healthcare and wellbeing. We discuss aspects of data curation, data quality, and data provenance, and we provide suggestions on how these challenges could be addressed in the near future.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    A sensing campaign is a data collection campaign following a research protocol. A sensing campaign can involve tens or thousands of participants carrying one or more mobile devices, which opportunistically collect data from the user or her surroundings, or ask the user to carry out certain task (e.g., take a picture when feeling sad or answer a question).

  2. 2.

    https://www.nih.gov/research-training/allofus-research-program.

  3. 3.

    https://dmponline.dcc.ac.uk/.

References

  1. Calatroni, A., Roggen, D., Tröster, G. (2011). Collection and curation of a large reference dataset for activity recognition. In Systems, Man, and Cybernetics (SMC), 2011 IEEE International Conference on (pp. 30–35). IEEE.

    Google Scholar 

  2. Lichen, M. (2013). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.

  3. Alcalá, J., Fernández, A., Luengo, J., Derrac, J., García, S., Sánchez, L., and Herrera, F. (2010). Keel data-mining software tool: Data set repository, integration of algorithms and experimental analysis framework. Journal of Multiple-Valued Logic and Soft Computing, 17(255–287).

    Google Scholar 

  4. Rücknagel, J., et al. Metadata Schema for the Description of Research Data Repositories: version 3.0, 29 p, DOI:http://doi.org/10.2312/re3.008.

  5. White, Hollie C., et al. “The Dryad data repository: A Singapore framework metadata architecture in a DSpace environment.” Universitätsverlag Göttingen (2008): 157.

    Google Scholar 

  6. Data Observation Network for Earth (DataONE). Data Citation and Attribution; https://www.dataone.org/citing-dataone (accessed Sep. 2016).

  7. Boose, E, A Ellison, L Osterweil, L Clarke, R Podorozhny, J Hadley, A Wise, and D Foster (2007) Ensuring reliable datasets for environmental models and forecasts. Ecological Informatics, 2(3):237–247.

    Google Scholar 

  8. Voss, EA., Makadia, R., Matcho, A., Ma, Q., Knoll, C., Scheme, M., DeFalco, FJ., Lonche, A., Zhu, V., Ryan, PB. (2015). Feasibility and utility of applications of the common data model to multiple, disparate observational health databases. Journal of the American Medical Informatics Association. 22(3):553–64.

    Google Scholar 

  9. Estrin D, Sim I. Health care delivery. Open mHealth architecture: an engine for health care innovation. Science 2010 Nov 5;330(6005):759–760.

    Google Scholar 

  10. Kumar, S., Abowd, G. D., Abraham, W. T., al’Absi, M., Beck, J. G., Chau, D. H., Genevan, D. (2015). Center of excellence for mobile sensor Data-to-Knowledge (MD2K). Journal of the American Medical Informatics Association, 22(6), 1137–1142.

    Google Scholar 

  11. Netzahualcóyotl Hernández, and Jesús Favela. Estimating the Perception of Physical Fatigue Among Older Adults Using Mobile Phones. Human Behavior Understanding. Volume 9277 of the series Lecture Notes in Computer Science. 978-3-319-24194-4.

    Google Scholar 

  12. Darien Miranda, Jesus Favela, Catalina Ibarra, Netzahualcóyotl Cruz. Naturalistic Enactment to Elicit and Recognize Caregiver State Anxiety. Journal of Medical Systems, 2016.

    Google Scholar 

  13. Tactiohealth. Retrieved at November 14th 2016, from http://www.tactiohealth.com/

  14. Garmin. Retrieved at November 14th 2016, from http://www.garmin.com/en-US

  15. Fitbit. Retrieved at November 14th 2016, from https://www.fitbit.com/mx

  16. Owletcare. Retrieved at November 14th 2016, from http://www.owletcare.com/

  17. Zephyrhealth. Retrieved at November 14th 2016, from https://zephyrhealth.com/

  18. Withings. Retrieved at November 14th 2016, from http://www.withings.com/us/en/

  19. Neumitra. Retrieved at November 14th 2016, from https://www.neumitra.com/

  20. Jins-meme. Retrieved at November 14th 2016, from https://jins-meme.com/en/

  21. Proteus. Retrieved at November 14th 2016, from http://www.proteus.com/

  22. Ihealthlabs. Retrieved at November 14th 2016, from https://ihealthlabs.com/

  23. ResearchKit framework. Retrieved 22/09/2016, from http://researchkit.org/

  24. Aharon, N., Gardner, A., Sumter, C., Peatland, A.: Funf: Open Sensing Framework. http://funf.media.mit.edu (2011).

  25. Hernández, N., Arnrich, B., Favela, J., Yavuz, GR., Demiray, B., Fontecha, J., Ersoy, C. mk-sense: An extensible platform to conduct multi-institutional mobile sensing campaigns. Ubiquitous Computing & Ambient Intelligence. Health, AAL, HCI, IoT, Smart Cities, Sensors & Security. UCAmI 2016 (Accepted for publication).

    Google Scholar 

  26. Gersch, M., Lindert, R., Hewing, M.: AAL-business models: Different Prospects for the Successful Implementation of Innovative Services in the Primary and Secondary Healthcare Market. In: AALIANCE Conference, Malaga, Spain (2010).

    Google Scholar 

  27. Walkonomics. Retrieved at November 14th 2016, from http://walkonomics.com

  28. Walkscore. Retrieved at November 14th 2016, from http://walkscore.com

  29. Daniele Quercia, Luca Maria Aiello, Rossano Schifanella, and Adam Davies. 2015. The Digital Life of Walkable Streets. In Proceedings of the 24th International Conference on World Wide Web (WWW ‘15). International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, Switzerland, 875–884.

    Google Scholar 

  30. The Library. Data Management. Retrieved 22/09/2016, from http://libraries.ucsd.edu/services/data-curation/data-management

  31. Van den Lynden, V., Corti, L., Willard, M. & Bishop, L. (2009). Managing and Sharing Data: A Best Practice Guide for Researchers. Retrieved 22/09/2016, from http://www.data-archive.ac.uk/media/2894/managingsharing.pdf

  32. DCC Curation Lifecycle Model. Retrieved 22/08/2016, from http://www.dcc.ac.uk/resources/curation-lifecycle-model

  33. Radebaugh, J. Understanding Metadata. National Information Standards Organization, 2004. 20 pp. ISBN: 978-1-880124-62-8.

    Google Scholar 

  34. Sweeney L, Crosas M, Bar-Sinai M. Sharing Sensitive Data with Confidence: The Datatags System. Technology Science. 2015101601. October 16, 2015. http://techscience.org/a/2015101601

  35. Reiner Schlitzer. Oceanographic quality flag schemes and mappings between them. Alfred Wegener Institute for Polar and Marine Research. Retrieved 22/09/2016, from http://odv.awi.de/fileadmin/user_upload/odv/misc/ODV4_QualityFlagSets.pdf

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Netzahualcóyotl Hernández .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this chapter

Cite this chapter

Hernández, N., Castro, L.A., Favela, J., Michán, L., Arnrich, B. (2017). Data Quality in Mobile Sensing Datasets for Pervasive Healthcare. In: Khan, S., Zomaya, A., Abbas, A. (eds) Handbook of Large-Scale Distributed Computing in Smart Healthcare. Scalable Computing and Communications. Springer, Cham. https://doi.org/10.1007/978-3-319-58280-1_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-58280-1_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-58279-5

  • Online ISBN: 978-3-319-58280-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics