Skip to main content

Validating Big Data, a Big Issue

  • Chapter
  • First Online:
Efficacy Analysis in Clinical Trials an Update

Abstract

Big data consist of multiple fractions of small data. If you wish your big data to be valid, then you will, first, have to make sure, that the fractions are validated:

by the use of the scientific rules for clinical trials, and, in addition,

by the use of traditional diagnostic test validations.

Once this is all done well and good, only then you will be at the starting point of a serious big data analysis. Unfortunately, this is a pretty laborious scenario, and, although, currently, many data bases of big data do exist, most of them are, documentedly, of a poor quality and un-validated. Big data analyses tend to suffer from too many null-values, lack of experienced analysis teams, lacking validation tools, limited validation checklists. Big data tools are in expensive commercial software, and have not been judged by Academia. The best approach to big data analyses may be the use of large checklists, multiple analysis teams, and the use of multiple independent computers with simple programs rather than supercomputers with complex programs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 79.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Becker D, King T, Mcmullen B (2015) Big data, big data quality problem. Proceedings of IEEE international conference on bog data, pp 2264–3053.

    Google Scholar 

  • Gao G, Xie C, Tao C (2016) Big data validation and quality assurance issues, challenges and needs. In: Proceedings of IEEE symposium on service oriented system engineering, Oxford, UK, pp 433–441.

    Google Scholar 

  • Gassman J, Owens W, Kuntz T, Martin J, Amoroso W (1995) Data quality assurance, monitoring, and reporting. Control Clin Trials 16:104–136

    Article  Google Scholar 

  • Laranjeiro N, Soydemir S, Bernardino J (2015) A survey on data quality, classifying poor data. In: Proceedings of IEEE 21st Pacific Rim international symposium, pp 179–188.

    Google Scholar 

  • Woodall P, Gao J, Parlikad A, Koronios A (2015) Classifying data quality problems in asset management. Springer Publications, Heidelberg

    Book  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Cleophas, T.J., Zwinderman, A.H. (2019). Validating Big Data, a Big Issue. In: Efficacy Analysis in Clinical Trials an Update. Springer, Cham. https://doi.org/10.1007/978-3-030-19918-0_20

Download citation

Publish with us

Policies and ethics