Zusammenfassung
In Medizin und Gesundheitswesen sind immer größere Mengen immer vielfältigerer Daten verfügbar, die zunehmend schneller generiert werden. Dieser allgemeine Trend wird als Big Data bezeichnet. Die Analyse von Big Data mit Methoden des maschinellen Lernens führt zur Entwicklung innovativer Lösungen, die neue medizinische Einsichten generieren und die Qualität und Effizienz im Gesundheitssystem erhöhen können. Prototypische Beispiele existieren im Bereich der Analyse klinischer Texte, der klinischen Entscheidungsunterstützung, der Analyse von Daten aus öffentlichen Datenquellen oder Wearables und in Form der Entwicklung persönlicher Assistenten. Diese Potenziale bringen aber auch neue Herausforderungen im Bereich Datenschutz und in der Transparenz bzw. Nachvollziehbarkeit der Ergebnisse für den medizinischen Experten mit sich.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
In der Statistik und im maschinellen Lernen haben sich unterschiedliche Sprachgebräuche herausgebildet, dort werden Merkmale Variablen genannt und insbesondere das Zielmerkmal als abhängige Variable bezeichnet.
Literatur
Aggarwal CC, Yu PS (2008) Privacy-preserving data mining: models and algorithms. Springer, US
Alaa AM, Hu S, Schaar M (2017) Learning from clinical judgments: semi-markov-modulated marked Hawkes processes for risk prognosis. Proceedings of the 34th international conference on machine learning. PMLR 70:60–69
Amir S et al (2017) Quantifying Mental Health from Social Media with Neural User Embeddings. Proceedings of machine learning for healthcare 2017, PMLR 68:306–321
Bishop C (2006) Pattern recognition and machine learning. Springer, New York
Blecker S et al (2016) Comparison of approaches for heart failure case identification from electronic health record data. J Am Med Assoc Cardiol 1:1014–1020. https://doi.org/10.1001/jamacardio.2016.3236
Butler D (2013) When Google got flu wrong, US outbreak foxes a leading web-based method for tracking seasonal flu. Nature 494:155–156. http://www.nature.com/news/when-google-got-flu-wrong-1.12413. Zugegriffen: 6. Juni 2018
Choi E et al (2016) Doctor AI: predicting clinical events via recurrent neural networks. Proceedings of the 1st machine learning for healthcare conference. PMLR 56:301–318
Craven MW, Shavlik JW (1996) Extracting tree-structured representations of trained networks. Adv Neural Process Sys 8:24–30
Dempsey WH et al (2016) iSurvive: an interpretable, event-time prediction model for mHealth. Proceedings of the 34th international conference on machine learning. PMLR 70:970–979
Dernoncourt F et al (2017) De-identification of patient notes with recurrent neural networks. J Am Med Inf Assoc 24:596–606. https://doi.org/10.1093/jamia/ocw156
Doshi-Velez F et al (2017) Accountability of AI under the law: the role of explanation. https://arxiv.org/abs/1711.01134. Zugegriffen: 9. Juni 2018
Dwork C (2006) Differential privacy. 33rd international colloquium on automata, languages and programming, part II (ICALP 2006). Springer, Heidelberg, S 1–12
Esteva A, Kuprel B, Novoa RA, Ko J, Swetter SM, Blau HM, Thrun S (2017) Dermatologist-level classification of skin cancer with deep neural networks. Nature 542:115–118. https://doi.org/10.1038/nature21056
Ferrucci D et al (2010) Building Watson: an overview of the deepQA Project. AI magazin, Fall 2010, Association for the Advancement of Artificial Intelligence, S 59–79
Fletcher RR et al (2011) Wearable sensor platform and mobile application for use in cognitive behavioral therapy for drug addiction and PTSD. Proceedings of 2011 annual international conference of the IEEE engineering in medicine and biology society, Boston, S 1802–1805. https://doi.org/10.1109/iembs.2011.6090513
Futoma J, Hariharan S, Heller K (2017) Learning to detect sepsis with a multitask Gaussian process RNN classifier. Proceedings of the 34th international conference on machine learning. PMLR 70:1174–1182
Gardner J, Xiong L (2009) An integrated framework for de-identifying unstructured medical data. Data Knowl Eng 68:1441–1451. https://doi.org/doi.org/10.1016/j.datak.2009.07.006
Gartner (2017) Gartner IT glossary, big data – from the Gartner IT glossary: what is big data? https://www.gartner.com/it-glossary/big-data. Zugegriffen: 22. Juni 2018
Garvin JH et al (2018) Automating quality measures for heart failure using natural language processing: a descriptive study in the department of veterans affairs. JMIR Med Inform 6(1):e5
Giansanti Daniele et al (2008) Assessment of fall-risk by means of a neural network based on parameters assessed by a wearable device during posturography. Med Eng Phys 30:367–372
Ginsberg J et al (2009) Detecting influenza epidemics using search engine query data. Nature 457:1012–1014. https://doi.org/10.1038/nature07634
Gonzalez-Hernandez G et al (2017) Capturing the patient’s perspective: a review of advances in natural language processing of health-related text. IMIA Yearb Med Inform 1:214–227
Goodfellow I, Bengio Y (2016) Deep learning. MIT Press. http://deeplearningbook.org. Zugegriffen: 6. Juni 2018
Grace K, Salvatier J, Dafoe A, Zhang B, O (2017) When will AI exceed human performance? Evidence from AI experts. arXiv preprint. arXiv:1705.08807
Gravina R et al (2017) Multi-sensor fusion in body sensor networks: state-of-the-art and research challenges. Inf Fusion 35:68–80
Grosskreutz H et al (2012) An enhanced relevance criterion for more concise supervised pattern. KDD ’12, Proceedings of the 18th ACM SIGKDD conference on knowledge discovery and data mining (KDD 2012). ACM, S 1442–1450
Grosskreutz H, Lemmen B, Rüping S (2010) Privacy-preserving data-mining. Informatik-Spektrum 33:380–383
Gurulingappa H et al (2013) Automatic detection of adverse events to predict drug label changes using text and data mining techniques. Pharmacoepidemiol Drug Saf 22:1189–1194. https://doi.org/10.1002/pds.3493
Haq HUI, Ahmad R, Hussain SUI (2017) Intelligent EHRs: predicting procedure codes from diagnosis codes. 31st conference on neural information processing systems (NIPS 2017), Long Beach
Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference, and prediction, 2. Aufl. Springer, New York
Kao HC, Tang KF, Chang EY (2018) Context-aware symptom checking for disease diagnosis using hierarchical reinforcement learning. Proceedings of AAAI coference on artificial intelligence
Karssemeijer N, Laak JAWM van der, and the CAMELYON16 Consortium (2017) Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. JAMA 318:2199–2210. https://doi.org/10.1001/jama.2017.14585
Kim B, Khanna R, Koyejo S (2016) Examples are not Enough, Learn to Criticize! Criticism for Interpretability. Neural Information Processing Systems. Adv Neural Inf Process Syst 2280–2288
King RC et al (2017) Application of data fusion techniques and technologies for wearable health monitoring. Med Eng Phys 42:1–12
Kreimeyer K et al (2017) Natural language processing systems for capturing and standardizing unstructured clinical information. A systematic review. J Biomed Inform 73:14–29. https://doi.org/doi.org/10.1016/j.jbi.2017.07.012
Laney D (2001) 3D data management: controlling data volume, velocity and variety. META Group, Stamford
Leaman R, Khare R, Lu Z (2015) Challenges in clinical natural language processing for automated disorder normalization. J Biomed Inform 57:28–37. https://doi.org/doi.org/10.1016/j.jbi.2015.07.010
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444
Limsopatham N, Collier N (2016) Normalising medical concepts in social media texts by learning semantic representation. Proceedings of the 54th annual meeting of the association for computational linguistics, Berlin, S 1014–1023
Lipton ZC et al (2016) Learning to diagnose with LSTM recurrent neural networks. International conference on learning representations (ICLR 2016)
Madan S et al (2016) The BEL information extraction workflow (BELIEF): evaluation in the biocreative v bel and iat track. Database J Biol Database Curation 2016:baw136 (PMC)
Mikolov T et al (2013) Distributed representations of words and phrases and their compositionality. Adv Neural Inf Process Sys 26:3111–3119
Mintz M, Bills S, Snow R, Jurafsky D (2009) Distant supervision for relation extraction without labeled data. Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP 2. Association for Computational Linguistics, Stroudsburg, USA, S 1003–1011
Miotto R et al (2015) Deep learning for healthcare: review, opportunities and challenges. Brief Bioinform, bbx044. https://doi.org/10.1093/bib/bbx044
Montavon G, Samek W, Müller KR (2017) Methods for interpreting and understanding deep neural networks. Digit Signal Process 73:1–15
Nguyen H, Patrick J (2016) Text mining in clinical domain: dealing with noise. KDD ’16, Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining 22, ACM, S 549–558
Nguyen P, Tran T, Wickramasinghe N, Venkatesh S (2017) Deepr: a convolutional net for medical records. IEEE J Biomed Health Inform 21:22–30. https://doi.org/10.1109/JBHI.2016.2633963
Nicolas J et al (2013) A data mining approach for grouping and analyzing trajectories of care using claim data: the example of breast cancer. BMC Med Inform Decis Making 13:130. https://doi.org/10.1186/1472-6947-13-130
Nosenge N (2016) Can you teach old drugs new tricks? Nature 534:314–316. https://doi.org/10.1038/534314a
O’Connor K et al (2014) Pharmacovigilance on twitter? Mining tweets for adverse drug reactions. Am Med Inform Assoc 2014:924–933
Osthus D et al (2017) Dynamic Bayesian influenza forecasting in the United States with hierarchical discrepancy. arXiv preprint. arXiv:1708.09481
Pommerening K et al (2014) Leitfaden zum Datenschutz in medizinischen Forschungsprojekten – Generische Lösungen der TMF 2.0. Medizinisch Wissenschaftliche Verlagsgesellschaft
Quinlan JR (1993) C4.5: Programs for machine learning. Machine learning. Morgan Kaufmann, San Mateo
Rajpurkar P, Hannun AY, Haghpanahi M, Bourn C, Ng AY (2017) Cardiologist-level arrhythmia detection with convolutional neural networks. arXiv preprint. arXiv:1707.01836
Ribeiro MT, Singh S, Guestrin C (2016) “Why should I trust you?”: Explaining the Predictions of any classifier. KDD ’16, Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, S 1135-1144. https://doi.org/10.1145/2939672.2939778
Röhrig B et al (2009) Types of study in medical research. Deutsch Ärtzteblatt Int 106:262–268. https://doi.org/10.3238/arztebl.2009.0262
Rotmensch M et al (2017) Learning a health knowledge graph from electronic medical records. Sci Rep 7:5994. https://doi.org/10.1038/s41598-017-05778-z
Ruud KL et al (2010) Automated detection of follow-up appointments using text mining of discharge records. Int J Qual Health Care 22:229–235. https://doi.org/10.1093/intqhc/mzq012
Salarian A et al (2007) Ambulatory monitoring of physical activities in patients with parkinson’s disease. IEEE Trans Biomed Eng 54:2296–2299. https://doi.org/10.1109/TBME.2007.896591
Sculley D et al (2015) Hidden technical debt in machine learning systems. Adv Neural Inf Process Sys 28:817–824
Sebastiani P, Mandl KD, Szolovits P, Kohane IS, Ramoni MF (2006) A Bayesian dynamic model for influenza surveillance. Stat Med 25:1803–1825. https://doi.org/10.1002/sim.2566
Semigran HL, Levine DM, Nundy S, Mehrotra A (2016) Comparison of physician and computer diagnostic accuracy. J Am Med Assoc Int Med 176:1860–1861. https://doi.org/10.1001/jamainternmed.2016.6001
Shearer C (2000) The CRISP-DM model: the new blueprint for data mining. J Data Warehouse 5:13–22
Stuart EA (2010) Matching methods for causal inference: a review and a look forward. Stat Sci 25:1–21. https://doi.org/10.1214/09-STS313
Sutton R, Barto A (1998) Reinforcement learning: an introduction. MIT Press, Cambridge
Sweeney L (2000) Simple demographics often identify people uniquely. Carnegie Mellon University, Data privacy Working Paper 3. Pittsburgh
Szegedy et al (2015). Going deeper with convolutions. Proceedings of 2015 IEEE conference on computer vision and pattern recognition (CVPR). IEEE, Boston
Tamang S et al (2015) Detecting unplanned care from clinician notes in electronic health records. J Oncol Pract 11:3
Turing AM (1950) Computing machinery and intelligence. Mind 49:433–460
Vasan G, Pilarski PM (2017) Learning from demonstration: teaching a myoelectric prosthesis with an intact limb via reinforcement learning. 2017 International Conference on Rehabilitation Robotics (ICORR), London, S 1457–1464. https://doi.org/10.1109/icorr.2017.8009453
Wang Z, Brudno M (2017) Towards a directory of rare disease specialists: identifying experts from publication history. Proceedings of machine learning for healthcare 2017. PMLR: 352–360
Wang X, Sontag D, Wang F (2014) Unsupervised learning of disease progression models. KDD ’14, Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, New York, S 85–94 https://doi.org/10.1145/2623330.2623754
Wang Y et al (2018) Clinical information extraction applications: a literature review. J Biomed Inf 77:34–49. https://doi.org/10.1016/j.jbi.2017.11.011
Yang YP et al (2017) The effects of an activity promotion system on active living in overweight subjects with metabolic abnormalities. Obes Res Clin Pract 11:718-727. https://doi.org/10.1016/j.orcp.2017.06.002
Yang Y, Fasching PA, Tresp V (2017) Predictive modeling of therapy decisions in metastatic breast cancer with recurrent neural network encoder and multinomial hierarchical regression decoder. 2017 IEEE international conference on healthcare informatics (ICHI), Park City, S 46–55. https://doi.org/10.1109/ichi.2017.51
Yildirim P, Ekmekci IO, Holzinger A (2013) On knowledge discovery in open medical data on the example of the fda drug adverse event reporting system for alendronate (Fosamax). Human-computer interaction and knowledge discovery in complex, unstructured, Big Data, S 95–206
Yumak Z, Pu P (2013) Survey of sensor-based personal wellness management systems. BioNanoSci 3:254–269. https://doi.org/10.1007/s12668-013-0099-0
Zarringhalam K et al (2014) Robust clinical outcome prediction based on Bayesian analysis of transcriptional profiles and prior causal networks. Bioinformatics 30:i69–i77. https://doi.org/10.1093/bioinformatics/btu272
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer-Verlag GmbH Deutschland, ein Teil von Springer Nature
About this chapter
Cite this chapter
Rüping, S., Sander, J. (2019). Big Data in Gesundheitswesen und Medizin. In: Haring, R. (eds) Gesundheit digital. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-57611-3_2
Download citation
DOI: https://doi.org/10.1007/978-3-662-57611-3_2
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-57610-6
Online ISBN: 978-3-662-57611-3
eBook Packages: Medicine (German Language)