Clinical Natural Language Processing with Deep Learning

Hasan, Sadid A.; Farri, Oladimeji

doi:10.1007/978-3-030-05249-2_5

Sadid A. Hasan⁴ &
Oladimeji Farri⁴

2955 Accesses
16 Citations

Abstract

The emergence and proliferation of electronic health record (EHR) systems has incrementally resulted in large volumes of clinical free text documents available across healthcare networks. The huge amount of data inspires research and development focused on novel clinical natural language processing (NLP) solutions to optimize clinical care and improve patient outcomes. In recent years, deep learning techniques have demonstrated superior performance over traditional machine learning (ML) techniques for various general-domain NLP tasks, e.g., language modeling, parts-of-speech (POS) tagging, named entity recognition, paraphrase identification, sentiment analysis, etc. Clinical documents pose unique challenges compared to general-domain text due to the widespread use of acronyms and nonstandard clinical jargons by healthcare providers, inconsistent document structure and organization, and requirement for rigorous de-identification and anonymization to ensure patient data privacy. This tutorial chapter will present an overview of how deep learning techniques can be applied to solve NLP tasks in general, followed by a literature survey of existing deep learning algorithms applied to clinical NLP problems. Finally, we include a description of various deep learning-driven clinical NLP applications developed at the artificial intelligence (AI) lab in Philips Research in recent years—such as diagnostic inferencing from unstructured clinical narratives, relevant biomedical article retrieval based on clinical case scenarios, clinical paraphrase generation, adverse drug event (ADE) detection from social media, and medical image caption generation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Alsaffar, M., Yellowlees, P., Odor, A., Hogarth, M.: The state of open source electronic health record projects: a software anthropology study. JMIR Med. Inform. 5(1), e6 (2017)
Article Google Scholar
Bengio, Y., Courville, A., Vincent, P.: Representation learning: a review and new perspectives. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1798–1828 (2013)
Article Google Scholar
Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, Cambridge (2016). http://www.deeplearningbook.org
MATH Google Scholar
Goldberg,Y.: A primer on neural network models for natural language processing. J. Artif. Intell. Res. 57, 345–420 (2016)
Article MathSciNet Google Scholar
Tai, K.S., Socher, R., Manning, C.D.: Improved semantic representations from tree-structured long short-term memory networks. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, ACL 2015, pp. 1556–1566 (2015)
Google Scholar
Stewart, G.W.: On the early history of the singular value decomposition. SIAM Rev. 35(4), 551–566 (1993)
Article MathSciNet Google Scholar
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space (2013). CoRR: abs/1301.3781
Google Scholar
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Proceedings of the 27th Annual Conference on Neural Information Processing Systems NIPS 2013, pp. 3111–3119 (2013)
Google Scholar
Luong, T., Socher, R., Manning, C.D.: Better word representations with recursive neural networks for morphology. In: Proceedings of the Seventeenth Conference on Computational Natural Language Learning, CoNLL 2013, pp. 104–113 (2013)
Google Scholar
Le, Q.V., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of the 31th International Conference on Machine Learning, ICML 2014, Beijing, 21–26 June 2014, pp. 1188–1196 (2014)
Google Scholar
Kim, Y.: Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, pp. 1746–1751 (2014)
Google Scholar
Goldberg, Y.: Neural Network Methods in Natural Language Processing. Morgan & Claypool Publishers, San Rafael (2017)
Book Google Scholar
Johnson, R., Zhang, T.: Effective use of word order for text categorization with convolutional neural networks. In: Proceedings of the 11th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL–HLT), Denver, CO, pp. 103–112 (2015)
Google Scholar
Lee, K., Qadir, A., Hasan, S.A., Datla, V.V., Prakash, A., Liu, J., Farri, O.: Adverse drug event detection in tweets with semi-supervised convolutional neural networks. In: Proceedings of the 26th International Conference on World Wide Web (WWW), pp. 705–714 (2017)
Google Scholar
Gehring, J., Auli, M., Grangier, D., Dauphin, Y.N.: A convolutional encoder model for neural machine translation (2016). ArXiv e-prints
Google Scholar
Gehring, J., Auli, M., Grangier, D., Yarats, D., Dauphin, Y.N.: Convolutional sequence to sequence learning (2017). ArXiv e-prints
Google Scholar
Sutskever, I., Martens, J., Hinton, G.E.: Generating text with recurrent neural networks. In: Proceedings of ICML, pp. 1017–1024 (2011)
Google Scholar
Bengio, Y., Simard, P., Frasconi, P.: Learning long-term dependencies with gradient descent is difficult. IEEE Trans. Neural Netw. 5(2), 157–166 (1994)
Article Google Scholar
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)
Article Google Scholar
Cho, K., van Merrienboer, B., Bahdanau, D., Bengio, Y.: On the properties of neural machine translation: encoder–decoder approaches. In: Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation, pp. 103–111 (2014)
Google Scholar
Sukhbaatar, S., Szlam, A., Weston, J., Fergus, R.: End-to-end memory networks. In: Cortes, C., Lawrence, N.D., Lee, D.D., Sugiyama, M., Garnett, R. (eds.) NIPS, pp. 2440–2448 (2015)
Google Scholar
Weston, J., Chopra, S., Bordes, A.: Memory networks (2014). CoRR: abs/1410.3916
Google Scholar
Miller, A., Fisch, A., Dodge, J., Karimi, A.-H., Bordes, A., Weston, J.: Key-value memory networks for directly reading documents (2016). CoRR: abs/1606.03126
Google Scholar
Watkins, C.J.C.H., Dayan, P.: Q-learning. Mach. Learn. 8, 279–292 (1992)
MATH Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. The MIT Press, Cambridge (1998)
Google Scholar
Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M., Fidjeland, A.K., Ostrovski, G., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015)
Article Google Scholar
Baker, S., Korhonen, A., Pyysalo, S.: Cancer hallmark text classification using convolutional neural networks. In: BioTxtM, pp. 1–9 (2016)
Google Scholar
Lin, C., Miller, T., Dligach, D., Bethard, S., Savova, G.: Representations of time expressions for temporal relation extraction with convolutional neural networks. In: BioNLP, pp. 322–327 (2017)
Google Scholar
Mohan, S., Fiorini, N., Kim, S., Lu, Z.: Deep learning for biomedical information retrieval: learning textual relevance from click logs. In: BioNLP, pp. 222–231 (2017)
Google Scholar
Peng, Y., Lu, Z.: Deep learning for extracting protein-protein interactions from biomedical literature. In: BioNLP, pp. 29–38 (2017)
Google Scholar
Asada, M., Miwa, M., Sasaki, Y.: Extracting drug-drug interactions with attention CNNs. In: BioNLP, pp. 9–18 (2017)
Google Scholar
Chen, M.C., Ball, R.L., Yang, L., Moradzadeh, N., Chapman, B.E., Larson, D.B., Langlotz, C., Amrhein, T.J., Lungren, M.: Deep learning to classify radiology free-text reports. Radiology 286, 845–852 (2017)
Article Google Scholar
Sulieman, L., Gilmore, D., French, C., Cronin, R.M., Jackson, G.P., Russell, M., Fabbri, D.: Classifying patient portal messages using convolutional neural networks. J. Biomed. Inform. 74, 59–70 (2017)
Article Google Scholar
Crichton, G., Pyysalo, S., Chiu, B., Korhonen, A.: A neural network multi-task learning approach to biomedical named entity recognition. BMC Bioinform. 18(1), 368 (2017)
Article Google Scholar
Karimi, S., Dai, X., Hassanzadeh, H., Nguyen, A.: Automatic diagnosis coding of radiology reports: a comparison of deep learning and conventional classification methods. In: BioNLP, pp. 328–332 (2017)
Google Scholar
Feldman, R., Netzer, O., Peretz, A., Rosenfeld, B.: Utilizing text mining on online medical forums to predict label change due to adverse drug reactions. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Sydney, pp. 1779–1788 (2015)
Google Scholar
Lardon, J., Abdellaoui, R., Bellet, F., Asfari, H., Souvignet, J., Texier, N., Jaulent, M.C., Beyens, M.N., Burgun, A., Bousquet, C.: Adverse drug reaction identification and extraction in social media: a scoping review. J. Med. Internet Res. 17(7), e171 (2015)
Article Google Scholar
Sarker, A., Ginn, R.E., Nikfarjam, A., O’Connor, K., Smith, K., Jayaraman, S., Upadhaya, T., Gonzalez, G.: Utilizing social media data for pharmacovigilance: a review. J. Biomed. Inform. 54, 202–212 (2015)
Article Google Scholar
Yang, M., Kiang, M., Shang, W.: Filtering big data from social media - building an early warning system for adverse drug reactions. J. Biomed. Inform. 54(C), 230–240 (2015)
Article Google Scholar
Sarker, A., Gonzalez, G.: Portable automatic text classification for adverse drug reaction detection via multi-corpus training. J. Biomed. Inform. 53, 196–207 (2015)
Article Google Scholar
Liu, X., Chen, H.: Identifying adverse drug events from patient social media: a case study for diabetes. IEEE Intell. Syst. 30(3), 44–51 (2015)
Article Google Scholar
Jagannatha, A.N., Yu, H.: Bidirectional RNN for medical event detection in electronic health records. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 473–482 (2016)
Google Scholar
Jagannatha, A., Yu, H.: Structured prediction models for RNN based sequence labeling in clinical text. In: EMNLP, pp. 856–865 (2016)
Google Scholar
Maharana, A., Yetisgen, M.: Clinical event detection with hybrid neural architecture. In: BioNLP, pp. 351–355 (2017)
Google Scholar
Yadav, S., Ekbal, A., Saha, S., Bhattacharyya, P.: Deep learning architecture for patient data de-identification in clinical records. In: Proceedings of the Clinical Natural Language Processing Workshop (ClinicalNLP), pp. 32–41 (2016)
Google Scholar
Dernoncourt, F., Lee, J.Y., Uzuner, Ö., Szolovits, P.: De-identification of patient notes with recurrent neural networks. J. Am. Med. Inform. Assoc. 24(3), 596–606 (2017)
Google Scholar
Liu, Z., Tang, B., Wang, X., Chen, Q.: De-identification of clinical notes via recurrent neural network and conditional random field. J. Biomed. Inform. 75, S34–S42 (2017)
Article Google Scholar
Salloum, W., Finley, G., Edwards, E., Miller, M., Suendermann-Oeft, D.: Deep learning for punctuation restoration in medical reports. In: BioNLP, pp. 159–164 (2017)
Google Scholar
Patchigolla, R.V.S.S., Sahu, S., Anand, A.: Biomedical event trigger identification using bidirectional recurrent neural network based models. In: BioNLP, pp. 316–321 (2017)
Google Scholar
He, H., Ganjam, K., Jain, N., Lundin, J., White, R., Lin, J.: An insight extraction system on biomedical literature with deep neural networks. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017, Copenhagen, 9–11 Sept 2017, pp. 2691–2701 (2017)
Google Scholar
Chalapathy, R., Borzeshi, E.Z., Piccardi, M.: Bidirectional LSTM-CRF for clinical concept extraction. In: ClinicalNLP@COLING 2016, pp. 7–12 (2016)
Google Scholar
Unanue, I.J., Borzeshi, E.Z., Piccardi, M.: Recurrent neural networks with specialized word embeddings for health-domain named-entity recognition. J. Biomed. Inform. 76, 102–109 (2017)
Article Google Scholar
Liu, Z., Yang, M., Wang, X., Chen, Q., Tang, B., Wang, Z., Xu, H.: Entity recognition from clinical texts via recurrent neural network. BMC Med. Inform. Decis. Mak. 17(2), 67 (2017)
Article Google Scholar
Stanovsky, G., Gruhl, D., Mendes, P.: Recognizing mentions of adverse drug reaction in social media using knowledge-infused recurrent models. In: EACL, pp. 142–151 (2017)
Google Scholar
Sahu, S.K., Anand, A.: Recurrent neural network models for disease name recognition using domain invariant features. In: ACL (2016)
Google Scholar
Elhadad, N., Sutaria, K.: Mining a lexicon of technical terms and lay equivalents. In: Proceedings of the Workshop on BioNLP, pp. 49–56 (2007)
Google Scholar
Deléger, L., Zweigenbaum, P.: Extracting lay paraphrases of specialized expressions from monolingual comparable medical corpora. In: Proceedings of the 2nd Workshop on Building and Using Comparable Corpora: From Parallel to Non-parallel Corpora, pp. 2–10 (2009)
Google Scholar
Wang, C., Cao, L., Zhou, B.: Medical synonym extraction with concept space models. In: Proceedings of IJCAI, pp. 989–995 (2015)
Google Scholar
Prakash, A., Hasan, S.A., Lee, K., Datla, V., Qadir, A., Liu, J., Farri, O.: Neural paraphrase generation with stacked residual LSTM networks. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 2923–2934 (2016)
Google Scholar
Hasan, S.A., Liu, B., Liu, J., Qadir, A., Lee, K., Datla, V.V., Prakash, A., Farri, O.: Neural clinical paraphrase generation with attention. In: Proceedings of the Clinical Natural Language Processing Workshop, ClinicalNLP@COLING, pp. 42–53 (2016)
Google Scholar
Choi, E., Bahadori, M.T., Sun, J.: Doctor AI: predicting clinical events via recurrent neural networks (2015). Preprint. ArXiv:1511.05942
Google Scholar
Choi, E., Bahadori, M.T., Schuetz, A., Stewart, W.F., Sun, J.: Retain: interpretable predictive model in healthcare using reverse time attention mechanism (2016). CoRR: abs/1608.05745
Google Scholar
Netto, S.M.B., de Paiva, A.C., de Almeida Neto, A., Silva, A.C., Leite, V.R.C.: Application on Reinforcement Learning for Diagnosis Based on Medical Image. INTECH Open Access Publisher, London (2008)
Google Scholar
Poolla, R.: A reinforcement learning approach to obtain treatment strategies in sequential medical decision problems. Graduate Theses and Dissertations, University of South Florida (2003)
Google Scholar
Shortreed, S.M., Laber, E., Lizotte, D.J., Stroup, T.S., Pineau, J., Murphy, S.A.: Informing sequential clinical decision-making through reinforcement learning: an empirical study. Mach. Learn. 84(1–2), 109–136 (2011)
Article MathSciNet Google Scholar
Zhao, Y., Zeng, D., Socinski, M.A., Kosorok, M.R.: Reinforcement learning strategies for clinical trials in nonsmall cell lung cancer. Biometrics 67(4), 1422–1433 (2011)
Article MathSciNet Google Scholar
Narasimhan, K., Kulkarni, T.D., Barzilay, R.: Language understanding for text-based games using deep reinforcement learning. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, EMNLP 2015, Lisbon, 17–21 Sept 2015, pp. 1–11 (2015)
Google Scholar
Narasimhan, K., Yala, A., Barzilay, R.: Improving information extraction by acquiring external evidence with reinforcement learning (2016). Preprint. ArXiv:1603.07954
Google Scholar
Prakash, A., Zhao, S., Hasan, S.A., Datla, V.V., Lee, K., Qadir, A., Liu, J., Farri, O.: Condensed memory networks for clinical diagnostic inferencing. In: Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, pp. 3274–3280 (2017)
Google Scholar
Johnson, A.E.W., Pollard, T.J., Shen, L., Lehman, L.-W.H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L.A., Mark, R.G.: MIMIC-III, a freely accessible critical care database. Sci. Data 3, 160035 (2016)
Article Google Scholar
Ling, Y., Hasan, S.A., Datla, V., Qadir, A., Lee, K., Liu, J., Farri, O.: Diagnostic inferencing via improving clinical concept extraction with deep reinforcement learning: a preliminary study. In: Proceedings of the 2nd Machine Learning for Healthcare Conference, pp. 271–285 (2017)
Google Scholar
Roberts, K., Simpson, M.S., Voorhees, E., Hersh, W.R.: Overview of the TREC 2015 clinical decision support track. In: TREC (2015)
Google Scholar
Ling, Y., Hasan, S.A., Datla, V.V., Qadir, A., Lee, K., Liu, J., Farri, O.: Learning to diagnose: assimilating clinical narratives using deep reinforcement learning. In: Proceedings of the Eighth International Joint Conference on Natural Language Processing, IJCNLP, pp. 895–905 (2017)
Google Scholar
Roberts, K., Demner-Fushman, D., Voorhees, E., Hersh, W.R.: Overview of the TREC 2016 clinical decision support track. In: TREC (2016)
Google Scholar
Hasan, S.A., Zhao, S., Datla, V., Liu, J., Lee, K., Qadir, A., Prakash, A., Farri, O.: Clinical question answering using key-value memory networks and knowledge graph. In: TREC (2016)
Google Scholar
Datla, V., Hasan, S.A., Qadir, A., Lee, K., Ling, Y., Liu, J., Farri, O.: Automated clinical diagnosis: the role of content in various sections of a clinical document. In: IEEE-BIBM International Workshop on Biomedical and Health Informatics (2017)
Google Scholar
Pavlick, E., Rastogi, P., Ganitkevitch, J., Van Durme, B., Callison-Burch, C.: PPDB 2.0: better paraphrase ranking, fine-grained entailment relations, word embeddings, and style classification. In: Proceedings of ACL-IJCNLP, pp. 425–430 (2015)
Google Scholar
Lindberg, D., Humphreys, B., McCray, A.: The unified medical language system. Methods Inf. Med. 32(4), 281–291 (1993)
Article Google Scholar
Adduru, V., Hasan, S.A., Liu, J., Ling, Y., Datla, V., Lee, K., Qadir, A., Farri, O.: Towards dataset creation and establishing baselines for sentence-level neural clinical paraphrase generation and simplification. In: Proceedings of the 3rd International Workshop on Knowledge Discovery in Healthcare Data (KDH) @ IJCAI-ECAI (2018)
Google Scholar
Ionescu, B., Müller, H., Villegas, M., Arenas, H., Boato, G., Dang-Nguyen, D.-T., Cid, Y.D., Eickhoff, C., de Herrera, A.G.S., Gurrin, C., Islam, M.B., Kovalev, V., Liauchuk, V., Mothe, J., Piras, L., Riegler, M., Schwall, I.: Overview of ImageCLEF 2017: information extraction from images. In: Experimental IR Meets Multilinguality, Multimodality, and Interaction - 8th International Conference of the CLEF Association, CLEF Proceedings, pp. 315–337 (2017)
Google Scholar
Eickhoff, C., Schwall, I., de Herrera, A.G.S., Müller, H.: Overview of imageclefcaption 2017 - image caption prediction and concept detection for biomedical images. In: Working Notes of CLEF 2017 - Conference and Labs of the Evaluation Forum (2017)
Google Scholar
Hasan, S.A., Ling, Y., Liu, J., Sreenivasan, R., Anand, S., Arora, T.R., Datla, V.V., Lee, K., Qadir, A., Swisher, C., Farri, O.: PRNA at imageclef 2017 caption prediction and concept detection tasks. In: Working Notes of CLEF 2017 - Conference and Labs of the Evaluation Forum (2017)
Google Scholar
Hasan, S.A., Ling, Y., Liu, J., Sreenivasan, R., Anand, S., Arora, T.R., Datla, V., Lee, K., Qadir, A., Swisher, C., Farri, O.: Attention-based medical caption generation with image modality classification and clinical concept mapping. In: Proceedings of the 9th International Conference and Labs of the Evaluation Forum (CLEF) (2018)
Google Scholar
Xu, K., Ba, J., Kiros, R., Cho, K., Courville, A., Salakhudinov, R., Zemel, R., Bengio, Y.: Show, attend and tell: neural image caption generation with visual attention. In: Proceedings of the 32nd International Conference on Machine Learning, vol. 37, pp. 2048–2057 (2015)
Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition (2014). CoRR: abs/1409.1556
Google Scholar

Download references

Author information

Authors and Affiliations

Artificial Intelligence Lab, Philips Research North America, Cambridge, MA, USA
Sadid A. Hasan & Oladimeji Farri

Authors

Sadid A. Hasan
View author publications
You can also search for this author in PubMed Google Scholar
Oladimeji Farri
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sadid A. Hasan .

Editor information

Editors and Affiliations

Philips Research, Eindhoven, The Netherlands
Sergio Consoli
Dept of Mathematics and Computer Science, University of Cagliari, Cagliari, Italy
Diego Reforgiato Recupero
Data Science Department, Philips Research, Eindhoven, The Netherlands
Milan Petković

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Hasan, S.A., Farri, O. (2019). Clinical Natural Language Processing with Deep Learning. In: Consoli, S., Reforgiato Recupero, D., Petković, M. (eds) Data Science for Healthcare. Springer, Cham. https://doi.org/10.1007/978-3-030-05249-2_5

Download citation

DOI: https://doi.org/10.1007/978-3-030-05249-2_5
Published: 24 February 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05248-5
Online ISBN: 978-3-030-05249-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics