Skip to main content

Clinical Natural Language Processing with Deep Learning

  • Chapter
  • First Online:
Data Science for Healthcare

Abstract

The emergence and proliferation of electronic health record (EHR) systems has incrementally resulted in large volumes of clinical free text documents available across healthcare networks. The huge amount of data inspires research and development focused on novel clinical natural language processing (NLP) solutions to optimize clinical care and improve patient outcomes. In recent years, deep learning techniques have demonstrated superior performance over traditional machine learning (ML) techniques for various general-domain NLP tasks, e.g., language modeling, parts-of-speech (POS) tagging, named entity recognition, paraphrase identification, sentiment analysis, etc. Clinical documents pose unique challenges compared to general-domain text due to the widespread use of acronyms and nonstandard clinical jargons by healthcare providers, inconsistent document structure and organization, and requirement for rigorous de-identification and anonymization to ensure patient data privacy. This tutorial chapter will present an overview of how deep learning techniques can be applied to solve NLP tasks in general, followed by a literature survey of existing deep learning algorithms applied to clinical NLP problems. Finally, we include a description of various deep learning-driven clinical NLP applications developed at the artificial intelligence (AI) lab in Philips Research in recent years—such as diagnostic inferencing from unstructured clinical narratives, relevant biomedical article retrieval based on clinical case scenarios, clinical paraphrase generation, adverse drug event (ADE) detection from social media, and medical image caption generation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alsaffar, M., Yellowlees, P., Odor, A., Hogarth, M.: The state of open source electronic health record projects: a software anthropology study. JMIR Med. Inform. 5(1), e6 (2017)

    Article  Google Scholar 

  2. Bengio, Y., Courville, A., Vincent, P.: Representation learning: a review and new perspectives. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1798–1828 (2013)

    Article  Google Scholar 

  3. Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press, Cambridge (2016). http://www.deeplearningbook.org

    MATH  Google Scholar 

  4. Goldberg,Y.: A primer on neural network models for natural language processing. J. Artif. Intell. Res. 57, 345–420 (2016)

    Article  MathSciNet  Google Scholar 

  5. Tai, K.S., Socher, R., Manning, C.D.: Improved semantic representations from tree-structured long short-term memory networks. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, ACL 2015, pp. 1556–1566 (2015)

    Google Scholar 

  6. Stewart, G.W.: On the early history of the singular value decomposition. SIAM Rev. 35(4), 551–566 (1993)

    Article  MathSciNet  Google Scholar 

  7. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space (2013). CoRR: abs/1301.3781

    Google Scholar 

  8. Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Proceedings of the 27th Annual Conference on Neural Information Processing Systems NIPS 2013, pp. 3111–3119 (2013)

    Google Scholar 

  9. Luong, T., Socher, R., Manning, C.D.: Better word representations with recursive neural networks for morphology. In: Proceedings of the Seventeenth Conference on Computational Natural Language Learning, CoNLL 2013, pp. 104–113 (2013)

    Google Scholar 

  10. Le, Q.V., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of the 31th International Conference on Machine Learning, ICML 2014, Beijing, 21–26 June 2014, pp. 1188–1196 (2014)

    Google Scholar 

  11. Kim, Y.: Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, pp. 1746–1751 (2014)

    Google Scholar 

  12. Goldberg, Y.: Neural Network Methods in Natural Language Processing. Morgan & Claypool Publishers, San Rafael (2017)

    Book  Google Scholar 

  13. Johnson, R., Zhang, T.: Effective use of word order for text categorization with convolutional neural networks. In: Proceedings of the 11th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL–HLT), Denver, CO, pp. 103–112 (2015)

    Google Scholar 

  14. Lee, K., Qadir, A., Hasan, S.A., Datla, V.V., Prakash, A., Liu, J., Farri, O.: Adverse drug event detection in tweets with semi-supervised convolutional neural networks. In: Proceedings of the 26th International Conference on World Wide Web (WWW), pp. 705–714 (2017)

    Google Scholar 

  15. Gehring, J., Auli, M., Grangier, D., Dauphin, Y.N.: A convolutional encoder model for neural machine translation (2016). ArXiv e-prints

    Google Scholar 

  16. Gehring, J., Auli, M., Grangier, D., Yarats, D., Dauphin, Y.N.: Convolutional sequence to sequence learning (2017). ArXiv e-prints

    Google Scholar 

  17. Sutskever, I., Martens, J., Hinton, G.E.: Generating text with recurrent neural networks. In: Proceedings of ICML, pp. 1017–1024 (2011)

    Google Scholar 

  18. Bengio, Y., Simard, P., Frasconi, P.: Learning long-term dependencies with gradient descent is difficult. IEEE Trans. Neural Netw. 5(2), 157–166 (1994)

    Article  Google Scholar 

  19. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)

    Article  Google Scholar 

  20. Cho, K., van Merrienboer, B., Bahdanau, D., Bengio, Y.: On the properties of neural machine translation: encoder–decoder approaches. In: Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation, pp. 103–111 (2014)

    Google Scholar 

  21. Sukhbaatar, S., Szlam, A., Weston, J., Fergus, R.: End-to-end memory networks. In: Cortes, C., Lawrence, N.D., Lee, D.D., Sugiyama, M., Garnett, R. (eds.) NIPS, pp. 2440–2448 (2015)

    Google Scholar 

  22. Weston, J., Chopra, S., Bordes, A.: Memory networks (2014). CoRR: abs/1410.3916

    Google Scholar 

  23. Miller, A., Fisch, A., Dodge, J., Karimi, A.-H., Bordes, A., Weston, J.: Key-value memory networks for directly reading documents (2016). CoRR: abs/1606.03126

    Google Scholar 

  24. Watkins, C.J.C.H., Dayan, P.: Q-learning. Mach. Learn. 8, 279–292 (1992)

    MATH  Google Scholar 

  25. Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. The MIT Press, Cambridge (1998)

    Google Scholar 

  26. Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M., Fidjeland, A.K., Ostrovski, G., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015)

    Article  Google Scholar 

  27. Baker, S., Korhonen, A., Pyysalo, S.: Cancer hallmark text classification using convolutional neural networks. In: BioTxtM, pp. 1–9 (2016)

    Google Scholar 

  28. Lin, C., Miller, T., Dligach, D., Bethard, S., Savova, G.: Representations of time expressions for temporal relation extraction with convolutional neural networks. In: BioNLP, pp. 322–327 (2017)

    Google Scholar 

  29. Mohan, S., Fiorini, N., Kim, S., Lu, Z.: Deep learning for biomedical information retrieval: learning textual relevance from click logs. In: BioNLP, pp. 222–231 (2017)

    Google Scholar 

  30. Peng, Y., Lu, Z.: Deep learning for extracting protein-protein interactions from biomedical literature. In: BioNLP, pp. 29–38 (2017)

    Google Scholar 

  31. Asada, M., Miwa, M., Sasaki, Y.: Extracting drug-drug interactions with attention CNNs. In: BioNLP, pp. 9–18 (2017)

    Google Scholar 

  32. Chen, M.C., Ball, R.L., Yang, L., Moradzadeh, N., Chapman, B.E., Larson, D.B., Langlotz, C., Amrhein, T.J., Lungren, M.: Deep learning to classify radiology free-text reports. Radiology 286, 845–852 (2017)

    Article  Google Scholar 

  33. Sulieman, L., Gilmore, D., French, C., Cronin, R.M., Jackson, G.P., Russell, M., Fabbri, D.: Classifying patient portal messages using convolutional neural networks. J. Biomed. Inform. 74, 59–70 (2017)

    Article  Google Scholar 

  34. Crichton, G., Pyysalo, S., Chiu, B., Korhonen, A.: A neural network multi-task learning approach to biomedical named entity recognition. BMC Bioinform. 18(1), 368 (2017)

    Article  Google Scholar 

  35. Karimi, S., Dai, X., Hassanzadeh, H., Nguyen, A.: Automatic diagnosis coding of radiology reports: a comparison of deep learning and conventional classification methods. In: BioNLP, pp. 328–332 (2017)

    Google Scholar 

  36. Feldman, R., Netzer, O., Peretz, A., Rosenfeld, B.: Utilizing text mining on online medical forums to predict label change due to adverse drug reactions. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Sydney, pp. 1779–1788 (2015)

    Google Scholar 

  37. Lardon, J., Abdellaoui, R., Bellet, F., Asfari, H., Souvignet, J., Texier, N., Jaulent, M.C., Beyens, M.N., Burgun, A., Bousquet, C.: Adverse drug reaction identification and extraction in social media: a scoping review. J. Med. Internet Res. 17(7), e171 (2015)

    Article  Google Scholar 

  38. Sarker, A., Ginn, R.E., Nikfarjam, A., O’Connor, K., Smith, K., Jayaraman, S., Upadhaya, T., Gonzalez, G.: Utilizing social media data for pharmacovigilance: a review. J. Biomed. Inform. 54, 202–212 (2015)

    Article  Google Scholar 

  39. Yang, M., Kiang, M., Shang, W.: Filtering big data from social media - building an early warning system for adverse drug reactions. J. Biomed. Inform. 54(C), 230–240 (2015)

    Article  Google Scholar 

  40. Sarker, A., Gonzalez, G.: Portable automatic text classification for adverse drug reaction detection via multi-corpus training. J. Biomed. Inform. 53, 196–207 (2015)

    Article  Google Scholar 

  41. Liu, X., Chen, H.: Identifying adverse drug events from patient social media: a case study for diabetes. IEEE Intell. Syst. 30(3), 44–51 (2015)

    Article  Google Scholar 

  42. Jagannatha, A.N., Yu, H.: Bidirectional RNN for medical event detection in electronic health records. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 473–482 (2016)

    Google Scholar 

  43. Jagannatha, A., Yu, H.: Structured prediction models for RNN based sequence labeling in clinical text. In: EMNLP, pp. 856–865 (2016)

    Google Scholar 

  44. Maharana, A., Yetisgen, M.: Clinical event detection with hybrid neural architecture. In: BioNLP, pp. 351–355 (2017)

    Google Scholar 

  45. Yadav, S., Ekbal, A., Saha, S., Bhattacharyya, P.: Deep learning architecture for patient data de-identification in clinical records. In: Proceedings of the Clinical Natural Language Processing Workshop (ClinicalNLP), pp. 32–41 (2016)

    Google Scholar 

  46. Dernoncourt, F., Lee, J.Y., Uzuner, Ö., Szolovits, P.: De-identification of patient notes with recurrent neural networks. J. Am. Med. Inform. Assoc. 24(3), 596–606 (2017)

    Google Scholar 

  47. Liu, Z., Tang, B., Wang, X., Chen, Q.: De-identification of clinical notes via recurrent neural network and conditional random field. J. Biomed. Inform. 75, S34–S42 (2017)

    Article  Google Scholar 

  48. Salloum, W., Finley, G., Edwards, E., Miller, M., Suendermann-Oeft, D.: Deep learning for punctuation restoration in medical reports. In: BioNLP, pp. 159–164 (2017)

    Google Scholar 

  49. Patchigolla, R.V.S.S., Sahu, S., Anand, A.: Biomedical event trigger identification using bidirectional recurrent neural network based models. In: BioNLP, pp. 316–321 (2017)

    Google Scholar 

  50. He, H., Ganjam, K., Jain, N., Lundin, J., White, R., Lin, J.: An insight extraction system on biomedical literature with deep neural networks. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017, Copenhagen, 9–11 Sept 2017, pp. 2691–2701 (2017)

    Google Scholar 

  51. Chalapathy, R., Borzeshi, E.Z., Piccardi, M.: Bidirectional LSTM-CRF for clinical concept extraction. In: ClinicalNLP@COLING 2016, pp. 7–12 (2016)

    Google Scholar 

  52. Unanue, I.J., Borzeshi, E.Z., Piccardi, M.: Recurrent neural networks with specialized word embeddings for health-domain named-entity recognition. J. Biomed. Inform. 76, 102–109 (2017)

    Article  Google Scholar 

  53. Liu, Z., Yang, M., Wang, X., Chen, Q., Tang, B., Wang, Z., Xu, H.: Entity recognition from clinical texts via recurrent neural network. BMC Med. Inform. Decis. Mak. 17(2), 67 (2017)

    Article  Google Scholar 

  54. Stanovsky, G., Gruhl, D., Mendes, P.: Recognizing mentions of adverse drug reaction in social media using knowledge-infused recurrent models. In: EACL, pp. 142–151 (2017)

    Google Scholar 

  55. Sahu, S.K., Anand, A.: Recurrent neural network models for disease name recognition using domain invariant features. In: ACL (2016)

    Google Scholar 

  56. Elhadad, N., Sutaria, K.: Mining a lexicon of technical terms and lay equivalents. In: Proceedings of the Workshop on BioNLP, pp. 49–56 (2007)

    Google Scholar 

  57. Deléger, L., Zweigenbaum, P.: Extracting lay paraphrases of specialized expressions from monolingual comparable medical corpora. In: Proceedings of the 2nd Workshop on Building and Using Comparable Corpora: From Parallel to Non-parallel Corpora, pp. 2–10 (2009)

    Google Scholar 

  58. Wang, C., Cao, L., Zhou, B.: Medical synonym extraction with concept space models. In: Proceedings of IJCAI, pp. 989–995 (2015)

    Google Scholar 

  59. Prakash, A., Hasan, S.A., Lee, K., Datla, V., Qadir, A., Liu, J., Farri, O.: Neural paraphrase generation with stacked residual LSTM networks. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pp. 2923–2934 (2016)

    Google Scholar 

  60. Hasan, S.A., Liu, B., Liu, J., Qadir, A., Lee, K., Datla, V.V., Prakash, A., Farri, O.: Neural clinical paraphrase generation with attention. In: Proceedings of the Clinical Natural Language Processing Workshop, ClinicalNLP@COLING, pp. 42–53 (2016)

    Google Scholar 

  61. Choi, E., Bahadori, M.T., Sun, J.: Doctor AI: predicting clinical events via recurrent neural networks (2015). Preprint. ArXiv:1511.05942

    Google Scholar 

  62. Choi, E., Bahadori, M.T., Schuetz, A., Stewart, W.F., Sun, J.: Retain: interpretable predictive model in healthcare using reverse time attention mechanism (2016). CoRR: abs/1608.05745

    Google Scholar 

  63. Netto, S.M.B., de Paiva, A.C., de Almeida Neto, A., Silva, A.C., Leite, V.R.C.: Application on Reinforcement Learning for Diagnosis Based on Medical Image. INTECH Open Access Publisher, London (2008)

    Google Scholar 

  64. Poolla, R.: A reinforcement learning approach to obtain treatment strategies in sequential medical decision problems. Graduate Theses and Dissertations, University of South Florida (2003)

    Google Scholar 

  65. Shortreed, S.M., Laber, E., Lizotte, D.J., Stroup, T.S., Pineau, J., Murphy, S.A.: Informing sequential clinical decision-making through reinforcement learning: an empirical study. Mach. Learn. 84(1–2), 109–136 (2011)

    Article  MathSciNet  Google Scholar 

  66. Zhao, Y., Zeng, D., Socinski, M.A., Kosorok, M.R.: Reinforcement learning strategies for clinical trials in nonsmall cell lung cancer. Biometrics 67(4), 1422–1433 (2011)

    Article  MathSciNet  Google Scholar 

  67. Narasimhan, K., Kulkarni, T.D., Barzilay, R.: Language understanding for text-based games using deep reinforcement learning. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, EMNLP 2015, Lisbon, 17–21 Sept 2015, pp. 1–11 (2015)

    Google Scholar 

  68. Narasimhan, K., Yala, A., Barzilay, R.: Improving information extraction by acquiring external evidence with reinforcement learning (2016). Preprint. ArXiv:1603.07954

    Google Scholar 

  69. Prakash, A., Zhao, S., Hasan, S.A., Datla, V.V., Lee, K., Qadir, A., Liu, J., Farri, O.: Condensed memory networks for clinical diagnostic inferencing. In: Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, pp. 3274–3280 (2017)

    Google Scholar 

  70. Johnson, A.E.W., Pollard, T.J., Shen, L., Lehman, L.-W.H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L.A., Mark, R.G.: MIMIC-III, a freely accessible critical care database. Sci. Data 3, 160035 (2016)

    Article  Google Scholar 

  71. Ling, Y., Hasan, S.A., Datla, V., Qadir, A., Lee, K., Liu, J., Farri, O.: Diagnostic inferencing via improving clinical concept extraction with deep reinforcement learning: a preliminary study. In: Proceedings of the 2nd Machine Learning for Healthcare Conference, pp. 271–285 (2017)

    Google Scholar 

  72. Roberts, K., Simpson, M.S., Voorhees, E., Hersh, W.R.: Overview of the TREC 2015 clinical decision support track. In: TREC (2015)

    Google Scholar 

  73. Ling, Y., Hasan, S.A., Datla, V.V., Qadir, A., Lee, K., Liu, J., Farri, O.: Learning to diagnose: assimilating clinical narratives using deep reinforcement learning. In: Proceedings of the Eighth International Joint Conference on Natural Language Processing, IJCNLP, pp. 895–905 (2017)

    Google Scholar 

  74. Roberts, K., Demner-Fushman, D., Voorhees, E., Hersh, W.R.: Overview of the TREC 2016 clinical decision support track. In: TREC (2016)

    Google Scholar 

  75. Hasan, S.A., Zhao, S., Datla, V., Liu, J., Lee, K., Qadir, A., Prakash, A., Farri, O.: Clinical question answering using key-value memory networks and knowledge graph. In: TREC (2016)

    Google Scholar 

  76. Datla, V., Hasan, S.A., Qadir, A., Lee, K., Ling, Y., Liu, J., Farri, O.: Automated clinical diagnosis: the role of content in various sections of a clinical document. In: IEEE-BIBM International Workshop on Biomedical and Health Informatics (2017)

    Google Scholar 

  77. Pavlick, E., Rastogi, P., Ganitkevitch, J., Van Durme, B., Callison-Burch, C.: PPDB 2.0: better paraphrase ranking, fine-grained entailment relations, word embeddings, and style classification. In: Proceedings of ACL-IJCNLP, pp. 425–430 (2015)

    Google Scholar 

  78. Lindberg, D., Humphreys, B., McCray, A.: The unified medical language system. Methods Inf. Med. 32(4), 281–291 (1993)

    Article  Google Scholar 

  79. Adduru, V., Hasan, S.A., Liu, J., Ling, Y., Datla, V., Lee, K., Qadir, A., Farri, O.: Towards dataset creation and establishing baselines for sentence-level neural clinical paraphrase generation and simplification. In: Proceedings of the 3rd International Workshop on Knowledge Discovery in Healthcare Data (KDH) @ IJCAI-ECAI (2018)

    Google Scholar 

  80. Ionescu, B., Müller, H., Villegas, M., Arenas, H., Boato, G., Dang-Nguyen, D.-T., Cid, Y.D., Eickhoff, C., de Herrera, A.G.S., Gurrin, C., Islam, M.B., Kovalev, V., Liauchuk, V., Mothe, J., Piras, L., Riegler, M., Schwall, I.: Overview of ImageCLEF 2017: information extraction from images. In: Experimental IR Meets Multilinguality, Multimodality, and Interaction - 8th International Conference of the CLEF Association, CLEF Proceedings, pp. 315–337 (2017)

    Google Scholar 

  81. Eickhoff, C., Schwall, I., de Herrera, A.G.S., MĂĽller, H.: Overview of imageclefcaption 2017 - image caption prediction and concept detection for biomedical images. In: Working Notes of CLEF 2017 - Conference and Labs of the Evaluation Forum (2017)

    Google Scholar 

  82. Hasan, S.A., Ling, Y., Liu, J., Sreenivasan, R., Anand, S., Arora, T.R., Datla, V.V., Lee, K., Qadir, A., Swisher, C., Farri, O.: PRNA at imageclef 2017 caption prediction and concept detection tasks. In: Working Notes of CLEF 2017 - Conference and Labs of the Evaluation Forum (2017)

    Google Scholar 

  83. Hasan, S.A., Ling, Y., Liu, J., Sreenivasan, R., Anand, S., Arora, T.R., Datla, V., Lee, K., Qadir, A., Swisher, C., Farri, O.: Attention-based medical caption generation with image modality classification and clinical concept mapping. In: Proceedings of the 9th International Conference and Labs of the Evaluation Forum (CLEF) (2018)

    Google Scholar 

  84. Xu, K., Ba, J., Kiros, R., Cho, K., Courville, A., Salakhudinov, R., Zemel, R., Bengio, Y.: Show, attend and tell: neural image caption generation with visual attention. In: Proceedings of the 32nd International Conference on Machine Learning, vol. 37, pp. 2048–2057 (2015)

    Google Scholar 

  85. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition (2014). CoRR: abs/1409.1556

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sadid A. Hasan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Hasan, S.A., Farri, O. (2019). Clinical Natural Language Processing with Deep Learning. In: Consoli, S., Reforgiato Recupero, D., Petković, M. (eds) Data Science for Healthcare. Springer, Cham. https://doi.org/10.1007/978-3-030-05249-2_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-05249-2_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-05248-5

  • Online ISBN: 978-3-030-05249-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics