Skip to main content

Exploring Pattern Structures of Syntactic Trees for Relation Extraction

  • Conference paper
  • First Online:
Formal Concept Analysis (ICFCA 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9113))

Included in the following conference series:

Abstract

In this paper we explore the possibility of defining an original pattern structure for managing syntactic trees. More precisely, we are interested in the extraction of relations such as drug-drug interactions (DDIs) in medical texts where sentences are represented as syntactic trees. In this specific pattern structure, called STPS, the similarity operator is based on rooted tree intersection. Moreover, we introduce “Lazy Pattern Structure Classification” (LPSC), which is a symbolic method able to extract and classify DDI sentences w.r.t. STPS. To decrease computation time, a projection and a set of tree-simplification operations are proposed. We evaluated the method by means of a 10-fold cross validation on the corpus of the DDI extraction challenge 2011, and we obtained very encouraging results that are reported at the end of the paper.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The maximal trees from a set X are all trees of X that are not a rooted subtree of another tree in X.

References

  1. Balcázar, J.L., Bifet, A., Lozano, A.: Intersection algorithms and a closure operator on unordered trees. In: MLG, p. 1 (2006)

    Google Scholar 

  2. Björne, J., Airola, A., Pahikkala, T., Salakoski, T.: Drug-drug interaction extraction from biomedical texts with svm and rls classifiers. In: Proceedings of DDIExtraction-2011 challenge task, pp. 35–42 (2011)

    Google Scholar 

  3. Chowdhury, F.M., Abacha, A.B., Lavelli, A., Zweigenbaum, P.: Two different machine learning techniques for drug-drug interaction extraction. In: Challenge Task on Drug-Drug Interaction Extraction, pp. 19–26 (2011)

    Google Scholar 

  4. Chowdhury, M.F.M., Lavelli, A.: Drug-drug interaction extraction using composite kernels. In: Challenge Task on Drug-Drug Interaction Extraction, pp. 27–33 (2011)

    Google Scholar 

  5. Galitsky, B.A., Ilvovsky, D., Kuznetsov, S.O., Strok, F.: Finding maximal common sub-parse thickets for multi-sentence search. In: Croitoru, M., Rudolph, S., Woltran, S., Gonzales, C. (eds.) GKR 2013. LNCS, vol. 8323, pp. 39–57. Springer, Heidelberg (2014)

    Chapter  Google Scholar 

  6. Ganter, B., Kuznetsov, S.O.: Pattern structures and their projections. In: Delugach, H.S., Stumme, G. (eds.) ICCS 2001. LNCS (LNAI), vol. 2120, pp. 129–142. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  7. Garcia-Blasco, S., Mola-Velasco, S.M., Danger, R., Rosso, P.: Automatic drug-drug interaction detection: A machine learning approach with maximal frequent sequence extraction. In: Challenge Task on Drug-Drug Interaction Extraction, pp. 51–58 (2011)

    Google Scholar 

  8. Klein, D., Manning, C.D.: Accurate unlexicalized parsing. In: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, vol. 1, pp. 423–430. Association for Computational Linguistics (2003)

    Google Scholar 

  9. Kuznetsov, S.O.: Fitting pattern structures to knowledge discovery in big data. In: Cellier, P., Distel, F., Ganter, B. (eds.) ICFCA 2013. LNCS, vol. 7880, pp. 254–266. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  10. Kuznetsov, S.O., Samokhin, M.V.: Learning closed sets of labeled graphs for chemical applications. In: Kramer, S., Pfahringer, B. (eds.) ILP 2005. LNCS (LNAI), vol. 3625, pp. 190–208. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  11. Minard, A.L., Makour, L., Ligozat, A.L., Grau, B.: Feature selection for drug-drug interaction detection using machine-learning based approaches. In: Challenge Task on Drug-Drug Interaction Extraction, pp. 43–50 (2011)

    Google Scholar 

  12. Segura-Bedmar, I., Martınez, P., Sánchez-Cisneros, D.: The 1st DDIExtraction-2011 challenge task: extraction of drug-drug interactions from biomedical texts. Challenge Task on Drug-Drug interaction extraction 2011, 1–9 (2011)

    Google Scholar 

  13. Socher, R., Bauer, J., Manning, C.D., Ng, A.Y.: Parsing with compositional vector grammars. In: Proceedings of the ACL conference, Citeseer (2013)

    Google Scholar 

  14. Thomas, P., Neves, M., Solt, I., Tikk, D., Leser, U.: Relation extraction for drug-drug interactions using ensemble learning. In: Challenge Task on Drug-Drug Interaction Extraction, pp. 11–18 (2011)

    Google Scholar 

  15. Wille, R.: Restructuring lattice theory: an approach based on hierarchies of concepts. In: Ferré, S., Rudolph, S. (eds.) ICFCA 2009. LNCS, vol. 5548, pp. 314–339. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Artuur Leeuwenberg .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Leeuwenberg, A., Buzmakov, A., Toussaint, Y., Napoli, A. (2015). Exploring Pattern Structures of Syntactic Trees for Relation Extraction. In: Baixeries, J., Sacarea, C., Ojeda-Aciego, M. (eds) Formal Concept Analysis. ICFCA 2015. Lecture Notes in Computer Science(), vol 9113. Springer, Cham. https://doi.org/10.1007/978-3-319-19545-2_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-19545-2_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-19544-5

  • Online ISBN: 978-3-319-19545-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics