Extracting Methodological Sentences from Unstructured Abstracts of Academic Articles

  • Conference paper
Sustainable Digital Communities (iConference 2020)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 12051)

Abstract

A methodological sentence is the smallest unit that describes how a research method is used in a paper. Researchers can understand a method by reading these sentences, so extracting them automatically helps researchers evaluate and select appropriate methods for their own work. However, previous studies rely heavily on manually annotated corpora, which are limited in size, and some approaches generalize poorly to test sets. In this paper, we use structured abstracts as training data to reduce the burden of manual annotation: the label for each sentence is determined by the heading of the abstract section in which it appears. Moreover, to extract methodological sentences more precisely, a rule-based method is applied to prune the prediction results. In our experiments, the precision (P), recall (R), and F1 values after pruning are 65.14%, 57.00%, and 60.80%, respectively, all higher than the corresponding values without pruning.
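The labeling idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the heading vocabulary `METHOD_HEADINGS`, the function names, and the naive regex sentence splitter are all assumptions made for illustration. The helper `f1_score` applies the standard harmonic-mean formula and can be used to check that the reported P and R are consistent with the reported F1.

```python
import re

# Hypothetical heading vocabulary; the abstract does not list the actual headings used.
METHOD_HEADINGS = {"method", "methods", "methodology", "design", "approach"}


def label_structured_abstract(sections):
    """Weakly label sentences from a structured abstract.

    sections: list of (heading, text) pairs.
    Returns a list of (sentence, label) pairs, where label is 1 if the
    sentence sits under a method-like heading and 0 otherwise.
    """
    labeled = []
    for heading, text in sections:
        normalized = heading.strip().lower().rstrip(":")
        label = 1 if normalized in METHOD_HEADINGS else 0
        # Naive splitter (assumption): break after ., ! or ? followed by whitespace.
        for sentence in re.split(r"(?<=[.!?])\s+", text.strip()):
            if sentence:
                labeled.append((sentence, label))
    return labeled


def f1_score(precision, recall):
    """Harmonic mean of precision and recall (same units in and out)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    example = [
        ("Purpose", "We study sentence extraction. It supports method selection."),
        ("Methods:", "We train a classifier. We prune predictions with rules."),
    ]
    for sentence, label in label_structured_abstract(example):
        print(label, sentence)
    # Consistency check on the reported scores after pruning.
    print(round(f1_score(65.14, 57.00), 2))
```

A production pipeline would likely replace the regex splitter with a trained sentence tokenizer and would need a pruning step, whose rules the abstract does not specify.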

Acknowledgment

This work is supported by the Major Projects of the National Social Science Fund (No. 17ZDA291) and the Postgraduate Research & Practice Innovation Program of Jiangsu Province (No. KYCX19_0246).

Author information

Corresponding author

Correspondence to Chengzhi Zhang.

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Cite this paper

Wang, R., Zhang, C., Zhang, Y., Zhang, J. (2020). Extracting Methodological Sentences from Unstructured Abstracts of Academic Articles. In: Sundqvist, A., Berget, G., Nolin, J., Skjerdingstad, K. (eds) Sustainable Digital Communities. iConference 2020. Lecture Notes in Computer Science, vol 12051. Springer, Cham. https://doi.org/10.1007/978-3-030-43687-2_66

  • DOI: https://doi.org/10.1007/978-3-030-43687-2_66

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-43686-5

  • Online ISBN: 978-3-030-43687-2

  • eBook Packages: Computer Science, Computer Science (R0)
