Skip to main content

Named Entity Recognition in Local Intent Web Search Queries

  • Conference paper
  • First Online:
Database and Expert Systems Applications (DEXA 2019)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11706))

Included in the following conference series:

  • 1468 Accesses

Abstract

Semantic understanding of web queries is a challenging problem as web queries are short, noisy and usually do not observe the grammar of a written language. In this paper, we specifically study the user web search queries with local intent on Bing. Local intent queries deal with searching for local businesses and services in a location. Hence, local query parsing translates into the classical problem of Named Entity Recognition (NER) in NLP. State-of-the-art NER systems rely heavily on hand-crafted features and domain-specific knowledge to effectively learn from the small, supervised training corpora that is available. In this paper, we use deep learnt neural model that relies solely on features extracted from word embeddings learnt in an unsupervised way, using search logs. We propose a novel technique for generating domain specific embeddings and show that they significantly improve the performance of existing models for the NER task. Our model outperforms the existing CRF based parser currently used in production.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural architectures for named entity recognition. In: Proceeding of NAACL-HLT (2016)

    Google Scholar 

  2. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient Estimation of Word Representation in Vector Space (2013). https://arxiv.org/abs/1301.3781

  3. Joulin, A., Grave, E., Bojanowski, P., Mikolov, T.: Bag of tricks for efficient text classification. In: Proceedings of European Chapter of the Association for Computational Linguistics (EACL) (2017)

    Google Scholar 

  4. Pennington, J., Socher, R., Manning, C.D.: GloVe: global vectors for word representation. In: Proceeding of Empirical Methods in Natural Language Processing (EMNLP) (2014)

    Google Scholar 

  5. Huang, Z., Xu, W., Yu, K.: Bidirectional LSTM-CRF models for Sequence Tagging (2015). https://arxiv.org/abs/1508.01991

  6. Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. (JMLR) 12, 2493–2537 (2011)

    MATH  Google Scholar 

  7. Cowan, B., Zethelius, S., Luk, B., Baras, T., Ukarde, P., Zhang, D.: Named entity recognition in travel-related search queries. In: Proceedings of the Twenty-Seventh Conference on Innovative Applications of Artificial Intelligence (2015)

    Google Scholar 

  8. Blei, D.M., Ng, A.Y., Jordan, M.I., Lafferty, J.: Latent dirichlet allocation. J. Mach. Learn. Res. (JMLR) 3, 993–1022 (2003)

    MATH  Google Scholar 

  9. Lafferty, J., McCallum, A., Pereira, F.C.N.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of the 18th International Conference on Machine Learning (2001)

    Google Scholar 

  10. Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)

    Article  Google Scholar 

  11. Eiselt, A., Figueroa. A.: A two-step named entity recognizer for open-domain search queries. In: Proceedings of the Fourth International Joint Conference on Natural Language Processing (IJCNLP 2013), pp. 829–833 (2013)

    Google Scholar 

  12. Harris, Z.: Distributional structure. Word 10(23), 146–162 (1954)

    Article  Google Scholar 

  13. Pasca, M.: Weakly-supervised discovery of named entities using web search queries. In: Proceedings of the Sixteenth ACM Conference on Conference on Information and Knowledge Management - CIKM 2007, pp. 683, New York (2007)

    Google Scholar 

  14. Du, J., Zhang, Z., Yan, J., Cui, Y., Chen, Z.: Using search session context for named entity recognition in query. In: Proceeding of the 33rd SIGIR (2010)

    Google Scholar 

  15. Guo, J., Xu, G., Cheng, X., Li, H.: 2009 named entity recognition in query. In: Proceedings of the 32nd SIGIR (2009)

    Google Scholar 

  16. Kingma, D.P., Ba, J.: Adam. A Method for Stochastic Optimization (2017). https://arxiv.org/abs/1412.6980

  17. Firth, J.R.: A synopsis of linguistic theory 1930–1955. In: Studies in Linguistic Analysis, pp. 1–32. Oxford Philological Society, Selected Papers of 1952–1959, London (1968)

    Google Scholar 

  18. Glorot, X., Bengio, Y.: Understanding the difficulty in training deep feedforward neural networks. In: Proceedings of the 13th International Conference on Artificial Intelligence and Statistics (AISTATS), Sardinia, Italy (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Manoj K. Agarwal .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Mittal, S., Agarwal, M.K. (2019). Named Entity Recognition in Local Intent Web Search Queries. In: Hartmann, S., Küng, J., Chakravarthy, S., Anderst-Kotsis, G., Tjoa, A., Khalil, I. (eds) Database and Expert Systems Applications. DEXA 2019. Lecture Notes in Computer Science(), vol 11706. Springer, Cham. https://doi.org/10.1007/978-3-030-27615-7_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-27615-7_31

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-27614-0

  • Online ISBN: 978-3-030-27615-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics