The Research and Construction of Complaint Orders Classification Corpus in Mobile Customer Service

Xu, Junli; Zhao, Jiangjiang; Zhao, Ning; Xue, Chao; Fan, Linbo; Qi, Zechuan; Wei, Qiang

doi:10.1007/978-3-319-99501-4_31

Junli Xu¹⁸,
Jiangjiang Zhao¹⁸,
Ning Zhao¹⁸,
Chao Xue¹⁸,
Linbo Fan¹⁸,
Zechuan Qi¹⁸ &
…
Qiang Wei¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11109))

Included in the following conference series:

CCF International Conference on Natural Language Processing and Chinese Computing

2032 Accesses
4 Citations

Abstract

Complaint orders in mobile customer service are the records of complaint description, which professional knowledge and information on customer’s complaint intention are kept. Complaint orders classification is important and necessary to be established and completed for further mining, analysis and improve the quality of customer service. Constructed corpus is the basis of research. The lack of complaint orders classification corpus (COCC) in mobile customer service has limited the research of complaint orders classification. This paper first employs K-means algorithm and professional knowledge to determine complaint orders classification labels. Then we craft the annotation rules for complaint orders, and then construct complaint orders classification corpus. The corpus consists of 130044 complaint orders annotated. Finally, we statistically analyze the corpus constructed, and the agreement of each question class reaches over 91%. It indicates that the corpus constructed could provide a great support for complaint orders classification and specialized analysis.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Available at https://www.mturk.com/.
2.
Available at https://www.crowdflower.com.
3.
Available at https://github.com/HIT-SCIR/ltp.
4.
Available at https://code.google.com/p/word2vec/.
5.
Available at https://github.com/zhng1200/COCC.

References

Lowe, R., Pow, N., Serban, I.V., Pineau, J.: The Ubuntu Dialogue Corpus: A Large Dataset for Research in Unstructured Multi-turn Dialogue Systems. arXiv preprint arXiv:1506.08909 (2015)
Hu, B.T., Chen, Q.C., Zhu, F.Z.: Lcsts: A Large Scale Chinese Short Text Summarization Dataset. arXiv preprint arXiv:1506.05865 (2015)
Yang, L.Y.: Research on the establishment and applications of public sentiment corpus based on micro-blog information. Office Inform. 22, 015 (2016)
Google Scholar
Xi, X.F., Zhu, X.M., Sun, Q.Y., Zhou, G.D.: Corpus construction for chinese discourse topic via micro-topic scheme. J. Comput. Res. Develop. 54(8), 1833–1852 (2017)
Google Scholar
Xue, N.W., Chiou, F.D., Palmer, M.: Building a large-scale annotated Chinese corpus. In: Proceedings of COLING, pp. 1–8. ACL, Taipei (2002)
Google Scholar
Aksan, Y., Aksan, M., Koltuksuz, A.: Construction of the Turkish National Corpus (TNC). In: Proceedings of LREC, pp. 3223–3227. European Language Resources Association, Istanbul (2012)
Google Scholar
You, Z.Y., Wang, Y.Q., Shu, H.P.: A corpus-based TCM symptoms of speech tagging. Electron. Technol. Softw. Eng. 21, 177–178 (2017)
Google Scholar
Brockett, C., Dolan, W.B.: Support Vector Machines for Paraphrase Identification and Corpus Construction. In: Proceedings of IWP, pp. 1–8. IWP, Iowa (2005)
Google Scholar
Dolan, W.B., Brockett, C.: Automatically constructing a corpus of sentential paraphrases. In: Proceedings of IWP, pp. 9–16. IWP, Iowa (2005)
Google Scholar
Vincze, V., Szarvas, G., Farkas, R.: The bioscope corpus: biomedical texts annotated for uncertainty, negation and their scopes. BMC Bioinformatics 9(Suppl. 11), S9 (2008)
Article Google Scholar
Zou, B.W., Zhu, Q.M., Zhou, G.D.: Negation and speculation identification in Chinese language. In: Proceedings of ACL-IJCNLP, pp. 656–665. ACL, Beijing (2015)
Google Scholar
Zhou, H.W., Yang, H., Xu, J.L., Kang, S.Y.: Construction of Chinese hedge scope corpus. J. Chin. Inf. Process. 31(3), 77–85 (2017)
Google Scholar
Yin, P.F., Liu, Z., Xu, A.B.: Tone analyzer for online customer service: an unsupervised model with interfered training. In: Proceedings of CIKM, pp. 1887–1895. ACM, Singapore (2017)
Google Scholar
Quan, C.Q., Ren, F.J.: Construction of a blog emotion corpus for chinese emotional expression analysis. In: Proceedings of EMNLP, pp. 1446–1454. ACL, Singapore (2009)
Google Scholar
Chen, J., Nie, J.Y.: Automatic construction of parallel English-Chinese corpus for cross-language information retrieval. In: Proceedings of ANLP, pp. 21–28. ACL, Stroudsburg (2000)
Google Scholar
Feng, G.J., Yu, L., Tian, S.W.: Auto construction of uyghur emotional words corpus based on CRFs. Data Anal. Knowl. Discov. 27(3), 17–21 (2011)
Google Scholar
Yang, J.F., Guan, Y., He, B., Qu, C.Y., Yu, Q.B., Liu, Y.X., Zhao, Y.J.: Corpus construction for named entities and entity relations on chinese electronic medical records. J. Softw. 27(11), 2725–2746 (2016)
Google Scholar
Artstein, R., Poesio, M.: Inter-coder agreement for computational linguistics. Comput. Linguist. 34(4), 555–596 (2008)
Article Google Scholar

Download references

Author information

Authors and Affiliations

IT System Department, China Mobile Online Services Company Limited, Zhengzhou, Henan, China
Junli Xu, Jiangjiang Zhao, Ning Zhao, Chao Xue, Linbo Fan, Zechuan Qi & Qiang Wei

Authors

Junli Xu
View author publications
You can also search for this author in PubMed Google Scholar
Jiangjiang Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Ning Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Chao Xue
View author publications
You can also search for this author in PubMed Google Scholar
Linbo Fan
View author publications
You can also search for this author in PubMed Google Scholar
Zechuan Qi
View author publications
You can also search for this author in PubMed Google Scholar
Qiang Wei
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Junli Xu .

Editor information

Editors and Affiliations

Soochow University, Suzhou, China
Min Zhang
The University of Texas at Dallas, Richardson, Texas, USA
Vincent Ng
Peking University, Beijing, China
Dongyan Zhao
Peking University, Beijing, China
Sujian Li
Zhengzhou University, Zhengzhou, China
Hongying Zan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xu, J. et al. (2018). The Research and Construction of Complaint Orders Classification Corpus in Mobile Customer Service. In: Zhang, M., Ng, V., Zhao, D., Li, S., Zan, H. (eds) Natural Language Processing and Chinese Computing. NLPCC 2018. Lecture Notes in Computer Science(), vol 11109. Springer, Cham. https://doi.org/10.1007/978-3-319-99501-4_31

Download citation

DOI: https://doi.org/10.1007/978-3-319-99501-4_31
Published: 14 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99500-7
Online ISBN: 978-3-319-99501-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

the China Computer Federation (CCF) (opens in a new tab)