Diverse Demands Estimation and Ranking Based on User Behaviors

Chen, Liandong; Li, Shigang; Zhou, Chunbao; Liu, Fang; Xu, Rui; Li, Shuo; Wang, Jue; Zhang, Boyao

doi:10.1007/978-981-32-9987-0_7

Liandong Chen¹¹,
Shigang Li¹³,
Chunbao Zhou¹²,
Fang Liu¹²,
Rui Xu¹¹,
Shuo Li^12,14,
Jue Wang¹² &
…
Boyao Zhang^12,14

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 913))

Included in the following conference series:

International Workshop on High Performance Computing for Advanced Modeling and Simulation in Nuclear Energy and Environmental Science
International Workshop on HPC Supported Data Analytics for Edge Computing

608 Accesses

Abstract

In the big data era, users can get massive information from the Internet, but the value density is very low. In order to help users find the information they need more quickly, this paper presents the mechanism of diverse demands estimation and ranking based on user behaviors. Firstly, a definition of classification system for users query intent is proposed. Secondly, in order to mine the documents on the websites of specific classification, LDA model is used to cluster and annotate the websites. To speed up the inference process of LDA, we take advantage of MPI and OpenMP hybrid parallelism techniques to reduce both internode and intra-node communication cost. Lastly, according to the historical behaviors of users and the search engine return results, we rank the classifications on Map-Reduce platform and present the top-ranking ones to users

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Asuncion, A.U., Smyth, P., Welling, M.: Asynchronous distributed learning of topic models. In: Proceedings of International Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, December, pp. 81–88. DBLP (2008)
Google Scholar
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
MATH Google Scholar
Chandra, R., Dagum, L., Kohr, D., Maydan, D., Mcdonald, J.: Parallel Programming in OpenMP. Morgan Kaufmann Publishers, Burlington (2001)
Google Scholar
Dagum, L., Enon, R.: OpenMP: an industry-standard API for shared-memory programming. IEEE Comput. Sci. Eng. 5, 46–55 (1998)
Article Google Scholar
Forum, M.P.: MPI: A Message-passing interface standard. University of Tennessee (1994)
Google Scholar
Gabriel, E., Fagg, G.E., Bosilca, G., et al.: Open MPI: goals, concept, and design of a next generation MPI implementation. In: Proceedings of Recent Advances in Parallel Virtual Machine and Message Passing Interface, European Pvm/mpi Users’ Group Meeting, Budapest, Hungary, 19–22 September 2004, Proceedings, pp. 97–104. DBLP (2004)
Google Scholar
Griffiths, T.L., Steyvers, M.: Finding scientific topics. Proc. Natl. Acad. Sci. U.S.A 101(Suppl. 1), 5228–5235 (2004)
Article Google Scholar
Gropp, W., Lusk, E., Skjellum, A.: Using MPI: Portable Parallel Programming with the Message-Passing Interface. MIT Press, Cambridge (1994)
MATH Google Scholar
Li, S., Zhang, Y., Hoefler, T.: Cache-oblivious MPI all-to-all communications based on Morton order. IEEE Trans. Parallel Distrib. Syst. PP(99), 1 (2018)
Google Scholar
Li, S., Zhang, Y., Hoefler, T.: Cache-oblivious MPI all-to-all communications on many-core architectures. ACM SIGPLAN Not. 52(8), 445–446 (2017)
Article Google Scholar
Liu, Z., Zhang, Y., Chang, E.Y.: Sun, M: PLDA+: parallel latent dirichlet allocation with data placement and pipeline processing. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 26 (2011)
Google Scholar
Newman, D., Asuncion, A.U., Smyth, P., Welling, M.: Distributed inference for Latent Dirichlet allocation. In: Proceedings of International Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, pp. 1–6. DBLP, December 2007
Google Scholar
Rabenseifner, R., Hager, G., Jost, G.: Hybrid MPI/OpenMP parallel programming on clusters of multi-core SMP nodes. In: Proceedings of Euromicro International Conference on Parallel, Distributed and Network-Based Processing, pp. 427–436. IEEE (2009)
Google Scholar
Tora, S., Eguchi, K.: MPI/OpenMP hybrid parallel inference for Latent Dirichlet allocation. In: Proceedings of The Workshop on Large Scale Data Mining: Theory and Applications, p. 5. ACM (2011)
Google Scholar
Vrenios, A.: Parallel programming in C with MPI and OpenMP [book review]. IEEE Distrib. Syst. Online 5(1), 7.1–7.3 (2004)
Google Scholar
Wang, J., Liu, C., Huang, Y.: Auto tuning for new energy dispatch problem: a case study. Futur. Gener. Comput. Syst. 54(C), 501–506 (2016)
Article Google Scholar
Wang, J., Gao, F., Vazquez-Poletti, J.L., Li, J.: Preface of high performance computing or advanced modeling and simulation of materials. Comput. Phys. Commun. 211, 1 (2017)
Article Google Scholar
Wang, Yi, Bai, Hongjie, Stanton, Matt, Chen, Wen-Yen, Chang, Edward Y.: PLDA: parallel Latent Dirichlet allocation for large-scale applications. In: Goldberg, Andrew V., Zhou, Yunhong (eds.) AAIM 2009. LNCS, vol. 5564, pp. 301–314. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-02158-9_26
Chapter Google Scholar
Wu, B., Li, S., Zhang, Y., et al.: A hybrid optimization strategy for the communication of large-scale Kinetic Monte Carlo simulation. Comput. Phys. Commun. (2016)
Google Scholar
Zhang, Y., Li, S., Yan, S., et al.: A cross-platform SpMV framework on many-core architectures. ACM Trans. Arch. Code Optim. 13(4), 33 (2016)
MathSciNet Google Scholar
Jiang, C., Han, G., Lin, J., Jia, G., Shi, W., Wan, J.: Characteristics of co-allocated online services and batch jobs in Internet data centers: a case study from Alibaba cloud. IEEE Access 7, 22495–22508. https://doi.org/10.1109/access.2019.2897898
Article Google Scholar
Qiu, Y., Jiang, C., Wang, Y., Ou, D., Li, Y., Wan, J.: Energy aware virtual machine scheduling in data centers. Energies 12, 646 (2019)
Article Google Scholar
Jiang, C., et al.: Interdomain I/O optimization in virtualized sensor networks. Sensors 18, 4395 (2018)
Article Google Scholar

Download references

Acknowledgements

This work is jointly supported by Grant 2017YFB0203504 in the National Major Research High Performance Computing Program of China, and the State Key Program of National Natural Science Foundation of China (No. 91530324).

Author information

Authors and Affiliations

State Grid Hebei Electric Power Supply Co., Ltd., Shijiazhuang, China
Liandong Chen & Rui Xu
Computer Network Information Center, Chinese Academy of Science, Beijing, China
Chunbao Zhou, Fang Liu, Shuo Li, Jue Wang & Boyao Zhang
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
Shigang Li
University of Chinese Academy of Sciences, Beijing, China
Shuo Li & Boyao Zhang

Authors

Liandong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Shigang Li
View author publications
You can also search for this author in PubMed Google Scholar
Chunbao Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Fang Liu
View author publications
You can also search for this author in PubMed Google Scholar
Rui Xu
View author publications
You can also search for this author in PubMed Google Scholar
Shuo Li
View author publications
You can also search for this author in PubMed Google Scholar
Jue Wang
View author publications
You can also search for this author in PubMed Google Scholar
Boyao Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fang Liu .

Editor information

Editors and Affiliations

University of Science and Technology, Beijing, China
Changjun Hu
China Institute of Atomic Energy, Beijing, China
Wen Yang
Hangzhou Dianzi University, Hangzhou, China
Congfeng Jiang
Computer Science, Texas Tech University, Lubbock, USA
Dong Dai

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, L. et al. (2019). Diverse Demands Estimation and Ranking Based on User Behaviors. In: Hu, C., Yang, W., Jiang, C., Dai, D. (eds) High-Performance Computing Applications in Numerical Simulation and Edge Computing. HPCMS HiDEC 2018 2018. Communications in Computer and Information Science, vol 913. Springer, Singapore. https://doi.org/10.1007/978-981-32-9987-0_7

Download citation

DOI: https://doi.org/10.1007/978-981-32-9987-0_7
Published: 29 August 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-32-9986-3
Online ISBN: 978-981-32-9987-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics