Skip to main content
Log in

Collaborative work with linear classifier and extreme learning machine for fast text categorization

  • Published:
World Wide Web Aims and scope Submit manuscript

Abstract

The bloom of Internet has made fast text categorization very essential. Generally, the popular methods have good classification accuracy but slow speed, and vice versa. This paper proposes a novel approach for fast text categorization, in which a collaborative work framework based on a linear classifier and an extreme learning machine (ELM) is constructed. The linear classifier, obtained by a modified non-negative matrix factorization algorithm, maps all documents from the original term space into the class space directly such that it performs classification very fast. The ELM, with good classification accuracy via some nonlinear and linear transformations, classifies a few of documents according to some given criteria to improve the classification quality of the total system. Experimental results show that the proposed method not only achieves good accuracy but also performs classification very fast, which improves the averaged speed by 180 % compared with its corresponding method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Cao, J., Lin, Z., Huang, G.B., Liu, N.: Voting based extreme learning machine. Inf. Sci. 185(1), 66–77 (2012)

    Article  MathSciNet  Google Scholar 

  2. De Souza, A., Pedroni, F., Oliveira, E., Ciarelli, P., Henrique, W., Veronese, L., Badue, C.: Automated multi-label text categorization with vg-ram weightless neural networks. Neurocomputing 72(10–12), 2209–2217 (2009)

    Article  Google Scholar 

  3. Gabrilovich, E., Markovitch, S.: Text categorization with many redundant features: using aggressive feature selection to make svms competitive with c4. 5. In: International Conference on Machine Learning, pp. 321–328 (2004)

  4. Henzinger, M., Chang, B.W., Milch, B., Brin, S.: Query-free news search. World Wide Web 8(2), 101–126 (2005)

    Article  Google Scholar 

  5. Hmeidi, I., Hawashin, B., El-Qawasmeh, E.: Performance of knn and svm classifiers on full word arabic articles. Adv. Eng. Inform. 22(1), 106–111 (2008)

    Article  Google Scholar 

  6. Huang, G., Chen, L.: Convex incremental extreme learning machine. Neurocomputing 70(16–18), 3056–3062 (2007)

    Article  Google Scholar 

  7. Huang, G., Ding, X., Zhou, H.: Optimization method based extreme learning machine for classification. Neurocomputing 74, 155–163 (2010)

    Article  Google Scholar 

  8. Huang, G., Wang, D., Lan, Y.: Extreme learning machines: a survey. Int. J. Mach. Learn. Cybern. 2(2), 107–122 (2011)

    Article  Google Scholar 

  9. Huang, G., Zhou, H., Ding, X., Zhang, R.: Extreme learning machine for regression and multi-class classification. IEEE Trans. Syst. Man Cybern. Part B 42(2), 513–529 (2011)

    Article  Google Scholar 

  10. Huang, G., Zhu, Q., Siew, C.: Extreme learning machine: a new learning scheme of feedforward neural networks. In: Proceedings of the International Joint Conference on Neural Networks, vol. 2, pp. 985–990 (2004)

  11. Huang, G., Zhu, Q., Siew, C.: Extreme learning machine: theory and applications. Neurocomputing 70(1–3), 489–501 (2006)

    Article  Google Scholar 

  12. Joachims, T.: Text categorization with support vector machines: learning with many relevant features. In: 10th European Conference on Machine Learning, pp. 137–142 (1998)

  13. Kumar, M.A., Gopal, M.: A comparison study on multiple binary-class svm methods for unilabel text categorization. Pattern Recogn. Lett. 31(11), 1437–1444 (2010)

    Article  Google Scholar 

  14. Lee, D.D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature 401(6755), 788–791 (1999)

    Article  Google Scholar 

  15. Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: Advances in Neural Information Processing Systems, pp. 556–562 (2001)

  16. Lewis, D., Ringuette, M.: A comparison of two learning algorithms for text categorization. In: Third Annual Symposium on Document Analysis and Information Retrieval, vol. 33, pp. 81–93 (1994)

  17. Li, W., Zhong, N., Yao, Y., Liu, J.: An operable email based intelligent personal assistant. World Wide Web 12(2), 125–147 (2009)

    Article  Google Scholar 

  18. Liu, Y., Loh, H., Tor, S.: Comparison of extreme learning machine with support vector machine for text classification. Innov. Appl. Artif. Intell. 3533, 390–399 (2005)

    Article  Google Scholar 

  19. Man, Z., Lee, K., Wang, D., Cao, Z., Khoo, S.: Robust single-hidden layer feedforward network-based pattern classifier. IEEE Trans. Neural Netw. Learning Syst. 23(12), 1974–1986 (2012)

    Article  Google Scholar 

  20. Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manag. 24(5), 513–523 (1988)

    Article  Google Scholar 

  21. Savitha, R., Suresh, S., Sundararajan, N.: Fast learning circular complex-valued extreme learning machine (cc-elm) for real-valued classification problems. Inf. Sci. 187(1), 277–290 (2012)

    Article  MathSciNet  Google Scholar 

  22. Sebastiani, F.: Machine learning in automated text categorization. ACM Comput. Surv. 34(1), 1–47 (2002)

    Article  Google Scholar 

  23. Silva, C., Ribeiro, B.: Knowledge extraction with non-negative matrix factorization for text classification. In: Proceedings of Intelligent Data Engineering and Automated Learning, vol. 5788, pp. 300–308 (2009)

  24. Soucy, P., Mineau, G.: A simple knn algorithm for text categorization. In: International Conference on Data Mining, pp. 647–648 (2001)

  25. Wang, X., Chen, A., Feng, H.: Upper integral network with extreme learning mechanism. Neurocomputing 74(16), 2520–2525 (2011)

    Article  Google Scholar 

  26. Xing, H.J., Wang, X.M.: Training extreme learning machine via regularized correntropy criterion. Neural Comput. Appl. (2012). doi:10.1007/s00521-012-1184-y

    Google Scholar 

  27. Yang, Y., Chute, C.: An example-based mapping method for text categorization and retrieval. ACM Trans. Inf. Syst. 12(3), 252–277 (1994)

    Article  Google Scholar 

  28. Yeung, D.S., Ng, W.W., Wang, D., Tsang, E.C., Wang, X.Z.: Localized generalization error model and its application to architecture selection for radial basis function neural network. IEEE Trans. Neural Netw. 18(5), 1294–1305 (2007)

    Article  Google Scholar 

  29. Zakos, J., Verma, B.: A novel context-based technique for web information retrieval. World Wide Web 9(4), 485–503 (2006)

    Article  Google Scholar 

  30. Zhang, R., Huang, G., Sundararajan, N., Saratchandran, P.: Multicategory classification using an extreme learning machine for microarray gene expression cancer diagnosis. IEEE/ACM Trans. Comput. Biol. Bioinf. 4(3), 485–495 (2007)

    Article  Google Scholar 

  31. Zhang, T., Oles, F.: Text categorization based on regularized linear classification methods. Inf. Retr. 4, 5–31 (2001)

    Article  MATH  Google Scholar 

  32. Zheng, W., Qian, Y., Lu, H.: Text categorization based on regularization extreme learning machine. Neural Comput. Appl. 22(3), 447–456 (2013)

    Article  Google Scholar 

  33. Zheng, W., Zhang, H., Qian, Y.: Fast text categorization based on collaborative work in the semantic and class spaces. In: International Conference on Machine Learning and Cybernetics, vol. 4, pp. 1497–1502 (2011)

  34. Zhu, Q., Qin, A., Suganthan, P., Huang, G.: Evolutionary extreme learning machine. Pattern Recogn. 38(10), 1759–1763 (2005)

    Article  MATH  Google Scholar 

  35. Zong, W., Huang, G.B.: Face recognition based on extreme learning machine. Neurocomputing 74(16), 2541–2551 (2011)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wenbin Zheng.

Additional information

This work was supported by the 973 Program (No. 2012CB316400), and the National Natural Science Foundation of China (No. 11202202, No. 61272315 and No. 61171151), and the Natural Science Foundation of Zhejiang Province (No. Y6110147).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zheng, W., Tang, H. & Qian, Y. Collaborative work with linear classifier and extreme learning machine for fast text categorization. World Wide Web 18, 235–252 (2015). https://doi.org/10.1007/s11280-013-0225-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11280-013-0225-5

Keywords

Navigation