Positive Unlabeled Link Prediction via Transfer Learning for Gene Network Reconstruction

Mignone, Paolo; Pio, Gianvito

doi:10.1007/978-3-030-01851-1_2

Paolo Mignone¹⁸ &
Gianvito Pio¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11177))

Included in the following conference series:

International Symposium on Methodologies for Intelligent Systems

992 Accesses
5 Citations

Abstract

Transfer learning can be employed to leverage knowledge from a source domain in order to better solve tasks in a target domain, where the available data is exiguous. While most of the previous papers work in the supervised setting, we study the more challenging case of positive-unlabeled transfer learning, where few positive labeled instances are available for both the source and the target domains. Specifically, we focus on the link prediction task on network data, where we consider known existing links as positive labeled data and all the possible remaining links as unlabeled data. In many real applications (e.g., in bioinformatics), this usually leads to few positive labeled data and a huge amount of unlabeled data. The transfer learning method proposed in this paper exploits the unlabeled data and the knowledge of a source network in order to improve the reconstruction of a target network. Experiments, conducted in the biological field, showed the effectiveness of the proposed approach with respect to the considered baselines, when exploiting the Mus Musculus gene network (source) to improve the reconstruction of the Homo Sapiens Sapiens gene network (target).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
The less the distance between an unlabeled example and the hyperplane, the higher the probability of the existence of the link.
2.
https://thebiogrid.org.

References

Platt, J.C.: Probabilistic outputs for support vector machine and comparisons to regularized likelihood methods. In: Advances in Large Margin Classifiers (1999)
Google Scholar
Ceci, M., Pio, G., Kuzmanovski, V., Džeroski, S.: Semi-supervised multi-view learning for gene network reconstruction. Plos One, 10(12), e0144031 (2015)
Article Google Scholar
Dai, W., Yang, Q., Xue, G., Yu, Y.: Boosting for transfer learning. In: Proceedings of ICML, pp. 193–200 (2007)
Google Scholar
Elkan, C., Noto, K.: Learning classifiers from only positive and unlabeled data. In: Proceedings of ACM SIGKDD, pp. 213–220 (2008)
Google Scholar
Jowkar, G., Mansoori, E.: Perceptron ensemble of graph-based positive unlabeled learning for disease gene identification. Comput. Biol. Chem. 64, 263–270 (2016)
Article MathSciNet Google Scholar
Levatic, J., Ceci, M., Kocev, D., Dzeroski, S.: Self-training for multi-target regression with tree ensembles. Knowl. Based Syst. 123, 41–60 (2017)
Article Google Scholar
Levatic, J., Kocev, D., Ceci, M., Dzeroski, S.: Semi-supervised trees for multi-target regression. Inf. Sci. 450, 109–127 (2018)
Article MathSciNet Google Scholar
Liu, B., Lee, W.S., Yu, P.S., Li, X.: Partially supervised classification of text documents. In: Proceedings of ICML, pp. 387–394 (2002)
Google Scholar
Marbach, D., et al.: Wisdom of crowds for robust gene network inference. Nat. Meth. 9(8), 796–804 (2016)
Article Google Scholar
Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2010)
Article Google Scholar
Pan, S.J., Zheng, V.W., Yang, Q., Hu, D.H.: Transfer learning for wifi-based indoor localization. In: Workshop on Transfer Learning for Complex Task AAAI (2008)
Google Scholar
Pio, G., Ceci, M., Malerba, D., D’Elia, D.: ComiRNet:a web-based system for the analysis of miRNA-gene regulatory networks. BMC Bioinform. 16(S-9), S7 (2015)
Article Google Scholar
Pio, G., Malerba, D., D’Elia, D., Ceci, M.: Integrating microRNA target predictions for the discovery of gene regulatory networks: a semi-supervised ensemble learning approach. BMC Bioinform. 15(S-1), S4 (2014)
Article Google Scholar
Weiss, K.R., Khoshgoftaar, T.M., Wang, D.: A survey of transfer learning. J. Big Data 3, 9 (2016)
Article Google Scholar
Yang, X., Song, Q., Wand, Y.: A weighted support vector machine for data classification. Int. J. Pattern Recogn. 21, 961–976 (2007)
Article Google Scholar
Zhang, B., Zuo, W.: Learning from positive and unlabeled examples: a survey. In: ISIP/WMWA, pp. 650–654 (2008)
Google Scholar

Download references

Acknowledgments

We would like to acknowledge the European project MAESTRA - Learning from Massive, Incompletely annotated, and Structured Data (ICT-2013-612944).

Author information

Authors and Affiliations

Department of Computer Science, University of Bari Aldo Moro, Via Orabona, 4, 70125, Bari, Italy
Paolo Mignone & Gianvito Pio

Authors

Paolo Mignone
View author publications
You can also search for this author in PubMed Google Scholar
Gianvito Pio
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gianvito Pio .

Editor information

Editors and Affiliations

Università degli Studi di Bari Aldo Moro, Bari, Italy
Michelangelo Ceci
American University, Washington, DC, USA
Nathalie Japkowicz
Hong Kong Baptist University, Kowloon, Hong Kong
Jiming Liu
University of Cyprus, Nicosia, Cyprus
George A. Papadopoulos
University of North Carolina, Charlotte, NC, USA
Zbigniew W. Raś

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mignone, P., Pio, G. (2018). Positive Unlabeled Link Prediction via Transfer Learning for Gene Network Reconstruction. In: Ceci, M., Japkowicz, N., Liu, J., Papadopoulos, G., Raś, Z. (eds) Foundations of Intelligent Systems. ISMIS 2018. Lecture Notes in Computer Science(), vol 11177. Springer, Cham. https://doi.org/10.1007/978-3-030-01851-1_2

Download citation

DOI: https://doi.org/10.1007/978-3-030-01851-1_2
Published: 07 October 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01850-4
Online ISBN: 978-3-030-01851-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics