Skip to main content

Gender Inference for Facebook Picture Owners

  • Conference paper
  • First Online:
Trust, Privacy and Security in Digital Business (TrustBus 2019)

Part of the book series: Lecture Notes in Computer Science ((LNSC,volume 11711))

Included in the following conference series:

Abstract

Social media such as Facebook provides a new way to connect, interact and learn. Facebook allows users to share photos and express their feelings by using comments. However, its users are vulnerable to attribute inference attacks where an attacker intends to guess private attributes (e.g., gender, age, political view) of target users through their online profiles and/or their vicinity (e.g., what their friends reveal). Given user-generated pictures on Facebook, we explore in this paper how to launch gender inference attacks on their owners from pictures meta-data composed of: (i) alt-texts generated by Facebook to describe the content of pictures, and (ii) comments posted by friends, friends of friends or regular users. We assume these two meta-data are the only available information to the attacker. Evaluation results demonstrate that our attack technique can infer the gender with an accuracy of 84% by leveraging only alt-texts, 96% by using only comments, and 98% by combining alt-texts and comments. We compute a set of sensitive words that enable attackers to perform effective gender inference attacks. We show the adversary prediction accuracy is decreased by hiding these sensitive words. To the best of our knowledge, this is the first inference attack on Facebook that exploits comments and alt-texts solely.

This work is supported by DIGITRUST (http://lue.univ-lorraine.fr/fr/article/digitrust/).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    \(\bigcup \) is the disjoint union.

  2. 2.

    https://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.chi2.html.

  3. 3.

    https://www.bogotobogo.com/python/scikit-learn/scikt_machine_learning_Decision_Tree_Learning_Informatioin_Gain_IG_Impurity_Entropy_Gini_Classification_Error.php.

  4. 4.

    https://www.scikit-yb.org/en/latest/api/features/importances.html.

  5. 5.

    https://scikit-learn.org/stable/auto_examples/feature_selection/plot_feature_selection.html.

References

  1. Bargh, J.A., McKenna, K.Y., Fitzsimons, G.M.: Can you see the real me? Activation and expression of the “true self” on the internet. J. Soc. Issues 58(1), 33–48 (2002)

    Article  Google Scholar 

  2. Chaabane, A., Acs, G., Kaafar, M.A., et al.: You are what you like! information leakage through users’ interests. In: Proceedings of the 19th Annual Network & Distributed System Security Symposium (NDSS) (2012)

    Google Scholar 

  3. Chen, Y.-Y., Chen, T., Hsu, W.H., Liao, H.-Y.M., Chang, S.-F.: Predicting viewer affective comments based on image content in social media. In: Proceedings of International Conference on Multimedia Retrieval, p. 233. ACM (2014)

    Google Scholar 

  4. Church, K.W., Hanks, P.: Word association norms, mutual information, and lexicography. Comput. Linguist. 16(1), 22–29 (1990)

    Google Scholar 

  5. Datta, A., Tschantz, M.C., Datta, A.: Automated experiments on ad privacy settings. Proc. Privacy Enhancing Technol. 2015(1), 92–112 (2015)

    Article  Google Scholar 

  6. Fkih, F., Omri, M.N.: Learning the size of the sliding window for the collocations extraction: a ROC-based approach. In: Proceedings of the 2012 International Conference on Artificial Intelligence (ICAI 2012), pp. 1071–1077 (2012)

    Google Scholar 

  7. Gong, N.Z., Liu, B.: Attribute inference attacks in online social networks. ACM Trans. Privacy Secur. (TOPS) 21(1), 3 (2018)

    Google Scholar 

  8. Gong, N.Z., et al.: Joint link prediction and attribute inference using a social-attribute network. ACM Trans. Intell. Syst. Technol. (TIST) 5(2), 27 (2014)

    Google Scholar 

  9. He, J., Chu, W.W., Liu, Z.V.: Inferring privacy information from social networks. In: Mehrotra, S., Zeng, D.D., Chen, H., Thuraisingham, B., Wang, F.-Y. (eds.) ISI 2006. LNCS, vol. 3975, pp. 154–165. Springer, Heidelberg (2006). https://doi.org/10.1007/11760146_14

    Chapter  Google Scholar 

  10. Kang, R., Brown, S., Kiesler, S.: Why do people seek anonymity on the internet? Informing policy and design. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 2657–2666. ACM (2013)

    Google Scholar 

  11. Lee, K.: Combining multiple feature selection methods. In: Mid-Atlantic Student Workshop on Programming Languages and Systems (MASPLAS 2002), p. 12. Citeseer (2002)

    Google Scholar 

  12. Filho, J.A.B.L., Pasti, R., de Castro, L.N.: Gender classification of twitter data based on textual meta-attributes extraction. New Advances in Information Systems and Technologies. AISC, vol. 444, pp. 1025–1034. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-31232-3_97

    Chapter  Google Scholar 

  13. Merler, M., Cao, L., Smith, J.R.: You are what you tweet... pic! gender prediction based on semantic analysis of social media images. In: 2015 IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6. IEEE (2015)

    Google Scholar 

  14. Mihaljević, H., Santamaría, L.: Telling the gender from a name (2018)

    Google Scholar 

  15. Miller, Z., Dickinson, B., Hu, W.: Gender prediction on Twitter using stream algorithms with n-gram character features. Int. J. Intell. Sci. 2(04), 143 (2012)

    Article  Google Scholar 

  16. Stavrianou, A., Brun, C., Silander, T., Roux, C.: NLP-based feature extraction for automated tweet classification. Interact. Data Mining Nat. Lang. Process. 145 (2014)

    Google Scholar 

  17. Tobin, A., Merrill, J.B.: Facebook is letting job advertisers target only men (2018)

    Google Scholar 

  18. Tsai, C.-F., Hsiao, Y.-C.: Combining multiple feature selection methods for stock prediction: union, intersection, and multi-intersection approaches. Decis. Support Syst. 50(1), 258–269 (2010)

    Article  Google Scholar 

  19. Weinsberg, U., Bhagat, S., Ioannidis, S., Taft, N.: BlurMe: inferring and obfuscating user gender based on ratings. In: Proceedings of the sixth ACM Conference on Recommender Systems, pp. 195–202. ACM (2012)

    Google Scholar 

  20. Wu, S., Wieland, J., Farivar, O., Schiller, J.: Automatic alt-text: computer-generated image descriptions for blind users on a social network service. In: Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing, pp. 1180–1192. ACM (2017)

    Google Scholar 

  21. Xu, W., Zhou, X., Li, L.: Inferring privacy information via social relations. In: 2008 IEEE 24th International Conference on Data Engineering Workshop, pp. 525–530. IEEE (2008)

    Google Scholar 

  22. Yurchisin, J., Watchravesringkan, K., McCabe, D.B.: An exploration of identity re-creation in the context of Internet dating. Soc. Beh. Pers.: Int. J. 33(8), 735–750 (2005)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Bizhan Alipour , Abdessamad Imine or Michaël Rusinowitch .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Alipour, B., Imine, A., Rusinowitch, M. (2019). Gender Inference for Facebook Picture Owners. In: Gritzalis, S., Weippl, E., Katsikas, S., Anderst-Kotsis, G., Tjoa, A., Khalil, I. (eds) Trust, Privacy and Security in Digital Business. TrustBus 2019. Lecture Notes in Computer Science(), vol 11711. Springer, Cham. https://doi.org/10.1007/978-3-030-27813-7_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-27813-7_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-27812-0

  • Online ISBN: 978-3-030-27813-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics