Resolving Ambiguity Using Word Embeddings for Personalized Information Retrieval in Folksonomy Systems

Document Type : Original Article

Authors

1 Department of Computer Engineering, University of Science and Culture, Tehran, Iran

2 Scientific Information Database(SID), Academic Center for Education, Culture and Research ACECR

Abstract

The diversity and high volume of available information on the web make data retrieval a serious challenge in this environment. On the other hand, obtaining user satisfaction is difficult, which is one of the main challenges of data retrieval systems. Depending on their information about interests and needs for the same keyword, different people expect different responses from Information Retrieval (IR) systems. Achieving this goal requires an effective method to retrieve information. Personalized Information Retrieval (PIR) is an effective method to achieve this goal which is considered by researchers today.  Folksonomy is the process that allows users to tag in a specific domain of information in a social environment (tags are accessible to other users). Folksonomy systems are made collaborative tagging systems. Due to the large volume and variety of tags produced, resolving ambiguity is a severe challenge in these systems. In recent years, word embedding methods have been considered by researchers as a successful method to fix the ambiguity of texts.
This study proposes a model which, in addition to using word embedding methods to remove tag ambiguity, provides search results in a personalized approach by fixing ambiguity and sentiment analysis combination tailored to users' interests. In this research, different models of word embeddings were applied. The experiments' results show that after applying the fixing ambiguity, the mean accuracy criterion improved by 1.93% and the mean MRR (Mean Reciprocal Rank)  by 0.38%.

Keywords

Main Subjects


  • Xie et al., "Incorporating sentiment into tag-based user profiles and resource profiles for personalized search in folksonomy," Information Processing & Management, vol. 52, no. 1, pp. 61-72, 2016.
  • Zhou, X. Wu, W. Zhao, S. Lawless and J. Liu, "Query expansion with enriched user profiles for personalized search utilizing folksonomy data," IEEE Transactions on Knowledge and Data Engineering, vol. 29, no. 7, pp. 1536-1548, 2017.
  • Mikolov, K. Chen, G. Corrado and J. Dean, "Efficient estimation of word representations in vector space," arXiv preprint arXiv:1301.3781, 2013.
  • Mikolov, I. Sutskever, K. Chen, G. S. Corrado and J. Dean, "Distributed representations of words and phrases and their compositionality," in Advances in neuralinformation processing systems, 2013, pp. 3111-3119.
  • Saleh and N. El-Tazi, "Finding semantic relationships in folksonomies," In 2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI), IEEE, 2018, pp. 174-181.
  • Pittaras, G. Giannakopoulos, G. Papadakis, and V. Karkaletsis, "Text classification with semantically enriched word embeddings," Natural Language Engineering, pp. 1-35, 2020.
  • C. Fernández-Reyes, J. Hermosillo-Valadez, and M. Montes-y-Gómez, “A prospect-guided global queryexpansion strategy using word embeddings,” Information Processing & Management, vol. 54, no. 1, pp. 1-13, 2018.
  • Wang, M. Wang, and H. Fujita, "Word sense disambiguation: A comprehensive knowledge exploitation framework," Knowledge-Based Systems, vol. 190, p. 105030, 2020.
  • D. Vo, Q. P. Nguyen, and C.-Y. Ock, "Semantic and syntactic analysis in learning representation based on a Sentiment analysis model," Applied Intelligence, vol. 50, no. 3, pp. 663-680, 2020.
  • Habimana, Y. Li, R. Li, X. Gu, and G. Yu, "Sentiment analysis using deep learning approaches: an overview," Science China Information Sciences, vol. 63, no. 1, pp. 1-36, 2020.
  • C. Dang, M. N. Moreno-García, and F. De la Prieta, "Sentiment analysis based on deep learning: A comparative study," Electronics, vol. 9, no. 3, p. 483, 2020.
  • Luo, S. Chen, G. Xu, and J. Zhou, Trust-based collective view prediction, Springer, 2013.
  • Cai and Q. Li, "Personalized search by tag-based user profile and resource profile incollaborative tagging systems," In Proceedings of the 19th ACM international conference on information and knowledge management, 2010, pp. 969-978.
  • G. Noll and C. Meinel, "Web search personalization via social bookmarking and tagging," In The semantic web, Springer, 2007, pp. 367-380.
  • Xu, S. Bao, B. Fei, Z. Su, and Y. Yu, "Exploring folksonomy for personalized search," In Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval, 2008, pp. 155-162.
  • Rong, "word2vec parameter learning explained," arXiv preprint arXiv:1411.2738, 2014.
  • Cambria, D. Olsher, and D. Rajagopal, "SenticNet 3: a common and common-sense knowledge base for cognition-driven Sentiment analysis," In Twenty-eighth AAAI conference on artificial intelligence, 2014.

 

 

Ghazale Etemadikhu is a graduate student in master of software engineering at the University of Science and Culture, Tehran, Iran. Her research interest is information retrieval.

Fatemeh Azimzadeh received a Ph.D. degree in Information Technology from University Putra Malaysia in 2012. Currently, she is an assistant professor in ACECR, Tehran, Iran. She is also the director of SID (Scientific Information Database) in Iran. Her research interests include information retrieval and information quality.

Abdalsamad Keramatfar is a Ph.D. student in information technology at the University of Qom and a data scientist at SID. He currently works in natural language processing and machine learning and specifically on multi-thread modeling of context for social media sentiment analysis. His research interests are artificial intelligence and natural language