Ве молиме користете го овој идентификатор да го цитирате или поврзете овој запис: http://hdl.handle.net/20.500.12188/17479
Наслов: COMPARATIVE ANALYSIS OF WORD EMBEDDINGS FOR CAPTURING WORD SIMILARITIES
Authors: Toshevska, Martina
Stojanovska, Frosina
Kalajdjieski, Jovan
Keywords: Word Embeddings, Distributed Word Representation, Word Similarity
Issue Date: 8-мај-2020
Journal: arXiv preprint arXiv:2005.03812
Abstract: Distributed language representation has become the most widely used technique for language representation in various natural language processing tasks. Most of the natural language processing models that are based on deep learning techniques use already pre-trained distributed word representations, commonly called word embeddings. Determining the most qualitative word embeddings is of crucial importance for such models. However, selecting the appropriate word embeddings is a perplexing task since the projected embedding space is not intuitive to humans. In this paper, we explore different approaches for creating distributed word representations. We perform an intrinsic evaluation of several state-of-the-art word embedding methods. Their performance on capturing word similarities is analysed with existing benchmark datasets for word pairs similarities. The research in this paper conducts a correlation analysis between ground truth word similarities and similarities obtained by different word embedding methods.
URI: http://hdl.handle.net/20.500.12188/17479
Appears in Collections:Faculty of Computer Science and Engineering: Journal Articles

Files in This Item:
File Опис SizeFormat 
2005.03812.pdf1.42 MBAdobe PDFView/Open
Прикажи целосна запис

Page view(s)

29
checked on 18.7.2024

Download(s)

13
checked on 18.7.2024

Google ScholarTM

Проверете


Записите во DSpace се заштитени со авторски права, со сите права задржани, освен ако не е поинаку наведено.