Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/8894
Title: Evaluation of Sentiment Analysis in Finance: From Lexicons to Transformers
Authors: Mishev, Kostadin 
Gjorgjevikj, Ana
Vodenska, Irena
Chitkushev, Lubomir T.
Trajanov, Dimitar 
Issue Date: Jun-2020
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Journal: IEEE Access
Abstract: Financial and economic news is continuously monitored by financial market participants. According to the efficient market hypothesis, all past information is reflected in stock prices and new information is instantaneously absorbed in determining future stock prices. Hence, prompt extraction of positive or negative sentiments from news is very important for investment decision-making by traders, portfolio managers and investors. Sentiment analysis models can provide an efficient method for extracting actionable signals from the news. However, financial sentiment analysis is challenging due to domain-specific language and unavailability of large labeled datasets. General sentiment analysis models are ineffective when applied to specific domains such as finance. To overcome these challenges, we design an evaluation platform which we use to assess the effectiveness and performance of various sentiment analysis approaches, based on combinations of text representation methods and machine-learning classifiers. We perform more than one hundred experiments using publicly available datasets, labeled by financial experts. We start the evaluation with specific lexicons for sentiment analysis in finance and gradually build the study to include word and sentence encoders, up to the latest available NLP transformers. The results show improved efficiency of contextual embeddings in sentiment analysis compared to lexicons and fixed word and sentence encoders, even when large datasets are not available. Furthermore, distilled versions of NLP transformers produce comparable results to their larger teacher models, which makes them suitable for use in production environments.
URI: http://hdl.handle.net/20.500.12188/8894
DOI: 10.1109/access.2020.3009626
Appears in Collections:Faculty of Computer Science and Engineering: Journal Articles

Files in This Item:
File Description SizeFormat 
09142175.pdf3.6 MBAdobe PDFView/Open
Show full item record

Page view(s)

88
checked on Apr 19, 2024

Download(s)

113
checked on Apr 19, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.