Sentiment Analysis in Finance: From Transformers Back to eXplainable Lexicons (XLex)

Rizinski, Maryan; Peshov, Hristijan; Mishev, Kostadin; Jovanovik, Milos; Trajanov, Dimitar

Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/27436

Title:	Sentiment Analysis in Finance: From Transformers Back to eXplainable Lexicons (XLex)
Authors:	Rizinski, Maryan Peshov, Hristijan Mishev, Kostadin Jovanovik, Milos Trajanov, Dimitar
Keywords:	Machine learning, natural language processing, text classification, sentiment analysis, finance, lexicons, lexicon learning, transformers, SHAP, explainabilit
Issue Date:	6-Jun-2023
Journal:	arXiv preprint arXiv:2306.03997
Abstract:	Lexicon-based sentiment analysis in finance leverages specialized, manually annotated lexicons created by human experts to effectively extract sentiment from financial texts. Although lexiconbased methods are simple to implement and fast to operate on textual data, they require considerable manual annotation efforts to create, maintain, and update the lexicons. These methods are also considered inferior to the deep learning-based approaches, such as transformer models, which have become dominant in various natural language processing (NLP) tasks due to their remarkable performance. However, their efficacy comes at a cost: these models require extensive data and computational resources for both training and testing. Additionally, they involve significant prediction times, making them unsuitable for real-time production environments or systems with limited processing capabilities. In this paper, we introduce a novel methodology named eXplainable Lexicons (XLex) that combines the advantages of both lexicon-based methods and transformer models. We propose an approach that utilizes transformers and SHapley Additive exPlanations (SHAP) for explainability to automatically learn financial lexicons. Our study presents four main contributions. Firstly, we demonstrate that transformer-aided explainable lexicons can enhance the vocabulary coverage of the benchmark Loughran-McDonald (LM) lexicon. This enhancement leads to a significant reduction in the need for human involvement in the process of annotating, maintaining, and updating the lexicons. Secondly, we show that the resulting lexicon outperforms the standard LM lexicon in sentiment analysis of financial datasets. Thirdly, we illustrate that the lexicon-based approach is significantly more efficient in terms of model speed and size compared to transformers. Lastly, the proposed XLex approach is inherently more interpretable than transformer models. This interpretability is advantageous as lexicon models rely on predefined rules, unlike transformers, which have complex inner workings. The interpretability of the models allows for better understanding and insights into the results of sentiment analysis, making the XLex approach a valuable tool for financial decision-making.
URI:	http://hdl.handle.net/20.500.12188/27436
Appears in Collections:	Faculty of Computer Science and Engineering: Journal Articles

Files in This Item:

File	Description	Size	Format
2306.03997.pdf		3.34 MB	Adobe PDF	View/Open

Show full item record

Page view(s)

40

checked on Nov 11, 2024

Download(s)

23

checked on Nov 11, 2024

Google Scholar^TM

Check

Repository of UKIM

Files in This Item:

Page view(s)

Download(s)

Google ScholarTM

Google Scholar^TM