Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/28998
Title: Parallelizing file-type conversion for financial analysis
Authors: Alek Jarmov
Zdraveski, Vladimir 
Kostoska, Magdalena 
Keywords: XLSX , XML , CSV , process pool , parallelization , financial analysis
Issue Date: Nov-2023
Publisher: IEEE
Conference: 2023 31st Telecommunications Forum (TELFOR)
Abstract: Data analysis has gained significant traction, particularly in the era of artificial intelligence, offering novel approaches for financial data analysis. However, a data storage challenge arises prior to analysis. Financial data is commonly stored in the XLSX format, whereas for faster analysis and reduced server storage, the preferred format is CSV. This paper investigates the acceleration of XLSX to CSV conversion. The XLSX file’s main content is represented as a tree structure in XML format. Leveraging the independent nature of rows and files, we propose two methods for parallelizing the conversion process: single file parallelization and simultaneous parallel conversion of multiple files. Our results demonstrate the effectiveness of parallelization, resulting in reduced workflow waiting times.
URI: http://hdl.handle.net/20.500.12188/28998
DOI: 10.1109/TELFOR59449.2023.10372813
Appears in Collections:Faculty of Computer Science and Engineering: Conference papers

Show full item record

Page view(s)

22
checked on Apr 29, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.