Ве молиме користете го овој идентификатор да го цитирате или поврзете овој запис: http://hdl.handle.net/20.500.12188/28998
Наслов: Parallelizing file-type conversion for financial analysis
Authors: Alek Jarmov
Zdraveski, Vladimir 
Kostoska, Magdalena 
Keywords: XLSX , XML , CSV , process pool , parallelization , financial analysis
Issue Date: ное-2023
Publisher: IEEE
Conference: 2023 31st Telecommunications Forum (TELFOR)
Abstract: Data analysis has gained significant traction, particularly in the era of artificial intelligence, offering novel approaches for financial data analysis. However, a data storage challenge arises prior to analysis. Financial data is commonly stored in the XLSX format, whereas for faster analysis and reduced server storage, the preferred format is CSV. This paper investigates the acceleration of XLSX to CSV conversion. The XLSX file’s main content is represented as a tree structure in XML format. Leveraging the independent nature of rows and files, we propose two methods for parallelizing the conversion process: single file parallelization and simultaneous parallel conversion of multiple files. Our results demonstrate the effectiveness of parallelization, resulting in reduced workflow waiting times.
URI: http://hdl.handle.net/20.500.12188/28998
DOI: 10.1109/TELFOR59449.2023.10372813
Appears in Collections:Faculty of Computer Science and Engineering: Conference papers

Прикажи целосна запис

Page view(s)

35
checked on 8.8.2024

Google ScholarTM

Проверете

Altmetric


Записите во DSpace се заштитени со авторски права, со сите права задржани, освен ако не е поинаку наведено.