Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/30402
Title: Real-Time Clustering of Text Data for News Aggregation
Authors: Najkov, D
Zdraveski, Vladimir 
Gusev, Marjan
Keywords: K-Means, MPI, parallelization, news aggregation
Issue Date: 21-Nov-2023
Publisher: IEEE
Conference: 2023 31st Telecommunications Forum (TELFOR)
Abstract: This paper explores real-time clustering of text data for news aggregation using parallelized variants of the K-Means algorithm built on the Message Passing Interface (MPI). We evaluate batch-based, centroid-based, and fusion-based methods, measuring their training time in two experiments: one that varies cluster complexity and one that varies dataset size. Our study aims to identify the most effective method and to analyze the trade-offs between parallelization strategies. Results indicate that the MPI-based solutions substantially reduce training time compared to a serial K-Means implementation in this context.
URI: http://hdl.handle.net/20.500.12188/30402
Appears in Collections: Faculty of Computer Science and Engineering: Conference papers
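
Illustrative sketch: the abstract does not describe how the batch-based, centroid-based, or fusion-based variants work, so the code below is not the authors' method. It is a minimal, generic data-parallel K-Means written under stated assumptions: documents are already converted to numeric vectors, the points are split across workers, each worker computes per-cluster partial sums and counts, and a sum-reduction combines them before the centroids are updated. The workers are simulated in a single process with plain Python; in an MPI program the reduction step would typically be a collective all-reduce. The names `partial_sums`, `parallel_kmeans`, and `n_ranks` are hypothetical.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def partial_sums(chunk: Sequence[Point], centroids: Sequence[Point]):
    """Work of one simulated rank: assign local points, return per-cluster sums and counts."""
    k, dim = len(centroids), len(centroids[0])
    sums = [[0.0] * dim for _ in range(k)]
    counts = [0] * k
    for p in chunk:
        # Nearest centroid by squared Euclidean distance.
        j = min(
            range(k),
            key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])),
        )
        counts[j] += 1
        for d in range(dim):
            sums[j][d] += p[d]
    return sums, counts


def parallel_kmeans(
    points: Sequence[Point],
    centroids: Sequence[Point],
    n_ranks: int = 2,
    iters: int = 10,
) -> List[Point]:
    """Data-parallel K-Means with the ranks simulated sequentially (assumption: no real MPI)."""
    # "Scatter": each simulated rank gets every n_ranks-th point.
    chunks = [points[r::n_ranks] for r in range(n_ranks)]
    current = [tuple(float(x) for x in c) for c in centroids]
    k, dim = len(current), len(current[0])
    for _ in range(iters):
        # Under MPI these calls would run concurrently, one per process.
        results = [partial_sums(chunk, current) for chunk in chunks]
        # "Reduce": element-wise sum of the partial results from all ranks.
        total_counts = [sum(r[1][j] for r in results) for j in range(k)]
        total_sums = [
            [sum(r[0][j][d] for r in results) for d in range(dim)] for j in range(k)
        ]
        # Update step; an empty cluster keeps its previous centroid.
        current = [
            tuple(s / total_counts[j] for s in total_sums[j])
            if total_counts[j]
            else current[j]
            for j in range(k)
        ]
    return current


if __name__ == "__main__":
    data = [(0.0, 0.0), (0.0, 2.0), (10.0, 10.0), (10.0, 12.0)]
    print(parallel_kmeans(data, [(0.0, 0.0), (10.0, 10.0)], n_ranks=2))
```

Because the reduction is a plain sum, the resulting centroids should not depend on how many workers the points are split across (up to floating-point rounding), which is what makes this style of parallelization comparable against a serial baseline.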
