Please use this identifier to cite or link to this item:
http://hdl.handle.net/20.500.12188/30402
Title: | Real-Time Clustering of Text Data for News Aggregation | Authors: | Najkov, D Zdraveski, Vladimir Gusev, Marjan |
Keywords: | K-Means , MPI , parallelization , news aggregation | Issue Date: | 21-Nov-2023 | Publisher: | IEEE | Conference: | 2023 31st Telecommunications Forum (TELFOR) | Abstract: | This paper explores real-time text data clustering in news aggregation using the Message Passing Interface (MPI) with parallelized K-Means algorithm variants. We evaluate batch-based, centroid-based, and fusion-based methods, measuring their training time in two experiments—one based on cluster complexity and the other on dataset size. Our study aims to identify the most effective method and analyze trade-offs between parallelization strategies. Results indicate that MPI-based solutions substantially accelerate training time compared to serial K-Means implementation in this context. | URI: | http://hdl.handle.net/20.500.12188/30402 |
Appears in Collections: | Faculty of Computer Science and Engineering: Conference papers |
Show full item record
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.