Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/30402
Title: Real-Time Clustering of Text Data for News Aggregation
Authors: Najkov, D
Zdraveski, Vladimir 
Gusev, Marjan
Keywords: K-Means , MPI , parallelization , news aggregation
Issue Date: 21-Nov-2023
Publisher: IEEE
Conference: 2023 31st Telecommunications Forum (TELFOR)
Abstract: This paper explores real-time text data clustering in news aggregation using the Message Passing Interface (MPI) with parallelized K-Means algorithm variants. We evaluate batch-based, centroid-based, and fusion-based methods, measuring their training time in two experiments—one based on cluster complexity and the other on dataset size. Our study aims to identify the most effective method and analyze trade-offs between parallelization strategies. Results indicate that MPI-based solutions substantially accelerate training time compared to serial K-Means implementation in this context.
URI: http://hdl.handle.net/20.500.12188/30402
Appears in Collections:Faculty of Computer Science and Engineering: Conference papers

Show full item record

Page view(s)

38
checked on Oct 31, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.