Improving bag-of-visual-words image retrieval with predictive clustering trees

Dimitrovski, Ivica; Kocev, Dragi; Loshkovska, Suzana; Djeroski, Sasho

Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/23160

DC Field	Value	Language
dc.contributor.author	Dimitrovski, Ivica	en_US
dc.contributor.author	Kocev, Dragi	en_US
dc.contributor.author	Loshkovska, Suzana	en_US
dc.contributor.author	Djeroski, Sasho	en_US
dc.date.accessioned	2022-09-28T10:11:56Z	-
dc.date.available	2022-09-28T10:11:56Z	-
dc.date.issued	2016-02-01	-
dc.identifier.uri	http://hdl.handle.net/20.500.12188/23160	-
dc.description.abstract	The recent overwhelming increase in the amount of available visual information, especially digital images, has brought up a pressing need to develop efficient and accurate systems for image retrieval. State-of-the-art systems for image retrieval use the bag-of-visual-words representation of images. However, the computational bottleneck in all such systems is the construction of the visual codebook, i.e., obtaining the visual words. This is typically performed by clustering hundreds of thousands or millions of local descriptors, where the resulting clusters correspond to visual words. Each image is then represented by a histogram of the distribution of its local descriptors across the codebook. The major issue in retrieval systems is that by increasing the sizes of the image databases, the number of local descriptors to be clustered increases rapidly: Thus, using conventional clustering techniques is infeasible. Considering this, we propose to construct the visual codebook by using predictive clustering trees (PCTs), which can be constructed and executed efficiently and have good predictive performance. Moreover, to increase the stability of the model, we propose to use random forests of predictive clustering trees. We create a random forest of PCTs that represents both the codebook and the indexing structure. We evaluate the proposed improvement of the bag-of-visual-words approach on three reference datasets and two additional datasets of 100 K images and 1 M images, compare it to two state-of-the-art methods based on approximate k-means and extremely randomized tree ensembles. The results reveal that the proposed method produces a visual codebook with superior discriminative power and thus better retrieval performance while maintaining excellent computational efficiency.	en_US
dc.publisher	Elsevier	en_US
dc.relation.ispartof	Information Sciences	en_US
dc.subject	Image retrieval Feature extraction Visual codebook Predictive clustering	en_US
dc.title	Improving bag-of-visual-words image retrieval with predictive clustering trees	en_US
dc.type	Journal Article	en_US
item.fulltext	With Fulltext	-
item.grantfulltext	open	-
crisitem.author.dept	Faculty of Computer Science and Engineering	-
crisitem.author.dept	Faculty of Computer Science and Engineering	-
Appears in Collections:	Faculty of Computer Science and Engineering: Journal Articles

Files in This Item:

File	Description	Size	Format
2015-DimitrovskiEtAl-INFSCI.pdf		3.74 MB	Adobe PDF	View/Open

Show simple item record

Page view(s)

28

checked on Jun 13, 2024

Download(s)

11

checked on Jun 13, 2024

Google Scholar^TM

Check

Repository of UKIM

Files in This Item:

Page view(s)

Download(s)

Google ScholarTM

Google Scholar^TM