Ве молиме користете го овој идентификатор да го цитирате или поврзете овој запис: http://hdl.handle.net/20.500.12188/20825
Наслов: Row key designs of NoSQL database tables and their impact on write performance
Authors: Zdravevski, Eftim 
Lameski, Petre 
Kulakov, Andrea 
Keywords: NoSQL, HBase, Hadoop, table design, row key, primary key, clustered index
Issue Date: 17-фев-2016
Publisher: IEEE
Conference: 2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)
Abstract: In several NoSQL database systems, among which is HBase, only one index is available for the tables, which is also the row key and the clustered index. Using other indexes does not come out of the box. As a result, the row key design is the most important thing when designing tables, because an inappropriate design can lead to detrimental consequences on performances and costs. Particular row key designs are suitable for different problems, and in this paper we analyze the performance, characteristics and applicability of each of them. In particular we investigate the effect of using various techniques for modeling row keys: sequences, salting, padding, hashing, and modulo operations. We propose four different designs based on these techniques and we analyze their performance on different HBase clusters when loading HDFS files with various sizes. The experiments show that particular designs consistently outperform others on differently sized clusters in both execution time and even load distribution across nodes.
URI: http://hdl.handle.net/20.500.12188/20825
Appears in Collections:Faculty of Computer Science and Engineering: Conference papers

Files in This Item:
File Опис SizeFormat 
2015_HBase_Rowkeys_PDP_2016_EftimZdravevski.pdf1.35 MBAdobe PDFView/Open
Прикажи целосна запис

Page view(s)

81
checked on 11.10.2024

Download(s)

37
checked on 11.10.2024

Google ScholarTM

Проверете


Записите во DSpace се заштитени со авторски права, со сите права задржани, освен ако не е поинаку наведено.