Ве молиме користете го овој идентификатор да го цитирате или поврзете овој запис: http://hdl.handle.net/20.500.12188/21384
Наслов: Simplifying parallel implementation of algorithms on Hadoop with Pig Latin
Authors: Zdravevski, Eftim 
Lameski, Petre 
Kulakov, Andrea 
Filiposka, Sonja 
Trajanov, Dimitar 
Keywords: Hadoop, MapReduce, HBase, Pig, parallel algorithms, distributed algorithms
Issue Date: 2015
Conference: CIIT
Abstract: In this paper we present a general technique for parallelizing regular algorithms with the tools the Hadoop ecosystem offers: MapReduce, HDFS, HBase and Pig. This framework can be applied for parallelizing algorithms for feature selection, clustering, machine learning etc. It consists of several steps: load the datasets in HDFS, apply some transformations if they are needed, store the datasets in HBase, and implement the algorithm in Pig with the help of User Defined Functions.
URI: http://hdl.handle.net/20.500.12188/21384
Appears in Collections:Faculty of Computer Science and Engineering: Conference papers

Files in This Item:
File Опис SizeFormat 
SimplifyingMapReducedevelopmentonHadoopandHBasewithPigLatin-EftimZdravevski.pdf312.49 kBAdobe PDFView/Open
Прикажи целосна запис

Page view(s)

51
checked on 4.5.2025

Download(s)

16
checked on 4.5.2025

Google ScholarTM

Проверете


Записите во DSpace се заштитени со авторски права, со сите права задржани, освен ако не е поинаку наведено.