Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/20785
DC FieldValueLanguage
dc.contributor.authorZdravevski, Eftimen_US
dc.contributor.authorLameski, Petreen_US
dc.contributor.authorKulakov, Andreaen_US
dc.contributor.authorKalajdziski, Slobodanen_US
dc.date.accessioned2022-07-15T09:11:56Z-
dc.date.available2022-07-15T09:11:56Z-
dc.date.issued2015-09-13-
dc.identifier.urihttp://hdl.handle.net/20.500.12188/20785-
dc.description.abstractMachine learning has received increased interest by both the scientific community and the industry. Most of the machine learning algorithms rely on certain distance metrics that can only be applied to numeric data. This becomes a problem in complex datasets that contain heterogeneous data consisted of numeric and nominal (i.e. categorical) features. Thus the need of transformation from nominal to numeric data. Weight of evidence (WoE) is one of the parameters that can be used for transformation of the nominal features to numeric. In this paper we describe a method that uses WoE to transform the features. Although the applicability of this method is researched to some extent, in this paper we extend its applicability for multi-class problems, which is a novelty. We compared it with the method that generates dummy features. We test both methods on binary and multi-class classification problems with different machine learning algorithms. Our experiments show that the WoE based transformation generates smaller number of features compared to the technique based on generation of dummy features while also improving the classification accuracy, reducing memory complexity and shortening the execution time. Be that as it may, we also point out some of its weaknesses and make some recommendations when to use the method based on dummy features generation instead.en_US
dc.publisherIEEEen_US
dc.subjectWeight of Evidence, WoE, dummy features, data transformation, nominal features, categorical features, heterogeneous dataen_US
dc.titleTransformation of nominal features into numeric in supervised multi-class problems based on the weight of evidence parameteren_US
dc.typeProceeding articleen_US
dc.relation.conference2015 Federated Conference on Computer Science and Information Systems (FedCSIS)en_US
item.grantfulltextopen-
item.fulltextWith Fulltext-
crisitem.author.deptFaculty of Computer Science and Engineering-
crisitem.author.deptFaculty of Computer Science and Engineering-
crisitem.author.deptFaculty of Computer Science and Engineering-
crisitem.author.deptFaculty of Computer Science and Engineering-
Appears in Collections:Faculty of Computer Science and Engineering: Conference papers
Files in This Item:
File Description SizeFormat 
Transformation_of_nominal_features_into.pdf203.67 kBAdobe PDFView/Open
Show simple item record

Page view(s)

33
checked on May 20, 2024

Download(s)

16
checked on May 20, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.