Please use this identifier to cite or link to this item:
http://hdl.handle.net/20.500.12188/20785
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Zdravevski, Eftim | en_US |
dc.contributor.author | Lameski, Petre | en_US |
dc.contributor.author | Kulakov, Andrea | en_US |
dc.contributor.author | Kalajdziski, Slobodan | en_US |
dc.date.accessioned | 2022-07-15T09:11:56Z | - |
dc.date.available | 2022-07-15T09:11:56Z | - |
dc.date.issued | 2015-09-13 | - |
dc.identifier.uri | http://hdl.handle.net/20.500.12188/20785 | - |
dc.description.abstract | Machine learning has received increased interest from both the scientific community and industry. Most machine learning algorithms rely on distance metrics that can only be applied to numeric data. This becomes a problem in complex datasets that contain heterogeneous data consisting of numeric and nominal (i.e. categorical) features, hence the need to transform nominal data into numeric data. Weight of evidence (WoE) is one of the parameters that can be used to transform nominal features into numeric ones. In this paper we describe a method that uses WoE to transform the features. Although the applicability of this method has been researched to some extent, in this paper we extend it to multi-class problems, which is a novelty. We compare it with the method that generates dummy features, testing both methods on binary and multi-class classification problems with different machine learning algorithms. Our experiments show that the WoE-based transformation generates a smaller number of features than the technique based on generating dummy features, while also improving classification accuracy, reducing memory complexity and shortening execution time. Nevertheless, we also point out some of its weaknesses and recommend when to use the method based on dummy feature generation instead. | en_US |
dc.publisher | IEEE | en_US |
dc.subject | Weight of Evidence, WoE, dummy features, data transformation, nominal features, categorical features, heterogeneous data | en_US |
dc.title | Transformation of nominal features into numeric in supervised multi-class problems based on the weight of evidence parameter | en_US |
dc.type | Proceeding article | en_US |
dc.relation.conference | 2015 Federated Conference on Computer Science and Information Systems (FedCSIS) | en_US |
item.fulltext | With Fulltext | - |
item.grantfulltext | open | - |
crisitem.author.dept | Faculty of Computer Science and Engineering | - |
crisitem.author.dept | Faculty of Computer Science and Engineering | - |
crisitem.author.dept | Faculty of Computer Science and Engineering | - |
crisitem.author.dept | Faculty of Computer Science and Engineering | - |
Appears in Collections: | Faculty of Computer Science and Engineering: Conference papers |
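The abstract above contrasts WoE-based encoding with dummy features. As a rough illustration only, the following sketch encodes one nominal feature both ways for a binary target. It assumes the common textbook definition WoE(c) = ln(P(c | y=1) / P(c | y=0)). The sign convention, the additive `smoothing` term, the function names and the toy data are all assumptions made for this example and are not taken from the paper, and the paper's multi-class extension is not reproduced here.

```python
import math
from collections import Counter


def woe_table(values, labels, smoothing=0.5):
    """Map each category to ln(P(category | y=1) / P(category | y=0)).

    `smoothing` is an assumed additive correction that keeps the ratio
    finite when a category never occurs with one of the two classes.
    """
    pos = Counter(v for v, y in zip(values, labels) if y == 1)
    neg = Counter(v for v, y in zip(values, labels) if y == 0)
    categories = sorted(set(values))
    k = len(categories)
    total_pos = sum(pos.values()) + smoothing * k
    total_neg = sum(neg.values()) + smoothing * k
    return {
        c: math.log(((pos[c] + smoothing) / total_pos)
                    / ((neg[c] + smoothing) / total_neg))
        for c in categories
    }


def dummy_encode(values):
    """One 0/1 column per distinct category, shown for comparison."""
    categories = sorted(set(values))
    return [[1 if v == c else 0 for c in categories] for v in values]


# Toy data, invented for illustration.
colors = ["red", "red", "blue", "green", "blue", "red"]
target = [1, 1, 0, 0, 1, 0]

table = woe_table(colors, target)
woe_feature = [table[v] for v in colors]   # a single numeric column
dummy_features = dummy_encode(colors)      # one column per category
```

In this toy case the WoE encoding yields one numeric column, while the dummy encoding yields as many columns as there are distinct categories, which is the difference in feature count that the abstract refers to.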
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Transformation_of_nominal_features_into.pdf | | 203.67 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.