Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12188/20778
Title: Weight of evidence as a tool for attribute transformation in the preprocessing stage of supervised learning algorithms
Authors: Zdravevski, Eftim 
Lameski, Petre 
Kulakov, Andrea 
Keywords: data transformation, data preprocessing, weight of evidence, information value, feature selection
Issue Date: 31-Jul-2011
Publisher: IEEE
Conference: The 2011 International Joint Conference on Neural Networks
Abstract: Feature transformation is a common task in the data preprocessing stage of data mining and classification problems. Many classification algorithms prefer continuous attributes over nominal ones, and sometimes the distance between data points cannot be estimated unless the attribute values are continuous and normalized. The Weight of Evidence has some very desirable properties that make it a very useful tool for transforming attributes, but certain preconditions must be met before it can be calculated. In this paper we propose a modified calculation of the Weight of Evidence that overcomes these preconditions and additionally makes it usable for test examples that were not present in the training set. The proposed transformation can be used for all supervised learning problems. Finally, we present the results of the proposed transformation and discuss the benefits of the transformed nominal and continuous attributes from the PAKDD 2009 dataset. The results show that, in all tested classification algorithms, the proposed transformation yields better performance than the method that generates dummy (i.e. binary) variables for each value of the nominal attributes.
URI: http://hdl.handle.net/20.500.12188/20778
Appears in Collections: Faculty of Computer Science and Engineering: Conference papers
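
For readers unfamiliar with the technique named in the abstract, the following is a minimal sketch of a Weight of Evidence transformation for one nominal attribute with a binary class label. It is not the modified calculation proposed in the paper, which this record does not reproduce. The sign convention (log of the share of positives over the share of negatives), the additive `smoothing` term used to avoid zero counts, the fallback of 0.0 for values unseen in training, and the function names `fit_woe` and `transform` are all assumptions made for illustration.

```python
import math
from collections import Counter


def fit_woe(values, labels, smoothing=0.5):
    """Estimate a WoE score per attribute value from training data.

    values: nominal attribute values; labels: 1 (positive) or 0 (negative).
    smoothing: additive constant (an assumption of this sketch, not the
    paper's method) that keeps every count non-zero so the log is defined.
    """
    pos = Counter(v for v, y in zip(values, labels) if y == 1)
    neg = Counter(v for v, y in zip(values, labels) if y == 0)
    categories = set(pos) | set(neg)
    k = len(categories)
    n_pos = sum(pos.values())
    n_neg = sum(neg.values())

    woe = {}
    for v in categories:
        # Smoothed share of positives / negatives that take the value v.
        p = (pos[v] + smoothing) / (n_pos + smoothing * k)
        q = (neg[v] + smoothing) / (n_neg + smoothing * k)
        woe[v] = math.log(p / q)
    return woe


def transform(values, woe, default=0.0):
    """Map nominal values to WoE scores.

    Values never seen in training get `default` (0.0 here, i.e. no
    evidence either way); this fallback is an assumption of the sketch.
    """
    return [woe.get(v, default) for v in values]


if __name__ == "__main__":
    train_values = ["a", "a", "b", "b", "c"]
    train_labels = [1, 0, 1, 1, 0]
    scores = fit_woe(train_values, train_labels)
    print(transform(["b", "c", "unseen"], scores))
```

Without smoothing, a value that occurs in only one class gives a zero share for the other class, so the ratio or its logarithm is undefined. This is one example of the kind of precondition the abstract refers to.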
