Central European Journal of Computer Science, Volume (1), No (2), Year (2011-1) , Pages (52-67)

Title : ( Correlation based splitting criterionin multi branch decision tree )

Authors: Hadi Sadoghi Yazdi , Nima Salehi Moghaddami , Hanieh Poostchi Mohammadabadi ,

Citation: BibTeX | EndNote

Abstract

One of the most commonly used predictive models in classification is the decision tree (DT). The task of a DT is to map observations to target values. In the DT, each branch represents a rule. A rule’s consequent is the leaf of the branch and its antecedent is the conjunction of the features. Most applied algorithms in this field use the concept of Information Entropy and Gini Index as the splitting criterion when building a tree. In this paper, a new splitting criterion to build DTs is proposed. A splitting criterion specifies the tree’s best splitting variable as well as the variable’s threshold for further splitting. Using the idea from classical Forward Selection method and its enhanced versions, the variable having the largest absolute correlation with the target value is chosen as the best splitting variable at each node. Then, the idea of maximizing the margin between classes in a support vector machine (SVM) is used to find the best classification threshold on the selected variable. This procedure will execute recursively at each node, until reaching the leaf nodes. The final decision tree has a shorter height than previous methods, which effectively reduces useless variables and the time needed for classification of future data. Unclassified regions are also generated under the proposed method, which can be interpreted as an advantage or disadvantage. The simulation results demonstrate an improvement in the generated decision tree compared to previous methods.

Keywords

decision tree – splitting criterion – support vector machine – correlation – unclassified region
برای دانلود از شناسه و رمز عبور پرتال پویا استفاده کنید.

@article{paperid:1022800,
author = {Sadoghi Yazdi, Hadi and Salehi Moghaddami, Nima and Poostchi Mohammadabadi, Hanieh},
title = {Correlation based splitting criterionin multi branch decision tree},
journal = {Central European Journal of Computer Science},
year = {2011},
volume = {1},
number = {2},
month = {January},
issn = {1896-1533},
pages = {52--67},
numpages = {15},
keywords = {decision tree – splitting criterion – support vector machine – correlation – unclassified region},
}

[Download]

%0 Journal Article
%T Correlation based splitting criterionin multi branch decision tree
%A Sadoghi Yazdi, Hadi
%A Salehi Moghaddami, Nima
%A Poostchi Mohammadabadi, Hanieh
%J Central European Journal of Computer Science
%@ 1896-1533
%D 2011

[Download]