An integration of cloud transform and rough set theory to induction of decision trees

Jing Song, Tianrui Li, Da Ruan, Özgür Kabak

    Research outputpeer-review

    Abstract

    Decision trees are one of the most popular data-mining techniques for knowledge discovery. Many approaches for induction of decision trees often deal with the continuous data and missing values in information systems. However, they do not perform well in real situations. This paper presents a new algorithm, decision tree construction based on the Cloud transform and Rough set theory under the characteristic relation (CR), for mining classification knowledge from a given data set. The continuous data is transformed into discrete qualitative concepts via the cloud transformation and then the attribute with the smallest weighted mean roughness under the characteristic relation is selected as the current splitting node. Experimental evaluation shows the decision trees constructed by the CR algorithm tend to have a simpler structure, much higher classification accuracy and more understandable rules than those by C5.0 in most cases.
    Original languageEnglish
    Pages (from-to)261-273
    JournalFundamenta Informaticae
    Volume94
    Issue number2
    DOIs
    StatePublished - Nov 2009

    Cite this