TY - GEN
T1 - Association rules mining for handling continuous attributes using genetic network programming and fuzzy membership functions
AU - Taboada, Karla
AU - Shimada, Kaoru
AU - Mabu, Shingo
AU - Hirasawa, Kotaro
AU - Hu, Jinglu
PY - 2007/12/1
Y1 - 2007/12/1
N2 - Data Mining is the process of extracting useful hidden knowledge from large volumes of data and its results can be used in decision support systems. Several data mining algorithms have been developed; one example is the association rule mining, which discovers associations among items encoded within a database. When the values of the attributes in a database are continuous such as height, length or weight, their domain is usually discretized into several intervals, as a result, such attributes are handled as discrete attributes. In this study, a new approach of mining association rules for handling continuous attributes that does not use any discretization is proposed. The methodology is based on a new graph-based evolutionary algorithm named "Genetic Network Programming (GNP)" and fuzzy membership functions. Our data mining method first needs to transform the continuous values in transactions into linguistic terms, then judge them and find association rules using GNP. GNP represent its individuals using graph structures that evolve in order to find a solution; this feature contributes to creating quite compact programs and implicitly memorizing past action sequences. The proposed method can measure the significance of the extracted association rules by using support, confidence and χ2 test, and obtains a sufficient number of important association rules in a short time.
AB - Data Mining is the process of extracting useful hidden knowledge from large volumes of data and its results can be used in decision support systems. Several data mining algorithms have been developed; one example is the association rule mining, which discovers associations among items encoded within a database. When the values of the attributes in a database are continuous such as height, length or weight, their domain is usually discretized into several intervals, as a result, such attributes are handled as discrete attributes. In this study, a new approach of mining association rules for handling continuous attributes that does not use any discretization is proposed. The methodology is based on a new graph-based evolutionary algorithm named "Genetic Network Programming (GNP)" and fuzzy membership functions. Our data mining method first needs to transform the continuous values in transactions into linguistic terms, then judge them and find association rules using GNP. GNP represent its individuals using graph structures that evolve in order to find a solution; this feature contributes to creating quite compact programs and implicitly memorizing past action sequences. The proposed method can measure the significance of the extracted association rules by using support, confidence and χ2 test, and obtains a sufficient number of important association rules in a short time.
KW - Association rules
KW - Data mining
KW - Fuzzy membership functions
KW - Genetic network programming
UR - http://www.scopus.com/inward/record.url?scp=50249127237&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=50249127237&partnerID=8YFLogxK
U2 - 10.1109/SICE.2007.4421451
DO - 10.1109/SICE.2007.4421451
M3 - Conference contribution
AN - SCOPUS:50249127237
SN - 4907764286
SN - 9784907764289
T3 - Proceedings of the SICE Annual Conference
SP - 2723
EP - 2729
BT - SICE Annual Conference, SICE 2007
T2 - SICE(Society of Instrument and Control Engineers)Annual Conference, SICE 2007
Y2 - 17 September 2007 through 20 September 2007
ER -