Loading [MathJax]/extensions/MathZoom.js
A Data Mining Approach to Discover Genetic and Environmental Factors involved in Multifactoral Diseases
Jourdan, L. ; Dhaenens, C. ; Talbi, E.G. ; Gallina, S.
HAL, inria-00001181 / Harvested from HAL
In this paper, we are interested in discovering genetic factors that are involved in multifactorial diseases. Therefore, experiments have been achieved by the Biological Institute of Lille and a lot of data has been generated. To exploit this data, data mining tools are required and we propose a 2-phase optimization approach using a specific genetic algorithm. During the first step, we select significant features with a specific genetic algorithm. Then, during the second step, we cluster affected individuals according to the features selected by the first phase. The paper describes the specificities of the genetic problem that we are studying and presents in details the genetic algorithm that we have developed to deal with this very large size problem of feature selection. Results on both artificial and real data are presented.
Publié le : 2002-05-05
Classification:  [MATH.MATH-CO]Mathematics [math]/Combinatorics [math.CO]
@article{inria-00001181,
     author = {Jourdan, L. and Dhaenens, C. and Talbi, E.G. and Gallina, S.},
     title = {A Data Mining Approach to Discover Genetic and Environmental Factors involved in Multifactoral Diseases},
     journal = {HAL},
     volume = {2002},
     number = {0},
     year = {2002},
     language = {en},
     url = {http://dml.mathdoc.fr/item/inria-00001181}
}
Jourdan, L.; Dhaenens, C.; Talbi, E.G.; Gallina, S. A Data Mining Approach to Discover Genetic and Environmental Factors involved in Multifactoral Diseases. HAL, Tome 2002 (2002) no. 0, . http://gdmltest.u-ga.fr/item/inria-00001181/