Linear Discriminant Analysis (LDA) and the related Fisher's linear discriminant are very important techniques used for classification and for dimensionality reduction. A certain complication occurs in applying these methods to real data. We have to estimate the class means and common covariance matrix, which are not known. A problem arises if the number of features exceeds the number of observations. In this case the estimate of the covariance matrix does not have full rank, and so cannot be inverted. There are a number of ways to deal with this problem. In our previous paper, we proposed improving LDA in this area, and we presented a new approach which uses a generalization of the Moore-Penrose (MP) pseudo-inverse to remove this weakness. However, for data sets with a larger number of features, our method was computationally too slow to achieve good results. Now we propose a model selection method with a genetic algorithm to solve this problem. Experimental results on different data sets demonstrate that the improvement is efficient.
Publié le : 2016-11-02
Classification:  Theoretical Foundations,  Linear discriminant analysis, Moore-Penrose pseudo-inverse, genetic algorithm,  62-H30
@article{cai2275,
     author = {Tomasz G\'orecki; Faculty of Mathematics and Computer Science, Adam Mickiewicz University, Pozna\'n and Maciej \L uczak; Faculty of Civil Engineering, Environmental and Geodetic Sciences, Koszalin University of Technology, Koszalin},
     title = {Evolutionarily Tuned Generalized Pseudo-Inverse in Linear Discriminant Analysis},
     journal = {Computing and Informatics},
     volume = {34},
     number = {4},
     year = {2016},
     language = {en},
     url = {http://dml.mathdoc.fr/item/cai2275}
}
Tomasz Górecki; Faculty of Mathematics and Computer Science, Adam Mickiewicz University, Poznań; Maciej Łuczak; Faculty of Civil Engineering, Environmental and Geodetic Sciences, Koszalin University of Technology, Koszalin. Evolutionarily Tuned Generalized Pseudo-Inverse in Linear Discriminant Analysis. Computing and Informatics, Tome 34 (2016) no. 4, . http://gdmltest.u-ga.fr/item/cai2275/