dRAP-Independent: A Data Distribution Algorithm for Mining First-Order Frequent Patterns
Jan Blaťák ; Luboš Popelínský
Computing and Informatics, Volume 28 (2012), no. 1
In this paper we present dRAP-Independent, an algorithm for independent distributed mining of first-order frequent patterns. The system is based on RAP, an algorithm for finding maximal frequent patterns in first-order logic. dRAP-Independent uses a modified version of the data partitioning schema introduced by Savasere et al. and offers good performance with low communication overhead. We analyze the performance of the algorithm on four tasks: mutagenicity prediction (a standard ILP benchmark), information extraction from biological texts, context-sensitive spelling correction, and morphological disambiguation of Czech. The results show that the algorithm can generate more patterns than the serial algorithm RAP in the same overall time.
Published: 2012-01-26
Classification: frequent patterns; inductive logic programming; parallel and distributed data mining; propositionalization
@article{cai314,
     author = {Jan Bla\v t\'ak and Lubo\v s Popel\'\i nsk\'y},
     title = {dRAP-Independent: A Data Distribution Algorithm for Mining First-Order Frequent Patterns},
     journal = {Computing and Informatics},
     volume = {28},
     number = {1},
     year = {2012},
     language = {en},
     url = {http://dml.mathdoc.fr/item/cai314}
}
Jan Blaťák; Luboš Popelínský. dRAP-Independent: A Data Distribution Algorithm for Mining First-Order Frequent Patterns. Computing and Informatics, Volume 28 (2012), no. 1. http://gdmltest.u-ga.fr/item/cai314/