Ontea: Platform for Pattern Based Automated Semantic Annotation
Michal Laclavík ; Martin Šeleng ; Marek Ciglan ; Ladislav Hluchý
Computing and Informatics, Volume 28 (2012) no. 1, p. 555-579 / Harvested from Computing and Informatics
Automated annotation of web documents is a key challenge of the Semantic Web effort. Semantic metadata can be created manually or with automated annotation or tagging tools. The automated semantic annotation tools with the best results are built on various machine learning algorithms, which require training sets. Another approach is to use pattern-based semantic annotation solutions built on natural language processing, information retrieval, or information extraction methods. This paper presents the Ontea platform for automated semantic annotation, or semantic tagging. An implementation based on regular expression patterns is presented together with an evaluation of its results, along with an extensible architecture for integrating pattern-based approaches. Most existing semi-automatic annotation solutions cannot demonstrate real use on large-scale data such as web or email communication, yet the Semantic Web can be exploited only once computer-understandable metadata reaches critical mass. We therefore also present an approach to large-scale pattern-based annotation.
Published: 2012-01-26
@article{cai49,
     author = {Michal Laclav\'\i k and Martin \v Seleng and Marek Ciglan and Ladislav Hluch\'y},
     title = {Ontea: Platform for Pattern Based Automated Semantic Annotation},
     journal = {Computing and Informatics},
     volume = {28},
     number = {1},
     year = {2012},
     pages = {555--579},
     language = {en},
     url = {http://dml.mathdoc.fr/item/cai49}
}
Michal Laclavík; Martin Šeleng; Marek Ciglan; Ladislav Hluchý. Ontea: Platform for Pattern Based Automated Semantic Annotation. Computing and Informatics, Volume 28 (2012) no. 1, pp. 555-579. http://gdmltest.u-ga.fr/item/cai49/