The Web has become a vast data source hidden behind linked documents. A significant number of Web documents include HTML tables generated dynamically from relational databases, yet the databases themselves are often not publicly accessible. RDF (Resource Description Framework), on the other hand, provides an efficient mechanism for representing data directly on the Web, based on a Web-scalable architecture for the identification and interpretation of terms. This leads to the concept of Linked Data on the Web. To allow direct access to data on the Web as Linked Data, we propose in this paper an approach for transforming HTML tables into RDF triples. It consists of three main phases: refining, pre-treatment and mapping. The whole process is assisted by a domain ontology and the WordNet lexical database. A tool called Htab2RDF has been implemented, and experiments have been carried out to evaluate the approach and demonstrate its efficiency.
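To make the idea of turning an HTML table into RDF triples concrete, here is a minimal Python sketch. It is not the Htab2RDF tool and does not reproduce the refining, pre-treatment and mapping phases named in the abstract. It assumes that the first table row holds the column headers and that every later row describes one resource, and it skips the domain ontology and WordNet matching entirely. The names `TableParser` and `table_to_triples` and the `http://example.org/` namespace are hypothetical, chosen only for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import quote


class TableParser(HTMLParser):
    """Collect the cell text of every <tr> in an HTML fragment."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell strings
        self._row = None   # row currently being read, or None
        self._cell = None  # text chunks of the current cell, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None  # discard any cell left unclosed


def table_to_triples(html, base="http://example.org/"):
    """Return N-Triples-style strings, one per non-empty data cell.

    Assumption (not from the paper): row 1 is the header, each later
    row is one subject, and each header becomes a predicate IRI.
    """
    parser = TableParser()
    parser.feed(html)
    parser.close()
    if not parser.rows:
        return []
    header, *records = parser.rows
    triples = []
    for index, record in enumerate(records, start=1):
        subject = f"<{base}row/{index}>"
        for column, value in zip(header, record):
            if not value:
                continue  # skip empty cells
            predicate = f"<{base}{quote(column.lower().replace(' ', '_'))}>"
            # Escape backslashes and quotes for a plain string literal.
            literal = '"' + value.replace("\\", "\\\\").replace('"', '\\"') + '"'
            triples.append(f"{subject} {predicate} {literal} .")
    return triples


sample = (
    "<table>"
    "<tr><th>Name</th><th>City</th></tr>"
    "<tr><td>Alice</td><td>Oran</td></tr>"
    "</table>"
)
for triple in table_to_triples(sample):
    print(triple)
```

In this sketch every value is emitted as a plain literal. An ontology-assisted approach such as the one the abstract describes would presumably map headers to ontology properties rather than to ad hoc IRIs, but the abstract does not give those details.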
Published: 2018-02-09
Classification: Knowledge and Information Engineering; Semantic Web; Linked Data engineering; HTML tables, RDF, relational databases, Linked Data, domain ontology, WordNet; 68N99
@article{cai2017_6_1467,
     author = {Djelloul Bouchiha and Mimoun Malki and Abdullah Alghamdi and Khalid Alnafjan},
     title = {Htab2RDF: Mapping HTML Tables to RDF Triples},
     journal = {Computing and Informatics},
     volume = {36},
     number = {6},
     year = {2018},
     language = {en},
     url = {http://dml.mathdoc.fr/item/cai2017_6_1467}
}
Djelloul Bouchiha (EEDIS Laboratory, Djillali Liabes University of Sidi Bel Abbes); Mimoun Malki (EEDIS Laboratory, Djillali Liabes University of Sidi Bel Abbes); Abdullah Alghamdi (College of Computer and Information Sciences, KSU, Riyadh); Khalid Alnafjan (College of Computer and Information Sciences, KSU, Riyadh). Htab2RDF: Mapping HTML Tables to RDF Triples. Computing and Informatics, Volume 36 (2018), no. 6. http://gdmltest.u-ga.fr/item/cai2017_6_1467/