In this article, we investigate the problem of cross-document person name disambiguation, which aims to resolve ambiguities between person names and to cluster web documents according to which of the persons sharing the same name they refer to. Most previous work formulates cross-document name disambiguation as a clustering problem. These methods employ various syntactic and semantic features, drawn either from the local corpus or from distant knowledge bases, to compute similarities between entities and group similar entities. However, these approaches show limitations in robustness and performance. We propose an unsupervised, graph-based name disambiguation approach that aims to improve on the performance and robustness of the state of the art. Our approach exploits both local information extracted from the given corpus and global information obtained from distant knowledge bases. We show the effectiveness of our approach by testing it on the standard WePS datasets. The experimental results are encouraging: our proposed method outperforms several baseline methods as well as its counterparts, and it improves not only the performance but also the robustness of name disambiguation.
Published: 2019-02-05
Classification:
Knowledge and Information Engineering; other areas of Computing and Informatics,
Web mining, cross-document name disambiguation, social links, profile enrichment, clustering,
97R40, 97R50, 68T50, 68U35, 90B40
@article{cai2018_6_1485,
author = {Hojjat Emami and Hossein Shirazi and Ahmad Abdollahzadeh Barforoush},
title = {Web Person Name Disambiguation Using Social Links and Enriched Profile Information},
journal = {Computing and Informatics},
volume = {37},
number = {6},
year = {2019},
language = {en},
url = {http://dml.mathdoc.fr/item/cai2018_6_1485}
}
Hojjat Emami; Social Network and Intelligent Systems Laboratory, Faculty of Artificial Intelligence, Malek-Ashtar University of Technology, Tehran; Hossein Shirazi; Social Network and Intelligent Systems Laboratory, Faculty of Artificial Intelligence, Malek-Ashtar University of Technology, Tehran; Ahmad Abdollahzadeh Barforoush; Intelligent Systems Laboratory, Computer Engineering and IT Department, Amir Kabir University of Technology, Tehran. Web Person Name Disambiguation Using Social Links and Enriched Profile Information. Computing and Informatics, Vol. 37 (2019), no. 6. http://gdmltest.u-ga.fr/item/cai2018_6_1485/