Extraction of Biographical Data from Wikipedia

Extraction of Biographical Data from Wikipedia

Robert Viseur, Extraction of Biographical Data from Wikipedia, Proceedings of International Conference on Data Technologies and Applications 2013, Iceland, July 29-31, 2013.

Date: 29 juillet 2013

Publication: Communication scientifique 

Expertises:

Science des données 

Domaine: Média 

A propos du projet: CE-IQS 

Abstract

Using the content of Wikipedia articles is common in academic research. However the practicalities are rarely analysed. Our research focuses on extracting biographical information about personalities from Belgium. Our research is divided into three sections. The first section describes the state of the art for data extraction from Wikipedia. A second section presents the case study about data extraction for biographies of Belgian personalities. Different solutions are discussed and the solution adopted is implemented. In the third section, the quality of the extraction is discussed. Practical recommendations for researchers wishing to use Wikipedia are also proposed on the basis of our case study.