Proceedings:
Vol. 10 No. 2 (2016): The Workshops of the Tenth International AAAI Conference on Web and Social Media
Volume
Issue:
Vol. 10 No. 2 (2016): The Workshops of the Tenth International AAAI Conference on Web and Social Media
Track:
Wiki
Downloads:
Abstract:
Wikipedia is one of the most popular sources of free data on the Internet and subject to extensive use in numerous areas of research. Wikidata on the other hand, the knowledge base behind Wikipedia, is less popular as a source of data, despite having the "data" already in its name, and despite the fact that many applications in Natural Language Processing in general and Information Extraction in particular benefit immensely from the integration of knowledge bases. In part, this imbalance is owed to the younger age of Wikidata, which launched over a decade after Wikipedia. However, this is also owed to challenges posed by the still evolving properties of Wikidata that make its content more difficult to consume for third parties than is desirable. In this article, we analzye the causes of these challenges from the viewpoint of a data consumer and discuss possible avenues of research and advancement that both the scientific and the Wikidata community can collaborate on to turn the knowledge base into the invaluable asset that it is uniquely positioned to become.
DOI:
10.1609/icwsm.v10i2.14832
ICWSM
Vol. 10 No. 2 (2016): The Workshops of the Tenth International AAAI Conference on Web and Social Media