Automatic gloss finding for a knowledge base using ontological constraints

Bhavana Dalvi, Einat Minkov, Partha P. Talukdar, William W. Cohen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

While there has been much research on automatically constructing structured Knowledge Bases (KBs), most of it has focused on generating facts to populate a KB. However, a useful KB must go beyond facts. For example, glosses (short natural language definitions) have been found to be very useful in tasks such as Word Sense Disambiguation. However, the important problem of Automatic Gloss Finding, i.e., assigning glosses to entities in an initially gloss-free KB, is relatively unexplored. We address that gap in this paper. In particular, we propose GLOFIN, a hierarchical semi-supervised learning algorithm for this problem which makes effective use of limited amounts of supervision and available ontological constraints. To the best of our knowledge, GLOFIN is the first system for this task. Through extensive experiments on real-world datasets, we demonstrate GLOFIN's effectiveness. It is encouraging to see that GLOFIN outperforms other state-of-the-art SSL algorithms, especially in low supervision settings. We also demonstrate GLOFIN's robustness to noise through experiments on a wide variety of KBs, ranging from user contributed (e.g., Freebase) to automatically constructed (e.g., NELL). To facilitate further research in this area, we have made the datasets and code used in this paper publicly available.

Original languageEnglish
Title of host publicationWSDM 2015 - Proceedings of the 8th ACM International Conference on Web Search and Data Mining
PublisherAssociation for Computing Machinery
Pages369-378
Number of pages10
ISBN (Electronic)9781450333177
DOIs
StatePublished - 2 Feb 2015
Event8th ACM International Conference on Web Search and Data Mining, WSDM 2015 - Shanghai, China
Duration: 31 Jan 20156 Feb 2015

Publication series

NameWSDM 2015 - Proceedings of the 8th ACM International Conference on Web Search and Data Mining

Conference

Conference8th ACM International Conference on Web Search and Data Mining, WSDM 2015
Country/TerritoryChina
CityShanghai
Period31/01/156/02/15

Bibliographical note

Publisher Copyright:
Copyright © 2015 ACM.

Keywords

  • Gloss finding
  • Hierarchical learning
  • Web mining.

ASJC Scopus subject areas

  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Automatic gloss finding for a knowledge base using ontological constraints'. Together they form a unique fingerprint.

Cite this