Multi-source named entity typing for social media

Reuth Vexler, Einat Minkov

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Typed lexicons that encode knowledge about the semantic types of an entity name, e.g., that 'Paris' denotes a geolocation, product, or person, have proven useful for many text processing tasks. While lexicons may be derived from large-scale knowledge bases (KBs), KBs are inherently imperfect; in particular, they lack coverage of long-tail entity names. We infer the types of a given entity name using multi-source learning, considering information obtained by alignment to the Freebase knowledge base, Web-scale distributional patterns, and global semi-structured contexts retrieved by means of Web search. Evaluation in the challenging domain of social media shows that multi-source learning improves performance compared with rule-based KB lookups, boosting typing results for some semantic categories.

Original language: English
Title of host publication: Proceedings of NEWS 2016
Subtitle of host publication: 6th Named Entity Workshop at the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016
Editors: Xiangyu Duan, Rafael E. Banchs, Min Zhang, Haizhou Li, A. Kumara
Publisher: Association for Computational Linguistics (ACL)
Pages: 11-20
Number of pages: 10
ISBN (Electronic): 9781945626166
State: Published - 2016
Event: 6th Named Entity Workshop, NEWS 2016 at the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Berlin, Germany
Duration: 12 Aug 2016 → …

Publication series

Name: Proceedings of NEWS 2016: 6th Named Entity Workshop at the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016

Conference

Conference: 6th Named Entity Workshop, NEWS 2016 at the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016
Country/Territory: Germany
City: Berlin
Period: 12/08/16 → …

Bibliographical note

Publisher Copyright:
© Proceedings of NEWS 2016: 6th Named Entity Workshop at the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016. All rights reserved.

ASJC Scopus subject areas

  • Computer Science Applications
  • Computational Theory and Mathematics
  • Software
