Contextual search and name disambiguation in email using graphs

Einat Minkov, William W. Cohen, Andrew Y. Ng

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Similarity measures for text have historically been an important tool for solving information retrieval problems. In many interesting settings, however, documents are often closely connected to other documents, as well as other non-textual objects: for instance, email messages are connected to other messages via header information. In this paper we consider extended similarity metrics for documents and other objects embedded in graphs, facilitated via a lazy graph walk. We provide a detailed instantiation of this framework for email data, where content, social networks and a timeline are integrated in a structural graph. The suggested framework is evaluated for two email-related problems: disambiguating names in email documents, and threading. We show that reranking schemes based on the graph-walk similarity measures often outperform baseline methods, and that further improvements can be obtained by use of appropriate learning methods.
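
The abstract does not spell out the walk itself, so the following is only a minimal sketch of a generic lazy graph walk, not the authors' implementation. It assumes an undirected graph with uniform edge weights, a fixed number of steps, and a hypothetical `stay_prob` parameter for the probability of remaining at the current node. The function name `lazy_graph_walk` and the toy node labels are illustrative.

```python
from collections import defaultdict


def lazy_graph_walk(edges, start, steps=3, stay_prob=0.5):
    """Return a node -> probability map after `steps` lazy walk steps from `start`.

    Assumptions (not taken from the paper): edges are undirected and
    equally weighted, and the walk stays put with probability `stay_prob`.
    """
    # Build an undirected adjacency list from the edge pairs.
    neighbors = defaultdict(set)
    for a, b in edges:
        neighbors[a].add(b)
        neighbors[b].add(a)

    # All probability mass starts on the query node.
    dist = {start: 1.0}
    for _ in range(steps):
        nxt = defaultdict(float)
        for node, p in dist.items():
            out = neighbors[node]
            if not out:
                # An isolated node keeps all of its mass.
                nxt[node] += p
                continue
            # Lazy step: part of the mass stays, the rest spreads evenly.
            nxt[node] += stay_prob * p
            share = (1.0 - stay_prob) * p / len(out)
            for n in out:
                nxt[n] += share
        dist = dict(nxt)
    return dist


# Toy email graph: messages linked to people and terms (hypothetical labels).
toy_edges = [
    ("msg1", "alice"),
    ("msg1", "term:budget"),
    ("msg2", "alice"),
    ("msg2", "bob"),
]
scores = lazy_graph_walk(toy_edges, start="term:budget", steps=3)
# Rank the other nodes by the probability mass the walk assigns to them.
ranking = sorted(scores, key=scores.get, reverse=True)
```

Nodes that receive more probability mass are treated as more similar to the starting node, which is the sense in which such a walk can serve as a similarity measure for ranking.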

Original language: English
Title of host publication: Proceedings of the Twenty-Ninth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
Publisher: Association for Computing Machinery
Pages: 27-34
Number of pages: 8
ISBN (Print): 1595933697, 9781595933690
State: Published - 2006
Externally published: Yes
Event: 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - Seattle, WA, United States
Duration: 6 Aug 2006 → 11 Aug 2006

Publication series

Name: Proceedings of the Twenty-Ninth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
Volume: 2006

Conference

Conference: 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
Country/Territory: United States
City: Seattle, WA
Period: 6/08/06 → 11/08/06

Keywords

  • Email
  • Graph-based retrieval
  • Name disambiguation
  • Threading

ASJC Scopus subject areas

  • General Engineering
  • Information Systems
  • Software
  • Applied Mathematics
