Dependence of Binary Associations on Co-occurrence Granularity in News Documents

Krishnaprasad Thirunarayan, Trivikram Immaneni, Mastan Vali Shaik

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We describe and formalize an approach to correlate binary associations (such as between entities and events, between persons and events, etc.) implied by News documents on the co-occurrence granularity (such as document-level, paragraph-level, sentence-level, etc.) of the corresponding text phrases in the documents. Specifically, we present both qualitative and quantitative characterization of searching News documents: former in terms of the nature of the content and the queries, and latter in terms of a metric obtained by adapting the notions of precision and recall. Specifically, the approach tries to reduce the manual effort required to analyze the News documents to compare the three alternatives for granularity of co-occurrence. Furthermore, the analysis suggests ways to improve retrieval performance as illustrated by applying our findings to News documents for the year 2005.
Original languageEnglish
Title of host publicationProceedings of the 2008 International Conference on Information and Knowledge Engineering, IKE 2008
PublisherCSREA Press
Pages193-198
Number of pages6
ISBN (Print)1601320752, 9781601320759
StatePublished - 2008
Event2008 International Conference on Information and Knowledge Engineering, IKE 2008 - Las Vegas, NV, United States
Duration: Jul 14 2008Jul 17 2008

Publication series

NameProceedings of the 2008 International Conference on Information and Knowledge Engineering, IKE 2008

Conference

Conference2008 International Conference on Information and Knowledge Engineering, IKE 2008
Country/TerritoryUnited States
CityLas Vegas, NV
Period7/14/087/17/08

ASJC Scopus Subject Areas

  • Software
  • Artificial Intelligence
  • Computer Science Applications
  • Information Systems

Keywords

  • Binary associations
  • Co-occurrence granularity
  • Index and search
  • News documents
  • Precision and recall
  • Timelines

Disciplines

  • Cataloging and Metadata
  • Databases and Information Systems

Cite this