Title: Viewpoint and Topic Modeling of Current Events

Abstract: There are multiple sides to every story, and while statistical topic models have been highly successful at topically summarizing the stories in corpora of text documents, they do not explicitly address the issue of learning the different sides, the viewpoints, expressed in the documents. In this paper, we show how these viewpoints can be learned in a completely unsupervised manner and represented in a human-interpretable form. We use a novel approach of applying CorrLDA2 for this purpose, which learns topic-viewpoint relations that can be used to form groups of topics, where each group represents a viewpoint. A corpus of documents about the Israeli-Palestinian conflict is then used to demonstrate how a Palestinian and an Israeli viewpoint can be learned. By leveraging the magnitudes and signs of the feature weights of a linear SVM, we introduce a principled method to evaluate associations between topics and viewpoints. With this, we demonstrate, both quantitatively and qualitatively, that the learned topic groups are contextually coherent and form consistently correct topic-viewpoint associations.
Comments: 16 pages, 4 figures, 4 tables
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (stat.ML)
Cite as: arXiv:1608.04089 [cs.CL]
  (or arXiv:1608.04089v1 [cs.CL] for this version)

Submission history

From: Kerry Zhang
[v1] Sun, 14 Aug 2016 11:36:52 UTC (38 KB)
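
Illustrative sketch: the abstract mentions using the signs and magnitudes of linear SVM feature weights to evaluate topic-viewpoint associations, but does not spell out the procedure. The following is a minimal sketch of that general idea, not the paper's actual method. It assumes each document is represented by its topic proportions and carries one of two viewpoint labels. The toy data, the labels "viewpoint_A" and "viewpoint_B", and the helper names train_linear_svm and topic_associations are all hypothetical, and the SVM is a simple bias-free one fitted by batch subgradient descent rather than any particular library implementation.

```python
# Hypothetical toy data: each row holds one document's proportions over
# four made-up topics; labels are +1 / -1 for two hypothetical viewpoints.
DOC_TOPICS = [
    [0.6, 0.3, 0.0, 0.1],
    [0.5, 0.3, 0.1, 0.1],
    [0.7, 0.1, 0.1, 0.1],
    [0.1, 0.0, 0.8, 0.1],
    [0.0, 0.1, 0.8, 0.1],
    [0.1, 0.1, 0.7, 0.1],
]
LABELS = [1, 1, 1, -1, -1, -1]


def train_linear_svm(X, y, lam=0.01, lr=0.1, steps=2000):
    """Fit a bias-free linear SVM by batch subgradient descent on the
    L2-regularised hinge loss. Returns one weight per feature (topic)."""
    n, d = len(X), len(X[0])
    w = [0.0] * d
    for _ in range(steps):
        grad = [lam * wj for wj in w]  # gradient of the L2 penalty
        for xi, yi in zip(X, y):
            margin = yi * sum(wj * xj for wj, xj in zip(w, xi))
            if margin < 1:  # hinge loss is active for this document
                for j in range(d):
                    grad[j] -= yi * xi[j] / n
        w = [wj - lr * gj for wj, gj in zip(w, grad)]
    return w


def topic_associations(w, pos_label, neg_label):
    """Return (topic index, viewpoint, strength) tuples, strongest first.
    The weight's sign picks the viewpoint; its magnitude is the strength.
    A weight of exactly zero falls back to neg_label in this sketch."""
    ranked = sorted(range(len(w)), key=lambda j: abs(w[j]), reverse=True)
    return [(j, pos_label if w[j] > 0 else neg_label, abs(w[j])) for j in ranked]


weights = train_linear_svm(DOC_TOPICS, LABELS)
associations = topic_associations(weights, "viewpoint_A", "viewpoint_B")
for topic, viewpoint, strength in associations:
    print(f"topic {topic}: {viewpoint} (|w| = {strength:.2f})")
```

In this toy setup, a topic that dominates documents labelled +1 receives a positive weight and one that dominates documents labelled -1 receives a negative weight, while a topic spread evenly across both groups carries little signal. How the paper turns such weights into a quantitative evaluation is described in the full text, not in this listing.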