group23IRE.github.io

Project Page - Group 23 .

Problem Statement

Creating short summaries of documents with respect to a query has applications in for example search engines, where it may help inform users of the most relevant results. Constructing such a summary automatically, with the potential expressiveness of a human-written summary, is a difficult problem yet to be fully solved.

Introduction

Applications

Dataset

The dataset is from Debatepedia an encyclopedia of pro and con arguments and quotes on critical debate topics. There are 663 debates in the corpus (only those debates are considered which have at least one query with one document). These 663 debates belong to 53 overlapping categories such as Politics, Law, Crime, Environment, Health, Morality, Religion, etc. A given topic can belong to more than one category. For example, the topic “Eye for an Eye philos

Scores and Results

rouge-1:  28.074
rouge-2:  2.183
rouge-L:  21.681

Challenges and limitations ==========================

Link to code base, dataset

References