| DATE: | Monday, May 30, 2011 |
| TIME: | 3:30 pm |
| PLACE: | Council Room (SITE 5-084) |
| TITLE: | Transductive Learning over Automatically Detected Themes for Multi-Document Summarization |
| PRESENTER: | Massih-Reza Amini NRC |
| ABSTRACT: We propose a new method for query-biased multi-document summarization, based on sentence extraction. The summary of multiple documents is created in two steps. Sentences are first clustered; where each cluster corresponds to one of the main themes present in the collection. Inside each theme, sentences are then ranked using a transductive learning-to-rank algorithm based on RankNet, in order to better identify those which are relevant to the query. The final summary contains the top-ranked sentences of each theme. Our approach is validated on DUC 2006 and DUC 2007 datasets. | |