Query-biased summarization considering difference of paragraphs

Chikara Otani, Yasushi Oda, Osamu Yoshie

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Most conventional query-biased summarization methods generate the summary using extracted sentences based on similarity measure between all sentences in a document and the query. If there are plural sentences having high similarity to the query in the document, these methods cannot decide the sentence which the summary should be from. This paper proposes an algorithm adopting new indicator that shows the difference between one paragraph and the others. In a word space which is composed of all words in the target document, the algorithm determines the axis that maximizes the difference when a paragraph and the others are projected onto it. There are many combinations of a paragraph and a set of other paragraphs. For each combination, the above-mentioned axis that maximizes the difference and gives a conformity degree to the given query is calculated. With these conformities, the algorithm decides one paragraph for generating the summary. To obtain the axis, topic distinctiveness factor analysis is applied. The basic idea for making final summary is concatenating the sentences extracted from the paragraph. The resultant summary is evaluated from the following points of view: readability, understandability and the easiness to judge whether the link works well or not.

Original languageEnglish
Title of host publicationiiWAS2010 - 12th International Conference on Information Integration and Web-Based Applications and Services
Pages535-541
Number of pages7
DOIs
Publication statusPublished - 2010
Event12th International Conference on Information Integration and Web-Based Applications and Services, iiWAS2010 - Paris
Duration: 2010 Nov 82010 Nov 10

Other

Other12th International Conference on Information Integration and Web-Based Applications and Services, iiWAS2010
CityParis
Period10/11/810/11/10

Fingerprint

Factor analysis

Keywords

  • Information search
  • Query-biased summarization
  • Topic distinctiveness factor analysis

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Information Systems

Cite this

Otani, C., Oda, Y., & Yoshie, O. (2010). Query-biased summarization considering difference of paragraphs. In iiWAS2010 - 12th International Conference on Information Integration and Web-Based Applications and Services (pp. 535-541) https://doi.org/10.1145/1967486.1967569

Query-biased summarization considering difference of paragraphs. / Otani, Chikara; Oda, Yasushi; Yoshie, Osamu.

iiWAS2010 - 12th International Conference on Information Integration and Web-Based Applications and Services. 2010. p. 535-541.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Otani, C, Oda, Y & Yoshie, O 2010, Query-biased summarization considering difference of paragraphs. in iiWAS2010 - 12th International Conference on Information Integration and Web-Based Applications and Services. pp. 535-541, 12th International Conference on Information Integration and Web-Based Applications and Services, iiWAS2010, Paris, 10/11/8. https://doi.org/10.1145/1967486.1967569
Otani C, Oda Y, Yoshie O. Query-biased summarization considering difference of paragraphs. In iiWAS2010 - 12th International Conference on Information Integration and Web-Based Applications and Services. 2010. p. 535-541 https://doi.org/10.1145/1967486.1967569
Otani, Chikara ; Oda, Yasushi ; Yoshie, Osamu. / Query-biased summarization considering difference of paragraphs. iiWAS2010 - 12th International Conference on Information Integration and Web-Based Applications and Services. 2010. pp. 535-541
@inproceedings{baba41cee8b54a93b5bf86db6b05a9cf,
title = "Query-biased summarization considering difference of paragraphs",
abstract = "Most conventional query-biased summarization methods generate the summary using extracted sentences based on similarity measure between all sentences in a document and the query. If there are plural sentences having high similarity to the query in the document, these methods cannot decide the sentence which the summary should be from. This paper proposes an algorithm adopting new indicator that shows the difference between one paragraph and the others. In a word space which is composed of all words in the target document, the algorithm determines the axis that maximizes the difference when a paragraph and the others are projected onto it. There are many combinations of a paragraph and a set of other paragraphs. For each combination, the above-mentioned axis that maximizes the difference and gives a conformity degree to the given query is calculated. With these conformities, the algorithm decides one paragraph for generating the summary. To obtain the axis, topic distinctiveness factor analysis is applied. The basic idea for making final summary is concatenating the sentences extracted from the paragraph. The resultant summary is evaluated from the following points of view: readability, understandability and the easiness to judge whether the link works well or not.",
keywords = "Information search, Query-biased summarization, Topic distinctiveness factor analysis",
author = "Chikara Otani and Yasushi Oda and Osamu Yoshie",
year = "2010",
doi = "10.1145/1967486.1967569",
language = "English",
isbn = "9781450304214",
pages = "535--541",
booktitle = "iiWAS2010 - 12th International Conference on Information Integration and Web-Based Applications and Services",

}

TY - GEN

T1 - Query-biased summarization considering difference of paragraphs

AU - Otani, Chikara

AU - Oda, Yasushi

AU - Yoshie, Osamu

PY - 2010

Y1 - 2010

N2 - Most conventional query-biased summarization methods generate the summary using extracted sentences based on similarity measure between all sentences in a document and the query. If there are plural sentences having high similarity to the query in the document, these methods cannot decide the sentence which the summary should be from. This paper proposes an algorithm adopting new indicator that shows the difference between one paragraph and the others. In a word space which is composed of all words in the target document, the algorithm determines the axis that maximizes the difference when a paragraph and the others are projected onto it. There are many combinations of a paragraph and a set of other paragraphs. For each combination, the above-mentioned axis that maximizes the difference and gives a conformity degree to the given query is calculated. With these conformities, the algorithm decides one paragraph for generating the summary. To obtain the axis, topic distinctiveness factor analysis is applied. The basic idea for making final summary is concatenating the sentences extracted from the paragraph. The resultant summary is evaluated from the following points of view: readability, understandability and the easiness to judge whether the link works well or not.

AB - Most conventional query-biased summarization methods generate the summary using extracted sentences based on similarity measure between all sentences in a document and the query. If there are plural sentences having high similarity to the query in the document, these methods cannot decide the sentence which the summary should be from. This paper proposes an algorithm adopting new indicator that shows the difference between one paragraph and the others. In a word space which is composed of all words in the target document, the algorithm determines the axis that maximizes the difference when a paragraph and the others are projected onto it. There are many combinations of a paragraph and a set of other paragraphs. For each combination, the above-mentioned axis that maximizes the difference and gives a conformity degree to the given query is calculated. With these conformities, the algorithm decides one paragraph for generating the summary. To obtain the axis, topic distinctiveness factor analysis is applied. The basic idea for making final summary is concatenating the sentences extracted from the paragraph. The resultant summary is evaluated from the following points of view: readability, understandability and the easiness to judge whether the link works well or not.

KW - Information search

KW - Query-biased summarization

KW - Topic distinctiveness factor analysis

UR - http://www.scopus.com/inward/record.url?scp=79956001874&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79956001874&partnerID=8YFLogxK

U2 - 10.1145/1967486.1967569

DO - 10.1145/1967486.1967569

M3 - Conference contribution

SN - 9781450304214

SP - 535

EP - 541

BT - iiWAS2010 - 12th International Conference on Information Integration and Web-Based Applications and Services

ER -