Two-encoder pointer-generator network for summarizing segments of long articles

Junhao Li, Mizuho Iwaihara

Research output: Conference contribution

Abstract

Long documents usually contain many sections and segments. In Wikipedia, an article can usually be divided into sections, and each section into segments. Even so, a single segment can still be too long to read quickly, so we argue that each segment should have a short summary that gives readers a quick overview. This paper discusses applying neural summarization models, including the Seq2Seq model and the pointer-generator network, to segment summarization. These models can take the target segment as their only input. In our setting, however, the remaining segments of the same article are very likely to contain descriptions related to the target segment. We therefore propose several ways to extract an additional sequence from the whole article and combine it with the target segment as the input for summarization, and we compare the results against the original models without additional sequences. Furthermore, we propose a new model that uses two encoders to process the target segment and the additional sequence separately. Our results show that the two-encoder model outperforms the original models in terms of ROUGE and METEOR scores.
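The abstract does not say how the additional sequence is extracted, so the following is only a minimal sketch under stated assumptions: sentences from the other segments are ranked by token overlap with the target segment, and the top few are kept. The function names, the overlap score, and the `[SEP]` separator are all hypothetical and are not taken from the paper. The sketch shows the two input shapes the abstract contrasts: one concatenated sequence for a single-encoder model, and a (target, additional) pair for a two-encoder model.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def overlap_score(sentence, target_counts):
    """Fraction of the sentence's tokens that also occur in the target segment."""
    tokens = tokenize(sentence)
    if not tokens:
        return 0.0
    return sum(1 for t in tokens if t in target_counts) / len(tokens)


def extract_additional_sequence(target, other_segments, top_k=2):
    """Assumed heuristic: keep the top_k sentences most similar to the target."""
    target_counts = Counter(tokenize(target))
    sentences = []
    for seg in other_segments:
        # Naive sentence split on whitespace that follows ., ! or ?
        parts = re.split(r"(?<=[.!?])\s+", seg)
        sentences.extend(p.strip() for p in parts if p.strip())
    ranked = sorted(
        sentences,
        key=lambda s: overlap_score(s, target_counts),
        reverse=True,
    )
    return " ".join(ranked[:top_k])


def build_inputs(target, other_segments, top_k=2, sep=" [SEP] "):
    """Return the single-encoder input string and the two-encoder input pair."""
    additional = extract_additional_sequence(target, other_segments, top_k)
    single_encoder_input = target + sep + additional  # one combined sequence
    two_encoder_input = (target, additional)          # one sequence per encoder
    return single_encoder_input, two_encoder_input


if __name__ == "__main__":
    target = "The pointer generator network copies words from the source."
    others = [
        "Pointer networks copy words from the source text. The weather was sunny.",
        "Bananas are yellow.",
    ]
    single, pair = build_inputs(target, others, top_k=1)
    print(single)
    print(pair)
```

In a real system the overlap heuristic would be replaced by whichever extraction method is being evaluated; the point of the sketch is only the difference between feeding one merged sequence and feeding two separate sequences.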

Original language: English
Title of host publication: Web and Big Data - 3rd International Joint Conference, APWeb-WAIM 2019, Proceedings
Editors: Jie Shao, Man Lung Yiu, Masashi Toyoda, Dongxiang Zhang, Wei Wang, Bin Cui
Publisher: Springer Verlag
Pages: 299-313
Number of pages: 15
ISBN (Print): 9783030260712
DOI: https://doi.org/10.1007/978-3-030-26072-9_23
Publication status: Published - 1 Jan 2019
Event: 3rd APWeb and WAIM Joint Conference on Web and Big Data, APWeb-WAIM 2019 - Chengdu, China
Duration: 1 Aug 2019 to 3 Aug 2019

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 11641 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 3rd APWeb and WAIM Joint Conference on Web and Big Data, APWeb-WAIM 2019
China
Chengdu
Period: 1 Aug 2019 to 3 Aug 2019

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science (all)

  • Cite this

    Li, J., & Iwaihara, M. (2019). Two-encoder pointer-generator network for summarizing segments of long articles. In J. Shao, M. L. Yiu, M. Toyoda, D. Zhang, W. Wang, & B. Cui (Eds.), Web and Big Data - 3rd International Joint Conference, APWeb-WAIM 2019, Proceedings (pp. 299-313). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11641 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-030-26072-9_23