Selecting Article Segment Titles Based on Keyphrase Features and Semantic Relatedness

Yuming Guo, Mizuho Iwaihara

研究成果: Conference contribution

抄録

Nowadays people can find almost all kinds of information they want from the Internet. However, in most cases, users are not willing to find their target among segment among long paragraphs, by spending much time browsing texts. Existing work on topic labeling works effectively and performs well on document categorization, but inadequate for granularity of detailed contents. Thus we propose a method for selecting titles for segments in long documents. We analyze the characteristics of high quality titles for article segments, from the aspect of semantic relatedness between the target segment and related articles as well as other segments. Then we revise three features proposed before. We improve the phraseness feature, for giving appropriate scores for long titles. Meanwhile, we combine the features SimPF and Embedding-vector to enhance the efficiency and rationality. We use Wikipedia articles for experimental evaluations, in which a large number of article segments are titled manually, and a great number of articles lack detailed segment titles. We evaluate scoring functions by where hidden original segment titles are ranked, through precision@K. Through rigorous evaluations, we show an optimum combination of the features.

本文言語English
ホスト出版物のタイトルProceedings - 2018 7th International Congress on Advanced Applied Informatics, IIAI-AAI 2018
出版社Institute of Electrical and Electronics Engineers Inc.
ページ129-132
ページ数4
ISBN(電子版)9781538674475
DOI
出版ステータスPublished - 2019 4月 16
イベント7th International Congress on Advanced Applied Informatics, IIAI-AAI 2018 - Yonago, Japan
継続期間: 2018 7月 82018 7月 13

出版物シリーズ

名前Proceedings - 2018 7th International Congress on Advanced Applied Informatics, IIAI-AAI 2018

Conference

Conference7th International Congress on Advanced Applied Informatics, IIAI-AAI 2018
国/地域Japan
CityYonago
Period18/7/818/7/13

ASJC Scopus subject areas

  • コンピュータ ネットワークおよび通信
  • 通信
  • 情報システム
  • 情報システムおよび情報管理
  • 教育

フィンガープリント

「Selecting Article Segment Titles Based on Keyphrase Features and Semantic Relatedness」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル