Grounding in social media: An approach to building a chit-chat dialogue model

Ritvik Choudhary, Daisuke Kawahara

研究成果: Conference contribution

抄録

Building open-domain dialogue systems capable of rich human-like conversational ability is one of the fundamental challenges in language generation. However, even with recent advancements in the field, existing open-domain generative models fail to capture and utilize external knowledge, leading to repetitive or generic responses to unseen utterances. Current work on knowledge-grounded dialogue generation primarily focuses on persona incorporation or searching a fact-based structured knowledge source such as Wikipedia. Our method takes a broader and simpler approach, which aims to improve the raw conversation ability of the system by mimicking the human response behavior through casual interactions found on social media. Utilizing a joint retriever-generator setup, the model queries a large set of filtered comment data from Reddit to act as additional context for the seq2seq generator. Automatic and human evaluations on open-domain dialogue datasets demonstrate the effectiveness of our approach.

本文言語English
ホスト出版物のタイトルNAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics
ホスト出版物のサブタイトルHuman Language Technologies, Proceedings of the Student Research Workshop
出版社Association for Computational Linguistics (ACL)
ページ9-15
ページ数7
ISBN(電子版)9781955917735
出版ステータスPublished - 2022
イベントNAACL 2022 Student Research Workshop, SRW 2022, at 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2022 - Seattle, United States
継続期間: 2022 7月 102022 7月 15

出版物シリーズ

名前NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Student Research Workshop

Conference

ConferenceNAACL 2022 Student Research Workshop, SRW 2022, at 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2022
国/地域United States
CitySeattle
Period22/7/1022/7/15

ASJC Scopus subject areas

  • コンピュータ ネットワークおよび通信
  • ハードウェアとアーキテクチャ
  • 情報システム
  • ソフトウェア

フィンガープリント

「Grounding in social media: An approach to building a chit-chat dialogue model」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル