Geometric Features Informed Multi-person Human-Object Interaction Recognition in Videos

Tanqiu Qiao, Qianhui Men, Frederick W.B. Li, Yoshiki Kubotani, Shigeo Morishima, Hubert P.H. Shum*

*この研究の対応する著者

研究成果: Conference contribution

抄録

Human-Object Interaction (HOI) recognition in videos is important for analyzing human activity. Most existing work focusing on visual features usually suffer from occlusion in the real-world scenarios. Such a problem will be further complicated when multiple people and objects are involved in HOIs. Consider that geometric features such as human pose and object position provide meaningful information to understand HOIs, we argue to combine the benefits of both visual and geometric features in HOI recognition, and propose a novel Two-level Geometric feature-informed Graph Convolutional Network (2G-GCN ). The geometric-level graph models the interdependency between geometric features of humans and objects, while the fusion-level graph further fuses them with visual features of humans and objects. To demonstrate the novelty and effectiveness of our method in challenging scenarios, we propose a new multi-person HOI dataset (MPHOI-72 ). Extensive experiments on MPHOI-72 (multi-person HOI), CAD-120 (single-human HOI) and Bimanual Actions (two-hand HOI) datasets demonstrate our superior performance compared to state-of-the-arts.

本文言語English
ホスト出版物のタイトルComputer Vision – ECCV 2022 - 17th European Conference, 2022, Proceedings
編集者Shai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner
出版社Springer Science and Business Media Deutschland GmbH
ページ474-491
ページ数18
ISBN(印刷版)9783031197710
DOI
出版ステータスPublished - 2022
イベント17th European Conference on Computer Vision, ECCV 2022 - Tel Aviv, Israel
継続期間: 2022 10月 232022 10月 27

出版物シリーズ

名前Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
13664 LNCS
ISSN(印刷版)0302-9743
ISSN(電子版)1611-3349

Conference

Conference17th European Conference on Computer Vision, ECCV 2022
国/地域Israel
CityTel Aviv
Period22/10/2322/10/27

ASJC Scopus subject areas

  • 理論的コンピュータサイエンス
  • コンピュータ サイエンス(全般)

フィンガープリント

「Geometric Features Informed Multi-person Human-Object Interaction Recognition in Videos」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル