Relational Context Learning for Human-Object Interaction Detection

  • Abstract
  • Method
    • The overall architecture of MUREN
    • Multiplex Relation Embedding Module (MURE)
    • Attentive Fusion
  • Comment

Paper Link
Code Link

Abstract

Most one-stage methods for HOI detection build on transformer architectures with two decoder branches, one for human-object pair detection and the other for interaction classification. Such disentangled transformers, however, may suffer from insufficient context exchange between the branches, leaving them without the contextual information needed for relational reasoning. This work proposes the multiplex relation network (MUREN), which performs rich context exchange across three decoder branches using unary, pairwise, and ternary relations of human, object, and interaction tokens.

Method

Existing transformer-based methods for HOI detection can be roughly divided into two types: single-branch and two-branch.

  • The single-branch methods update a token set through a single transformer decoder and detect HOI instances directly with subsequent FFNs. Since one decoder is responsible for all sub-tasks (i.e., human detection, object detection, and interaction classification), these methods are limited in adapting to the different sub-tasks simultaneously under multi-task learning.
  • The two-branch methods adopt two separate transformer decoder branches: one detects human-object pairs from a human-object token set, while the other classifies the interaction class of each human-object pair from an interaction token set. However, insufficient context exchange between the branches prevents two-branch methods from learning relational context (see the sketch below).
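To make the limitation concrete, here is a minimal PyTorch sketch of such a two-branch decoder. All sizes, layer counts, and variable names are illustrative assumptions, not the paper's implementation; the point is only that each branch cross-attends to the image tokens and never to the other branch's tokens.

```python
import torch
import torch.nn as nn

d_model, n_queries = 256, 100

# Two independent decoder branches (hyperparameters are illustrative).
ho_decoder = nn.TransformerDecoder(
    nn.TransformerDecoderLayer(d_model, nhead=8, batch_first=True), num_layers=6)
int_decoder = nn.TransformerDecoder(
    nn.TransformerDecoderLayer(d_model, nhead=8, batch_first=True), num_layers=6)

image_tokens = torch.randn(1, 1024, d_model)     # encoder output (flattened H*W)
ho_tokens = torch.randn(1, n_queries, d_model)   # human-object queries
int_tokens = torch.randn(1, n_queries, d_model)  # interaction queries

# Each branch attends only to the image tokens; neither branch sees the
# other's tokens -- the "insufficient context exchange" described above.
ho_out = ho_decoder(ho_tokens, image_tokens)
int_out = int_decoder(int_tokens, image_tokens)
```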

The overall architecture of MUREN

The MUltiplex RElation Network (MUREN) adopts three decoder branches, one for each of three sub-tasks: human detection, object detection, and interaction classification:

  • First, the input image is fed into the CNN backbone followed by the transformer encoder to extract the image tokens.
  • A transformer decoder layer in each branch extracts the task-specific tokens for its sub-task.
  • The MURE takes the task-specific tokens as input and generates the multiplex relation context for relational reasoning.
  • The attentive fusion module propagates the multiplex relation context to each sub-task for context exchange.
  • The outputs at the last layer of each branch are fed into FFNs to predict the HOI instances (a structural sketch of the pipeline follows).
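As a structural sketch of this pipeline, assuming a strided conv as a stand-in backbone and standard PyTorch encoder/decoder layers (the class name `MURENSketch`, all hyperparameters, and the module choices are hypothetical; the per-layer MURE and attentive fusion steps are left as a placeholder comment):

```python
import torch
import torch.nn as nn

class MURENSketch(nn.Module):
    def __init__(self, d_model=256, n_queries=100, n_layers=6):
        super().__init__()
        # Stand-in backbone: a strided conv playing the role of the CNN.
        self.backbone = nn.Conv2d(3, d_model, kernel_size=16, stride=16)
        self.encoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True), 6)
        def layer():
            return nn.TransformerDecoderLayer(d_model, nhead=8, batch_first=True)
        # One decoder layer per branch per stage: human / object / interaction.
        self.branches = nn.ModuleList(
            nn.ModuleDict({"h": layer(), "o": layer(), "i": layer()})
            for _ in range(n_layers))
        self.queries = nn.Parameter(torch.randn(3, n_queries, d_model))

    def forward(self, images):
        feats = self.backbone(images).flatten(2).transpose(1, 2)  # image tokens
        mem = self.encoder(feats)
        b = images.size(0)
        h, o, i = (q.expand(b, -1, -1) for q in self.queries)
        for stage in self.branches:
            h = stage["h"](h, mem)  # human-specific tokens
            o = stage["o"](o, mem)  # object-specific tokens
            i = stage["i"](i, mem)  # interaction-specific tokens
            # MURE + attentive fusion would inject relational context back
            # into (h, o, i) at this point; omitted in this sketch.
        return h, o, i  # fed to FFN heads for boxes, classes, interactions
```

The design point the sketch captures is that three task-specific token sets stay alive in parallel across every stage, so relational context can be exchanged layer by layer rather than only after the final layer.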

Multiplex Relation Embedding Module (MURE)

Since the task-specific tokens are generated from separate branches, they lack relational context information. To mitigate this issue, the multiplex relation embedding module (MURE) generates a multiplex relation context for relational reasoning. The multiplex relation context comprises the unary, pairwise, and ternary relation contexts, so that the useful information in each relation context can be exploited.

MURE takes the task-specific tokens of the i-th layer and the image tokens as input, and embeds the unary ($[f_i^H; f_i^O; f_i^I]$) and pairwise ($[f_i^{HO}; f_i^{HI}; f_i^{OI}]$) relation contexts into the ternary relation context. The multiplex relation context, the output of MURE, is fed into the subsequent attentive fusion module for context exchange.
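A minimal sketch of this construction, assuming simple linear projections for the pairwise and ternary embeddings and cross-attention to the image tokens (the paper's actual embedding mechanism may differ; `MURESketch` and its layers are hypothetical):

```python
import torch
import torch.nn as nn

class MURESketch(nn.Module):
    def __init__(self, d=256):
        super().__init__()
        # Pairwise embeddings f^{HO}, f^{HI}, f^{OI} from token concatenations.
        self.pair = nn.ModuleDict({k: nn.Linear(2 * d, d) for k in ("HO", "HI", "OI")})
        # Ternary embedding over the three pairwise contexts.
        self.ternary = nn.Linear(3 * d, d)
        # The text says MURE also consumes the image tokens; letting the
        # ternary context cross-attend to them is an assumption here.
        self.ctx_attn = nn.MultiheadAttention(d, num_heads=8, batch_first=True)

    def forward(self, f_h, f_o, f_i, image_tokens):
        # Unary relation contexts [f^H; f^O; f^I]: the tokens themselves.
        f_ho = self.pair["HO"](torch.cat([f_h, f_o], dim=-1))
        f_hi = self.pair["HI"](torch.cat([f_h, f_i], dim=-1))
        f_oi = self.pair["OI"](torch.cat([f_o, f_i], dim=-1))
        ternary = self.ternary(torch.cat([f_ho, f_hi, f_oi], dim=-1))
        ternary, _ = self.ctx_attn(ternary, image_tokens, image_tokens)
        # Multiplex relation context: unary + pairwise + ternary, handed to
        # the attentive fusion module for context exchange.
        return (f_h, f_o, f_i), (f_ho, f_hi, f_oi), ternary
```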

Attentive Fusion

TODO

Comment

  1. The context exchange between the branches is important for the effectiveness of MUREN.
  2. MUREN is limited to a single human per HOI instance, even in multi-person scenarios.
