PMFNet——Pose-aware Multi-level Feature Network for Human Object Interaction Detection - 代码天地

PMFNet——Pose-aware Multi-level Feature Network for Human Object Interaction Detection

其他 2020-10-15 07:05:33 阅读次数: 0

本文提出了一种新颖的人-物体交互检测模型，在多个数据集上该方法展现出大大优于现有最佳方法的性能。在人-物体交互检测任务中，人与物体交互方式的多样性及交互场景的复杂性，相比于传统的视觉任务存在更多挑战。研究人员提出了一种多层级（multi-level）的交互关系识别策略，包括交互域、物体、人体语义三个层级。

具体来说，本文提出了一种多分枝网络结构的模型，该模型利用人体姿态信息，通过基于注意力机制动态放大（Zoom-in）交互关系相关人体语义区域以增强该区域的特征，并在此基础上对全局特征进行融合，从而进一步提高模型对于人-物体交互的细粒度检测能力与健壮性。

人物交互模型结构总览，模型的主要输入为输入图片的特征图和人物交互关系的几何信息及人体的关键点。这两大信息将由Holistic model 和Zoom-in module在多层级上对特征进行处理和融合，最后对特征进行融合并给出预测。

朴素的想法：

对于一张图片先做目标检测，得到人体和物体所在区域，然后再提取①人②物③人∩物区域的特征，进行分类

但是作者觉得这样的做法只能得到整体的一些特征，模型不容易学到一些局部特征，于是我们就使用人体关键点来作为指导，关键点所在区域当成attention mask，这样可以得到更多的局部特征。

上述即为整体流程，backbone用来提取特征，在得到特征图的基础上预测人物框，人体关键点，然后将相应的特征送到需要的模块中，做分类，即可得到结果

创新点：

使用Pose作指导，起到attention map 的作用
Pose可以起到全局和局部指导的作用。

猜你喜欢

转载自blog.csdn.net/qq_41251963/article/details/108801685

PMFNet——Pose-aware Multi-level Feature Network for Human Object Interaction Detection

论文笔记之Pose-aware Multi-level Feature Network for Human Object Interaction Detection

iCAN: Instance-Centric Attention Network for Human-Object Interaction Detection 论文阅读笔记

M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network

M2Det:A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network 论文理解

【论文笔记】M2Det: A Single-Shot Object Detector Based on Multi-Level Feature Pyramid Network

Parallel Feature Pyramid Network for Object Detection

论文笔记之Learning Human-Object Interaction Detection using Interaction Points

论文笔记之Transferable Interactiveness Knowledge for Human-Object Interaction Detection

FPN（ Feature Pyramid Network for Object Detection）论文详读

M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid

论文笔记之PPDM（Parallel Point Detection and Matching for Real-time Human-Object Interaction Detection）

论文阅读--ssFPN: Scale Sequence (S2 ) Feature Based Feature Pyramid Network for Object Detection

论文阅记 M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid

论文阅读 Multi-Scale Structure-Aware Network for Human Pose Estimation

【论文阅读笔记】Multi-Scale Structure-Aware Network for Human Pose Estimation

【论文笔记】RCM-Fusion: Radar-Camera Multi-Level Fusion for 3D Object Detection

Feature Pyramid Networks for Object Detection

【Network Architecture】Feature Pyramid Networks for Object Detection(FPN)论文解析（转）

《Residual Bi-Fusion Feature Pyramid Network for Accurate Single-shot Object Detection》论文笔记

AFPN: Asymptotic Feature Pyramid Network for Object Detection-全新特征融合模块AFPN，完胜PAFPN

【目标检测论文阅读笔记】Attentional feature pyramid network for small object detection(2022)

【目标检测论文阅读笔记】Extended Feature Pyramid Network for Small Object Detection

阅读笔记《Changer: Feature Interaction is What You Need for Change Detection》

Fully Motion-Aware Network for Video Object Detection

Attentive Feedback Network for Boundary-Aware Salient Object Detection

人物交互（human object interaction）论文汇总-2020年

人物交互（human object interaction）论文汇总-2019年

人物交互（human object interaction）论文汇总-2018年

[论文阅读笔记27]Occlusion-Aware Detection and Re-ID Calibrated Network for Multi-Object Tracking

今日推荐

探索 api.maynor1024.live：一站式 AI 服务平台

AI一键去衣技术：窥见深度学习在图像处理领域的革命(最后有彩蛋)

艾体宝案例 | 使用Redis和Spring Ai构建rag应用程序

Apple M1 vs 高通8Gen2 vs Apple A12Z各方面比较

【升职加薪必备架构图】Springboot学习路线汇总_springboot四层架构流程图

与Apollo共创生态：Apollo7周年大会自动驾驶生态利剑出鞘

Spring Boot 3.0：未来企业应用开发的基石

Java 的 AI 前景光明

国内首个智能体生态大会！2024百度万象大会定档5月30日

开源一周年，青语言新版发布

深入浅出：大型语言模型（LLM）的全面解读

顶会ICLR2024论文Time-LLM：基于大语言模型的时间序列预测

周排行

学习笔记(01):Python入门教程-计算机如何区分数字和字符

命令行提示符_颜色

五步轻松搞定Linux下的文件同步(备份)

Visio 2010，如何打开多个窗口

西安新起点|MBA考研十大热门城市

BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation

【蓝桥杯】ADV-73 数组输出

[DeeplearningAI笔记]卷积神经网络4.11一维和三维卷积

Java 逻辑运算符

Python爬虫入门——2. 5 利用正则表达式爬取豆瓣电影 Top 250

每日归档

更多

2024-06-01(60)

2024-05-31(47)

2024-05-30(4)

2024-05-29(65)

2024-05-28(2)

2024-05-27(56)

2024-05-26(6)

2024-05-25(68)

2024-05-24(65)

2024-05-23(9)