CVPR2019 | 论文之行为/动作识别、手势识别、时序动作检测及视频相关

CVPR2019 | 论文之行为/动作识别、手势识别、时序动作检测及视频相关

行为/动作识别、手势识别

1、An Attention Enhanced Graph Convolutional LSTM Network for Skeleton-Based Action Recognition
中文:《一种用于骨架动作识别的注意增强型图卷积LSTM网络》
作者:Chenyang Si, Wentao Chen, Wei Wang, Liang Wang, Tieniu Tan
论文链接:https://arxiv.org/abs/1902.09130

2、Improving the Performance of Unimodal Dynamic Hand-Gesture Recognition with Multimodal Training
中文:多模态训练提高单模态动态手势识别性能
作者:Mahdi Abavisani, Hamid Reza Vaezi Joze, Vishal M. Patel
链接:https://arxiv.org/abs/1812.06145

3、Collaborative Spatio-temporal Feature Learning for Video Action Recognition
中文:协同时空特征学习在视频动作识别中的应用
作者:Chao Li, Qiaoyong Zhong, Di Xie, Shiliang Pu
论文链接:https://arxiv.org/abs/1903.01197

4、Peeking into the Future: Predicting Future Person Activities and Locations in Videos(行为预测)
中文:窥视未来:在视频中预测未来人的活动和位置
作者:Junwei Liang, Lu Jiang, Juan Carlos Niebles, Alexander Hauptmann, Li Fei-Fei
论文链接:https://arxiv.org/abs/1902.03748

5、Neural Scene Decomposition for Multi-Person Motion Capture
中文:多人运动捕捉的神经场景分解
作者:Helge Rhodin, Victor Constantin, Isinsu Katircioglu, Mathieu Salzmann, Pascal Fua
论文链接:https://arxiv.org/abs/1903.05684

6、Action Recognition from Single Timestamp Supervision in Untrimmed Videos(动作识别)
中文:基于单时间戳监督的未剪辑视频动作识别
作者:Davide Moltisanti, Sanja Fidler, Dima Damen
论文链接:https://arxiv.org/abs/1904.04689

7、Pushing the Envelope for RGB-based Dense 3D Hand Pose Estimation via Neural Rendering
中文:基于RGB的神经绘制密集三维手部姿态估计
作者:Seungryul Baek, Kwang In Kim, Tae-Kyun Kim
论文链接:https://arxiv.org/abs/1904.04196

8、Relational Action Forecasting(oral)
中文:关系动作预测
作者:Chen Sun, Abhinav Shrivastava, Carl Vondrick, Rahul Sukthankar, Kevin Murphy, Cordelia Schmid
论文链接:https://arxiv.org/abs/1904.04231

9、H+O: Unified Egocentric Recognition of 3D Hand-Object Poses and Interactions(Oral)
中文:H+O:统一的以自我为中心的三维手势和交互识别
作者:Bugra Tekin, Federica Bogo, Marc Pollefeys
论文链接:https://arxiv.org/abs/1904.05349

10、Out-of-Distribution Detection for Generalized Zero-Shot Action Recognition
中文:广义零射击动作识别的分布外检测
作者:Devraj Mandal, Sanath Narayan, Saikumar Dwivedi, Vikram Gupta, Shuaib Ahmed, Fahad Shahbaz Khan, Ling Shao
论文链接:https://arxiv.org/abs/1904.08703

11、Actional-Structural Graph Convolutional Networks for Skeleton-based Action Recognition
中文:基于骨架动作识别的动作结构图卷积网络
作者:Maosen Li, Siheng Chen, Xu Chen, Ya Zhang, Yanfeng Wang, and Qi Tian
论文链接:https://arxiv.org/pdf/1904.12659

12、A neural network based on SPD manifold learning for skeleton-based hand gesture recognition
中文:基于SPD流形学习的神经网络在基于骨架的手势识别中的应用
作者:Xuan Son Nguyen, Luc Brun, Olivier Lézoray, Sébastien Bougleux
论文链接:https://arxiv.org/abs/1904.12970

13、DMC-Net: Generating Discriminative Motion Cues for Fast Compressed Video Action Recognition(Facebook)
中文:DMC-Net:一种用于快速压缩视频动作识别的区分性运动线索生成技术
作者:Zheng Shou, Xudong Lin, Yannis Kalantidis, Laura Sevilla-Lara, Marcus Rohrbach, Shih-Fu Chang, Zhicheng Yan
论文链接:https://arxiv.org/abs/1901.03460




时序动作检测及视频相关

1、Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning
中文:时空动态和语义属性丰富的视频字幕视觉编码
作者:Nayyer Aafaq, Naveed Akhtar, Wei Liu, Syed Zulqarnain Gilani, Ajmal Mian
论文链接:https://arxiv.org/abs/1902.10322
来源:https://mp.weixin.qq.com/s/61C-k3Ijy_7ry5B5lRML6Q

2、Single-frame Regularization for Temporally Stable CNNs(视频处理)
中文:时间稳定CNNs的单帧正则化方法
作者:Gabriel Eilertsen, Rafał K. Mantiuk, Jonas Unger
论文链接:https://arxiv.org/abs/1902.10424
来源:https://mp.weixin.qq.com/s/61C-k3Ijy_7ry5B5lRML6Q

3、Neural RGB-D Sensing: Depth estimation from a video Camera
中文:神经RGB-D传感:摄像机深度估计
作者:Chao Liu, Jinwei Gu, Kihwan Kim, Srinivasa Narasimhan, Jan Kautz
论文链接:https://arxiv.org/abs/1901.02571
project链接:https://research.nvidia.com/publication/2019-06_Neural-RGBD

4、Competitive Collaboration: Joint Unsupervised Learning of Depth, CameraMotion, Optical Flow and Motion Segmentation
中文:竞争合作:深度、摄像运动、光流和运动分割的联合无监督学习
作者:Anurag Ranjan, Varun Jampani, Kihwan Kim, Deqing Sun, Jonas Wulff, Michael J. Black
论文链接:https://arxiv.org/abs/1805.09806

5、Representation Flow for Action Recognition
中文:动作识别的表示流
作者:AJ Piergiovanni, Michael S. Ryoo
论文链接:https://arxiv.org/abs/1810.01455
项目链接:https://piergiaj.github.io/rep-flow-site/
代码链接:https://github.com/piergiaj/representation-flow-cvpr19

6、Learning Regularity in Skeleton Trajectories for Anomaly Detection in Videos
中文:视频异常检测中骨架轨迹的学习规律
作者:Romero Morais, Vuong Le, Truyen Tran, Budhaditya Saha, Moussa Mansour, Svetha Venkatesh
论文链接:https://arxiv.org/abs/1903.03295

7、Video Generation from Single Semantic Label Map
中文:从单个语义标签图生成视频
作者:Junting Pan, Chengyu Wang, Xu Jia, Jing Shao, Lu Sheng, Junjie Yan, Xiaogang Wang
论文链接:https://arxiv.org/abs/1903.04480
源码链接:https://github.com/junting/seg2vid/tree/master

8、Inserting Videos into Videos
中文:将视频插入视频
作者:Donghoon Lee, Tomas Pfister, Ming-Hsuan Yang
论文链接:https://arxiv.org/abs/1903.06571

9、Recurrent Back-Projection Network for Video Super-Resolution
中文:用于视频超分辨率的循环反投影网络
作者:Muhammad Haris, Greg Shakhnarovich, Norimichi Ukita
论文链接:https://arxiv.org/abs/1903.10128
代码链接:https://github.com/alterzero/RBPN-PyTorch
项目链接:https://alterzero.github.io/projects/RBPN.html

10、Depth-Aware Video Frame Interpolation
中文:
作者:Wenbo Bao Wei-Sheng Lai, Chao Ma, Xiaoyun Zhang, Zhiyong Gao, and Ming-Hsuan Yang
论文链接:https://sites.google.com/view/wenbobao/dain
     https://arxiv.org/abs/1904.00830
代码链接:https://github.com/baowenbo/DAIN

11、Video Relationship Reasoning using Gated Spatio-Temporal Energy Graph
中文:使用门控时空能量图的视频关系推理
作者:Yao-Hung Hubert Tsai, Santosh Divvala, Louis-Philippe Morency, Ruslan Salakhutdinov, Ali Farhadi
论文链接:https://arxiv.org/abs/1903.10547

12、Dual Encoding for Zero-Example Video Retrieval
中文:双重编码实现零样本视频检索
作者:Jianfeng Dong, Xirong Li, Chaoxi Xu, Shouling Ji, Yuan He, Gang Yang and Xun Wang
论文链接:https://arxiv.org/abs/1809.06181
代码链接:https://github.com/danieljf24/dual_encoding

13、Rethinking the Evaluation of Video Summaries
中文:重新思考视频摘要的评估
作者:Jacques Manderscheid, Amos Sironi, Nicolas Bourdis, Davide Migliore, Vincent Lepetit
论文链接:https://arxiv.org/abs/1903.11328

14、End-to-End Time-Lapse Video Synthesis from a Single Outdoor Image
中文:从单个室外图像进行端到端延时视频合成
作者:Seonghyeon Nam, Chongyang Ma, Menglei Chai, William Brendel, Ning Xu, Seon Joo Kim
论文链接:https://arxiv.org/abs/1904.00680

15、GolfDB: A Video Database for Golf Swing Sequencing
中文:GolfDB:用于高尔夫挥杆定序的视频数据库
作者:William McNally, Kanav Vats, Tyler Pinto, Chris Dulhanty, John McPhee, Alexander Wong
论文链接:https://arxiv.org/abs/1903.06528v1

16、VORNet: Spatio-temporally Consistent Video Inpainting for Object Removal
中文:VORNet:时空一致的视频修补,用于对象移除
作者:Ya-Liang Chang, Zhe Yu Liu, Winston Hsu
论文链接:https://arxiv.org/abs/1904.06726

17、STEP: Spatio-Temporal Progressive Learning for Video Action Detection(Oral)
中文:步骤:时空渐进学习,用于视频动作检测
作者:Xitong Yang, Xiaodong Yang, Ming-Yu Liu, Fanyi Xiao, Larry Davis, Jan Kautz
论文链接:https://arxiv.org/abs/1904.09288

18、UnOS: Unified Unsupervised Optical-flow and Stereo-depth Estimation by Watching Videos
中文:UnOS:通过观看视频实现统一的无监督光流和立体深度估计
作者:Yang Wang, Peng Wang, Zhenheng Yang, Chenxu Luo, Yi Yang, and Wei Xu
论文链接:https://arxiv.org/abs/1810.03654

19、Memory-Attended Recurrent Network for Video Captioning
中文:用于视频字幕的内存专用循环网络
作者:Wenjie Pei, Jiyuan Zhang, Xiangrong Wang, Lei Ke, Xiaoyong Shen, Yu-Wing Tai
论文链接:https://arxiv.org/abs/1905.03966

猜你喜欢

转载自blog.csdn.net/leiduifan6944/article/details/109624879