CVPR学习(三):CVPR2019-各个方向
一、各个方向
视频人体骨架跟踪
【1】Learning Regularity in Skeleton Trajectories for Anomaly Detection in Videos
论文地址:https://arxiv.org/abs/1903.03295
野外人群计数
【2】Learning from Synthetic Data for Crowd Counting in the Wild
论文地址:https://arxiv.org/abs/1903.03303
场景图生成
【3】Knowledge-Embedded Routing Network for Scene Graph Generation
论文地址:https://arxiv.org/abs/1903.03326
图像检索、语义绑定
【4】Semantically Tied Paired Cycle Consistency for Zero-Shot Sketch-based Image Retrieval(Anjan Dutta, Zeynep Akata)
论文地址:https://arxiv.org/abs/1903.03372
结构化知识精馏用于语义分割
【5】Structured Knowledge Distillation for Semantic Segmentation
https://arxiv.org/pdf/1903.04197.pdf
用于自适应目标检测的强-弱分布对齐
【6】Strong-Weak Distribution Alignment for Adaptive Object Detection(Kuniaki Saito1、Yoshitaka Ushiku2、Tatsuya Harada2,3、Kate Saenko1,波士顿大、学东京大学)
论文地址:https://arxiv.org/pdf/1812.04798.pdf
一种用于细粒度和层次形状分割的递归零件分解网络
【7】PartNet: A Recursive Part Decomposition Network for Fine-grained and Hierarchical Shape Segmentation(Fenggen Yu、Kun Liu1、Yan Zhang1、Chenyang Zhu、Kai Xu,南京大学、国防科技大学)
论文地址:https://arxiv.org/pdf/1903.00709.pdf
理解和可视化深层视觉显著性模型
【9】Understanding and Visualizing Deep Visual Saliency Models(Sen He、Hamed R. Tavakoli、Ali Borji、Yang Mi、Nicolas Pugeault,埃克塞特大学、阿尔托大学)
论文地址:https://arxiv.org/pdf/1903.02501.pdf
深度完成的深度系数
【9】Depth Coefficients for Depth Completion(Saif Imran、Yunfei Long、Xiaoming Liu、Daniel Morris,密歇根州立大学)
论文地址:https://arxiv.org/pdf/1903.05421.pdf
用于视频对象分割的端到端递归网络
【10】RVOS: End-to-End Recurrent Network for Video Object Segmentation
论文地址:https://arxiv.org/pdf/1903.05612.pdf
模式为不同的图像合成寻找生成的对抗性网络
【11】Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis(北京大学、加利福尼亚大学)
论文地址:https://arxiv.org/pdf/1903.05628.pdf
通过重新描述学习文本到图像的生成
【12】MirrorGAN: Learning Text-to-image Generation by Redescription
论文地址:https://arxiv.org/pdf/1903.05854.pdf
基于深度迁移学习的多类新颖性检测
【13】Deep Transfer Learning for Multiple Class Novelty Detection
论文地址:https://arxiv.org/abs/1903.02196
通过自动编码转换而不是数据进行无监督表示学习
【14】AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transformations rather than Data
https://arxiv.org/pdf/1901.04596.pdf
一种用于人群理解的注意注入可变形卷积网络
【15】ADCrowdNet: An Attention-injective Deformable Convolutional Network for Crowd Understanding
https://arxiv.org/pdf/1811.11968.pdf
快速在线对象跟踪和分割
【16】Fast Online Object Tracking and Segmentation: A Unifying Approach
开源:https://github.com/foolwood/SiamMask
双编码的零示例视频检索
【17】Dual Encoding for Zero-Example Video Retrieval
论文地址:https://arxiv.org/abs/1809.06181
开源地址:https://github.com/danieljf24/dual_encoding
几何原语对三维点云的监督拟合
【18】Supervised Fitting of Geometric Primitives to 3D Point Clouds
https://arxiv.org/abs/1811.08988
从视频学习三维人体动力学
【19】Learning 3D Human Dynamics from Video
https://arxiv.org/abs/1812.01601
对场景图进行可解释和显式的视觉推理
【20】Explainable and Explicit Visual Reasoning over Scene Graphs
https://arxiv.org/abs/1812.01855
学习视差注意对立体图像的超分辨率
【21】Learning Parallax Attention for Stereo Image Super-Resolution
https://arxiv.org/abs/1903.05784
AdaGraph:通过图形统一预测和连续域适应
【22】AdaGraph: Unifying Predictive and Continuous Domain Adaptation through Graphs
https://arxiv.org/abs/1903.07062
QATM:深度学习的质量感知模板匹配
【23】QATM: Quality-Aware Template Matching For Deep Learning
https://arxiv.org/abs/1903.07254
Graph Convolutional Label Noise Cleaner:训练一个即插即用的动作分类器来检测异常
【24】Graph Convolutional Label Noise Cleaner: Train a Plug-and-play Action Classifier for Anomaly Detection
https://arxiv.org/abs/1903.07256
自动校准深光度立体网络
【25】Self-calibrating Deep Photometric Stereo Networks(oral)
https://arxiv.org/abs/1903.07366
基于cnn的绝对相机位姿回归的局限性
【26】Understanding the Limitations of CNN-based Absolute Camera Pose Regression
https://arxiv.org/abs/1903.07504
从时间的循环一致性中学习对应关系
【27】Learning Correspondence from the Cycle-Consistency of Time
https://arxiv.org/abs/1903.07593
http://ajabri.github.io/timecycle
视觉深度估计的伪激光雷达:填补了自动驾驶三维目标检测的空白
【28】Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving
https://arxiv.org/abs/1812.07179
单视图人体性能捕获与布仿真
【29】SimulCap : Single-View Human Performance Capture with Cloth Simulation
https://arxiv.org/abs/1903.06323
神经序列短语
【30】Neural Sequential Phrase Grounding (SeqGROUND)
https://arxiv.org/abs/1903.07669
【31】Direct Object Recognition Without Line-of-Sight Using Optical Coherence
https://arxiv.org/abs/1903.07705
【32】SceneCode: Monocular Dense Semantic Reconstruction using Learned Encoded Scene Representations
https://arxiv.org/abs/1903.06482
【33】Probabilistic End-to-end Noise Correction for Learning with Noisy Labels
https://arxiv.org/abs/1903.07788
【34】Semantic Image Synthesis with Spatially-Adaptive Normalization(oral)
https://arxiv.org/abs/1903.07291
【35】Inverse Path Tracing for Joint Material and Lighting Estimation(oral)
https://arxiv.org/abs/1903.07145
【36】Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis
https://arxiv.org/abs/1903.05628
https://github.com/HelenMao/MSGAN
【37】Selective Kernel Networks
https://arxiv.org/abs/1903.06586
【38】A Cross-Season Correspondence Dataset for Robust Semantic Segmentation
https://arxiv.org/abs/1903.06916
【39】Unsupervised Part-Based Disentangling of Object Shape and Appearance
https://arxiv.org/abs/1903.06946
【40】Inserting Videos into Videos
https://arxiv.org/abs/1903.06571
【41】Disentangling Latent Space for VAE by Label Relevant/Irrelevant Dimensions
https://arxiv.org/abs/1812.09502
【42】Domain Generalization by Solving Jigsaw Puzzles
https://arxiv.org/abs/1903.06864
【43】Fast Interactive Object Annotation with Curve-GCN
https://arxiv.org/abs/1903.06874
【44】MFAS: Multimodal Fusion Architecture Search
https://arxiv.org/abs/1903.06496
【45】OCGAN: One-class Novelty Detection Using GANs with Constrained Latent Representations
https://arxiv.org/abs/1903.08550
【46】An Efficient Schmidt-EKF for 3D Visual-Inertial SLAM
https://arxiv.org/abs/1903.08636
【47】Photometric Mesh Optimization for Video-Aligned 3D Object Reconstruction
https://arxiv.org/abs/1903.08642
code: https://chenhsuanlin.bitbucket.io/photometric-mesh-optim/
【48】Towards Robust Curve Text Detection with Conditional Spatial Expansion
https://arxiv.org/abs/1903.08836
【49】Learning with Batch-wise Optimal Transport Loss for 3D Shape Recognition
https://arxiv.org/abs/1903.08923
【50】Weakly-Supervised Discovery of Geometry-Aware Representation for 3D Human Pose Estimation
https://arxiv.org/pdf/1903.08839.pdf
【51】Patch-based Progressive 3D Point Set Upsampling
https://arxiv.org/abs/1811.11286
【52】Learning Regularity in Skeleton Trajectories for Anomaly Detection in Videos(Romero Morais; Vuong Le; Truyen Tran; Budhaditya Saha; Moussa Mansour; Svetha Venkatesh )
论文地址:https://arxiv.org/abs/1903.03295
【53】Learning from Synthetic Data for Crowd Counting in the Wild(Qi Wang, Junyu Gao, Wei Lin, Yuan Yuan)
论文地址:https://arxiv.org/abs/1903.03303
【54】Knowledge-Embedded Routing Network for Scene Graph Generation(Tianshui Chen, Weihao Yu, Riquan Chen, Liang Lin)
论文地址:https://arxiv.org/abs/1903.03326
【55】Semantically Tied Paired Cycle Consistency for Zero-Shot Sketch-based Image Retrieval(Anjan Dutta, Zeynep Akata)
论文地址:https://arxiv.org/abs/1903.03372
【56】Generalized Zero- and Few-Shot Learning via Aligned Variational Autoencoders
作者:Edgar Schönfeld, Sayna Ebrahimi, Samarth Sinha, Trevor Darrell, Zeynep Akata
论文链接:https://arxiv.org/abs/1812.01784
源码链接:https://github.com/edgarschnfld/CADA-VAE-PyTorch
【57】PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud
作者:Shaoshuai Shi, Xiaogang Wang, Hongsheng Li
论文链接:https://arxiv.org/abs/1812.04244
源码链接:https://github.com/sshaoshuai/PointRCNN
【58】FSA-Net: Learning Fine-Grained Structure Aggregation for Head Pose Estimation from a Single Image
作者:Tsun-Yi Yang, Yi-Ting Chen, Yen-Yu Lin, and Yung-Yu Chuang
论文链接:https://github.com/shamangary/FSA-Net/blob/master/0191.pdf
源码链接:https://github.com/shamangary/FSA-Net
【59】Learning Attraction Field Representation for Robust Line Segment Detection
作者:Nan Xue, Song Bai, Fudong Wang, Gui-Song Xia, Tianfu Wu, Liangpei Zhang
论文链接:https://arxiv.org/abs/1812.02122
源码链接:https://github.com/cherubicXN/afm_cvpr2019
【60】DFANet:Deep Feature Aggregation for Real-Time Semantic Segmentation(旷视)
作者:Hanchao Li, Pengfei Xiong,Haoqiang Fan,Jian Sun
论文链接:https://share.weiyun.com/5NgHbWH
【61】Live Reconstruction of Large-Scale Dynamic Outdoor Worlds
作者:Ondrej Miksik, Vibhav Vineet
论文链接:https://arxiv.org/abs/1903.06708
【62】Automatic adaptation of object detectors to new domains using self-training
作者:Aruni RoyChowdhury, Prithvijit Chakrabarty, Ashish Singh, SouYoung Jin, Huaizu Jiang, Liangliang Cao, Erik Learned-Miller
论文链接:https://arxiv.org/abs/1904.07305
【63】A Realistic Dataset and Baseline Temporal Model for Early Drowsiness Detection
作者:Reza Ghoddoosian, Marnim Galib, Vassilis Athitsos
论文链接:https://arxiv.org/abs/1904.07312
【64】Exploiting Computation Power of Blockchain for Biomedical Image Segmentation
作者:Boyang Li, Changhao Chenli, Xiaowei Xu, Taeho Jung, Yiyu Shi
论文链接:https://arxiv.org/abs/1904.07349
【65】NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection(目标检测)
作者:Golnaz Ghiasi, Tsung-Yi Lin, Ruoming Pang, Quoc V. Le
论文链接:https://arxiv.org/abs/1904.07392
【66】A Bayesian Perspective on the Deep Image Prior
作者:Zezhou Cheng, Matheus Gadelha, Subhransu Maji, Daniel Sheldon
论文链接:https://arxiv.org/abs/1904.07457
源码链接:https://github.com/ZezhouCheng/GP-DIP
【67】Fashion-AttGAN: Attribute-Aware Fashion Editing with Multi-Objective GAN
作者:Qing Ping, Jiangbo Yuan, Bing Wu, Wanying Ding
论文链接:https://arxiv.org/abs/1904.07460
【68】Focus Is All You Need: Loss Functions For Event-based Vision
作者:Guillermo Gallego, Mathias Gehrig, Davide Scaramuzza
论文链接:https://arxiv.org/abs/1904.07235
【69】Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting
作者:Yanhong Zeng, Jianlong Fu, Hongyang Chao, Baining Guo
论文链接:https://arxiv.org/abs/1904.07475
【70】Relation-Shape Convolutional Neural Network for Point Cloud Analysis
作者:Yongcheng Liu, Bin Fan, Shiming Xiang, Chunhong Pan
论文链接:https://arxiv.org/abs/1904.07601
项目链接:https://yochengliu.github.io/Relation-Shape-CNN/
源码链接:https://github.com/Yochengliu/Relation-Shape-CNN
【71】LBVCNN: Local Binary Volume Convolutional Neural Network for Facial Expression Recognition from Image Sequences(人脸识别)
作者:Sudhakar Kumawat, Manisha Verma, Shanmuganathan Raman
论文链接:https://arxiv.org/abs/1904.07647
【72】Semantically Aligned Bias Reducing Zero Shot Learning
作者:Akanksha Paul, Narayanan C. Krishnan, Prateek Munjal
论文链接:https://arxiv.org/abs/1904.07659
【73】Camera Lens Super-Resolution
作者:Chang Chen, Zhiwei Xiong, Xinmei Tian, Zheng-Jun Zha, Feng Wu
论文链接:http://staff.ustc.edu.cn/~zwxiong/cameraSR.pdf
源码链接:https://github.com/ngchc/CameraSR
【74】GolfDB: A Video Database for Golf Swing Sequencing
作者:William McNally, Kanav Vats, Tyler Pinto, Chris Dulhanty, John McPhee, Alexander Wong
论文链接:https://arxiv.org/abs/1903.06528v1
【75】R2GAN: Cross-modal Recipe Retrieval with Generative Adversarial Network
作者:Bin Zhu, Chong-Wah Ngo, Jingjing Chen, and Yanbin Hao
论文链接:http://vireo.cs.cityu.edu.hk/papers/R2GAN.pdf
【76】Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes
作者:Chengquan Zhang, Borong Liang, Zuming Huang, Mengyi En, Junyu Han, Errui Ding, Xinghao Ding
论文链接:https://arxiv.org/abs/1904.06535
【77】GA-Net: Guided Aggregation Net for End-to-end Stereo Matching(Oral)
作者:Feihu Zhang, Victor Prisacariu, Ruigang Yang, Philip H.S. Torr
论文链接:https://arxiv.org/abs/1904.06587
【78】LiveSketch: Query Perturbations for Guided Sketch-based Visual Search
作者:John Collomosse, Tu Bui, Hailin Jin
论文链接:https://arxiv.org/abs/1904.06611
【79】Multi-Similarity Loss with General Pair Weighting for Deep Metric Learning
论文链接:https://arxiv.org/abs/1904.06627
源码链接:https://github.com/MalongTech/research-ms-loss
【80】Conditional Single-view Shape Generation for Multi-view Stereo Reconstruction
作者:Yi Wei, Shaohui Liu, Wang Zhao, Jiwen Lu, Jie Zhou
论文链接:https://arxiv.org/abs/1904.06699
【81】VORNet: Spatio-temporally Consistent Video Inpainting for Object Removal
作者:Ya-Liang Chang, Zhe Yu Liu, Winston Hsu
论文链接:https://arxiv.org/abs/1904.06726
【82】Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation(Oral)
作者:Hao Tang, Dan Xu, Nicu Sebe, Yanzhi Wang, Jason J. Corso, Yan Yan
论文链接:https://arxiv.org/abs/1904.06807
源码链接:https://github.com/Ha0Tang/SelectionGAN
【83】ContactDB: Analyzing and Predicting Grasp Contact via Thermal Imaging(Oral)
作者:Samarth Brahmbhatt, Cusuh Ham, Charles C. Kemp, James Hays
论文链接:https://arxiv.org/abs/1904.06830
【84】Pedestrian Detection in Thermal Images using Saliency Maps(行人检测)
作者:Debasmita Ghose, Shasvat Mukeshkumar Desai, Sneha Bhattacharya, Deep Chakraborty, Madalina Fiterau, Tauhidur Rahman
论文链接:https://arxiv.org/abs/1904.06859
【85】Self-critical n-step Training for Image Captioning(图像生成)
作者:Junlong Gao, Shiqi Wang, Shanshe Wang, Siwei Ma, Wen Gao
论文链接:https://arxiv.org/abs/1904.06861
【86】Gait Recognition via Disentangled Representation Learning(Oral 步态识别)
作者:Ziyuan Zhang, Luan Tran, Xi Yin, Yousef Atoum, Xiaoming Liu, Jian Wan, Nanxin Wang
论文链接:https://arxiv.org/abs/1904.04925
【87】Towards High-fidelity Nonlinear 3D Face Morphable Model
作者:Luan Tran, Feng Liu, Xiaoming Liu
论文链接:https://arxiv.org/abs/1904.04933
项目链接:http://cvlab.cse.msu.edu/project-nonlinear-3dmm.html
【88】Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations(Oral)
作者:Jiwoon Ahn, Sunghyun Cho, Suha Kwak
论文链接:https://arxiv.org/abs/1904.05044
【89】Heavy Rain Image Restoration: Integrating Physics Model and Conditional Adversarial Learning
作者:Ruotent Li, Loong Fah Cheong, Robby T. Tan
论文链接:https://arxiv.org/abs/1904.05050
【90】C3AE: Exploring the Limits of Compact Model for Age Estimation
作者:Chao Zhang, Shuaicheng Liu, Xun Xu, Ce Zhu
论文链接:https://arxiv.org/abs/1904.05059
【91】DAVANet: Stereo Deblurring with View Aggregation(Oral)
作者:Shangchen Zhou, Jiawei Zhang, Wangmeng Zuo, Haozhe Xie, Jinshan Pan, Jimmy Ren
论文链接:https://arxiv.org/abs/1904.05065
【92】Text Guided Person Image Synthesis
作者:Xingran Zhou, Siyu Huang, Bin Li, Yingming Li, Jiachen Li, Zhongfei Zhang
论文链接:https://arxiv.org/abs/1904.05118
【93】Actor-Critic Instance Segmentation
作者:Kwang In Kim, Hyung Jin Chang
论文链接:https://arxiv.org/abs/1904.05126
【94】Joint Manifold Diffusion for Combining Predictions on Decoupled Observations
作者:Kwang In Kim, Hyung Jin Chang
论文链接:https://arxiv.org/abs/1904.05159
【95】Iterative Residual Refinement for Joint Optical Flow and Occlusion Estimation
作者:Junhwa Hur, Stefan Roth
论文链接:https://arxiv.org/abs/1904.05290
【96】H+O: Unified Egocentric Recognition of 3D Hand-Object Poses and Interactions(Oral)
作者:Bugra Tekin, Federica Bogo, Marc Pollefeys
论文链接:https://arxiv.org/abs/1904.05349
【97】Pixel-Adaptive Convolutional Neural Networks
作者:Hang Su, Varun Jampani, Deqing Sun, Orazio Gallo, Erik Learned-Miller, Jan Kautz
论文链接:https://arxiv.org/abs/1904.05373
【98】Spherical Regression: Learning Viewpoints, Surface Normals and 3D Rotations on n-Spheres
作者:Shuai Liao, Efstratios Gavves, Cees G. M. Snoek
论文链接:https://arxiv.org/abs/1904.05404
【99】Sliced Wasserstein Generative Models
作者:Jiqing Wu, Zhiwu Huang, Dinesh Acharya, Wen Li, Janine Thoma, Danda Pani Paudel, Luc Van Gool
论文链接:https://arxiv.org/abs/1904.05408
源码链接:https://github.com/musikisomorphie/swd
【100】Learning to Generate Synthetic Data via Compositing
作者:Shashank Tripathi, Siddhartha Chandra, Amit Agrawal, Ambrish Tyagi, James M. Rehg, Visesh Chari
论文链接:https://arxiv.org/abs/1904.05475
【101】Mitigating Information Leakage in Image Representations: A Maximum Entropy Approach(Oral)
作者:Proteek Chandan Roy, Vishnu Naresh Boddeti
论文链接:https://arxiv.org/abs/1904.05514
【102】Unified Visual-Semantic Embeddings: Bridging Vision and Language with Structured Meaning Representations
作者:Hao Wu, Jiayuan Mao, Yufeng Zhang, Yuning Jiang, Lei Li, Weiwei Sun, Wei-Ying Ma
论文链接:https://arxiv.org/abs/1904.05521
【103】Generating Multiple Hypotheses for 3D Human Pose Estimation with Mixture Density Network
作者:Chen Li, Gim Hee Lee
论文链接:https://arxiv.org/abs/1904.05547
【104】Reasoning Visual Dialogs with Structural and Partial Observations(Oral)
作者:Zilong Zheng, Wenguan Wang, Siyuan Qi, Song-Chun Zhu
论文链接:https://arxiv.org/abs/1904.05548
【105】C-MIL: Continuation Multiple Instance Learning for Weakly Supervised Object Detection
作者:Fang Wan, Chang Liu, Wei Ke, Xiangyang Ji, Jianbin Jiao, Qixiang Ye
论文链接:https://arxiv.org/abs/1904.05647
【106】TAFE-Net: Task-Aware Feature Embeddings for Low Shot Learning
作者:Xin Wang, Fisher Yu, Ruth Wang, Trevor Darrell, Joseph E. Gonzalez
论文链接:https://arxiv.org/abs/1904.05967
【107】Real-Time Dense Stereo Embedded in A UAV for Road Inspection
作者:Rui Fan, Jianhao Jiao, Jie Pan, Huaiyang Huang, Shaojie Shen, Ming Liu
论文链接:https://arxiv.org/abs/1904.06017
【108】Adaptive Weighting Multi-Field-of-View CNN for Semantic Segmentation in Pathology
作者:Hiroki Tokunaga, Yuki Teramoto, Akihiko Yoshizawa, Ryoma Bise
论文链接:https://arxiv.org/abs/1904.06040
【109】Unifying Heterogeneous Classifiers with Distillation
作者:Jayakorn Vongkulbhisal, Phongtharin Vinayavekhin, Marco Visentini-Scarzanella
论文链接:https://arxiv.org/abs/1904.06062
【110】YUVMultiNet: Real-time YUV multi-task CNN for autonomous driving
作者:Thomas Boulay, Said El-Hachimi, Mani Kumar Surisetti, Pullarao Maddu, Saranya Kandan
论文链接:https://arxiv.org/abs/1904.05673
【111】A Relation-Augmented Fully Convolutional Network for Semantic Segmentationin Aerial Scenes
作者:Lichao Mou, Yuansheng Hua, Xiao Xiang Zhu
论文链接:https://arxiv.org/abs/1904.05730
【112】Learning joint reconstruction of hands and manipulated objects
作者:Yana Hasson, Gül Varol, Dimitrios Tzionas, Igor Kalevatykh, Michael J. Black, Ivan Laptev, Cordelia Schmid
论文链接:https://arxiv.org/abs/1904.05767
【113】Probabilistic Permutation Synchronization using the Riemannian Structure of the Birkhoff Polytope(Oral)
作者:Tolga Birdal, Umut Şimşekli
论文链接:https://arxiv.org/abs/1904.05814
【114】Variational Information Distillation for Knowledge Transfer
作者:Sungsoo Ahn, Shell Xu Hu, Andreas Damianou, Neil D. Lawrence, Zhenwen Dai
论文链接:https://arxiv.org/abs/1904.05835
【115】Expressive Body Capture: 3D Hands, Face, and Body from a Single Image
作者:Georgios Pavlakos, Vasileios Choutas, Nima Ghorbani, Timo Bolkart, Ahmed A. A. Osman, Dimitrios Tzionas, Michael J. Black
论文链接:https://arxiv.org/abs/1904.05866
【116】A Simple Baseline for Audio-Visual Scene-Aware Dialog
作者:Idan Schwartz, Alexander Schwing, Tamir Hazan
论文链接:https://arxiv.org/abs/1904.05876
【117】Max-Sliced Wasserstein Distance and its use for GANs
作者:Ishan Deshpande, Yuan-Ting Hu, Ruoyu Sun, Ayis Pyrros, Nasir Siddiqui, Sanmi Koyejo, Zhizhen Zhao, David Forsyth, Alexander Schwing
论文链接:https://arxiv.org/abs/1904.05877
【118】Two Body Problem: Collaborative Visual Task Completion
作者:Unnat Jain, Luca Weihs, Eric Kolve, Mohammad Rastegari, Svetlana Lazebnik, Ali Farhadi, Alexander Schwing, Aniruddha Kembhavi
论文链接:https://arxiv.org/abs/1904.05879
【119】Factor Graph Attention
作者:Idan Schwartz, Seunghak Yu, Tamir Hazan, Alexander Schwing
论文链接:https://arxiv.org/abs/1904.05880
【120】Revisiting Local Descriptor based Image-to-Class Measure for Few-shot Learning
作者:Wenbin Li, Lei Wang, Jinglin Xu, Jing Huo, Yang Gao, Jiebo Luo
论文链接:http://cs.nju.edu.cn/rl/people/liwb/CVPR19.pdf
源码链接:https://github.com/WenbinLee/DN4.git
【121】Large-Scale Long-Tailed Recognition in an Open World(Oral)
作者:Ziwei Liu*, Zhongqi Miao*, Xiaohang Zhan, Jiayun Wang, Boqing Gong, Stella X. Yu
论文链接:https://github.com/ofsoundof/3D_Appearance_SR/blob/master/code/scripts/3d_appearance_sr.pdf
源码链接:
https://github.com/zhmiao/OpenLongTailRecognition-OLTR
【122】3D Appearance Super-Resolution with Deep Learning
作者:待补充
论文链接:https://github.com/ofsoundof/3D_Appearance_SR/blob/master/code/scripts/3d_appearance_sr.pdf
源码链接:https://github.com/ofsoundof/3D_Appearance_SR
【123】High-level Semantic Feature Detection: A New Perspective for Pedestrian Detection(行人检测)
作者:Zhao-Min Chen, Xiu-Shen Wei Peng Wang3Yanwen Guo1
论文链接:https://github.com/liuwei16/CSP/blob/master/docs/2019CVPR-CSP.pdf
源码链接:https://github.com/liuwei16/CSP
【124】Multi-Label Image Recognition with Graph Convolutional Networks(多标记图像识别)
作者:Zhao-Min Chen, Xiu-Shen Wei, Peng Wang, Yanwen Guo
论文链接:https://arxiv.org/abs/1904.03582
源码链接:https://github.com/chenzhaomin123/ML_GCN
简介:本工作针对多标记识别的核心问题,即“如何有效建模标记间的协同关系”进行探索,提出基于图卷积(GCN)的端到端系统,通过data-driven方式建立标记间有向图(directed graph)并由GCN将类别标记映射(mapping)为对应类别分类器,以此建模类别关系,同时可提升表示学习能力。此外针对GCN中的关键元素correlation matrix进行了深入分析和重设计,使其更胜任多标记问题。
【125】Cycle-Consistency for Robust Visual Question Answering(VQA)
作者:Gao Peng, Zhengkai Jiang, Haoxuan You, Zhengkai Jiang, Pan Lu, Steven Hoi, Xiaogang Wang, Hongsheng Li
论文链接:https://arxiv.org/pdf/1812.05252.pdf
【126】Data augmentation using learned transformsfor one-shot medical image segmentation
作者:Amy Zhao, Guha Balakrishnan, Frédo Durand, John V. Guttag, Adrian V. Dalca
论文链接:https://arxiv.org/pdf/1902.09383.pdf
源码链接:https://github.com/xamyzhao/brainstorm
【127】DeepCO3: Deep Instance Co-segmentation by Co-peak Search and Co-saliency (Oral )
作者:Kuang-Jui Hsu, Yen-Yu Lin, Yung-Yu Chuang
论文链接:http://cvlab.citi.sinica.edu.tw/images/paper/cvpr-hsu19.pdf
源码链接:https://github.com/KuangJuiHsu/DeepCO3
【128】Calibration of Asynchronous Camera Networks for Object Reconstruction Tasks
作者:Amy Tabb, Henry Medeiros
论文链接:https://arxiv.org/abs/1903.06811
【129】LP-3DCNN: Unveiling Local Phase in 3D Convolutional Neural Networks
作者:Sudhakar Kumawat, Shanmuganathan Raman
论文链接:https://arxiv.org/abs/1904.03498
【130】A Variational Auto-Encoder Model for Stochastic Point Processes
作者:Nazanin Mehrasa, Akash Abdu Jyothi, Thibaut Durand, Jiawei He, Leonid Sigal, Greg Mori
论文链接:https://arxiv.org/abs/1904.03273
【131】2.5D Visual Sound(FAIR Oral)
作者:Ruohan Gao, Kristen Grauman
论文链接:https://arxiv.org/abs/1812.04204
项目链接:http://vision.cs.utexas.edu/projects/2.5D_visual_sound/
源码链接:https://github.com/facebookresearch/FAIR-Play
【132】DeepLight: Learning Illumination for Unconstrained Mobile Mixed Reality
作者:Chloe LeGendre, Wan-Chun Ma, Graham Fyffe, John Flynn, Laurent Charbonnel, Jay Busch, Paul Debevec
论文链接:https://arxiv.org/abs/1904.01175
【133】Kervolutional Neural Networks
作者:Chen Wang, Jianfei Yang, Lihua Xie, Junsong Yuan
论文链接:https://arxiv.org/abs/1904.03955
【134】SoDeep: a Sorting Deep net to learn ranking loss surrogates
作者:Martin Engilberge, Louis Chevallier, Patrick Pérez, Matthieu Cord
论文链接:https://arxiv.org/abs/1904.04272
【135】3D Local Features for Direct Pairwise Registration
作者:Haowen Deng, Tolga Birdal, Slobodan Ilic
论文链接:https://arxiv.org/abs/1904.04281
【136】Neural Rerendering in the Wild(Oral)
作者:Moustafa Meshry, Dan B Goldman, Sameh Khamis, Hugues Hoppe, Rohit Pandey, Noah Snavely, Ricardo Martin-Brualla
论文链接:https://arxiv.org/abs/1904.04290
【137】End-to-end Projector Photometric Compensation
作者:Bingyao Huang, Haibin Ling
论文链接:https://arxiv.org/abs/1904.04335
【138】What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment
作者:Paritosh Parmar, Brendan Tran Morris
论文链接:https://arxiv.org/abs/1904.04346
【139】Towards Universal Object Detection by Domain Attention
作者:Xudong Wang, Zhaowei Cai, Dashan Gao, Nuno Vasconcelos
论文链接:https://arxiv.org/abs/1904.04402
项目链接:http://www.svcl.ucsd.edu/projects/universal-detection/
【140】Efficient Decision-based Black-box Adversarial Attacks on Face Recognition(人脸识别)
作者:Yinpeng Dong, Hang Su, Baoyuan Wu, Zhifeng Li, Wei Liu, Tong Zhang, Jun Zhu
论文链接:https://arxiv.org/abs/1904.04433
【141】Reliable and Efficient Image Cropping: A Grid Anchor based Approach
作者:Hui Zeng, Lida Li, Zisheng Cao, Lei Zhang
论文链接:https://arxiv.org/abs/1904.04441
代码链接:https://github.com/HuiZeng/Grid-Anchor-based-Image-Cropping
【142】SPM-Tracker: Series-Parallel Matching for Real-Time Visual Object Tracking(视觉跟踪)
作者:Guangting Wang, Chong Luo, Zhiwei Xiong, Wenjun Zeng
论文链接:https://arxiv.org/abs/1904.04452
【143】Graphonomy: Universal Human Parsing via Graph Transfer Learning
作者:Ke Gong, Yiming Gao, Xiaodan Liang, Xiaohui Shen, Meng Wang, Liang Lin
论文链接:https://arxiv.org/abs/1904.04536
源码链接:https://github.com/Gaoyiminggithub/Graphonomy
【144】Deep Virtual Networks for Memory Efficient Inference of Multiple Tasks
作者:Eunwoo Kim, Chanho Ahn, Philip H.S. Torr, Songhwai Oh
论文链接:https://arxiv.org/abs/1904.04562
【145】Holistic and Comprehensive Annotation of Clinically Significant Findings on Diverse CT Images: Learning from Radiology Reports and Label Ontology(Oral)
作者:Ke Yan, Yifan Peng, Veit Sandfort, Mohammadhadi Bagheri, Zhiyong Lu, Ronald M. Summers
论文链接:https://arxiv.org/abs/1904.04661
【146】Domain-Symmetric Networks for Adversarial Domain Adaptation
作者:Yabin Zhang, Hui Tang, Kui Jia, Mingkui Tan
论文链接:https://arxiv.org/abs/1904.04663
【147】Action Recognition from Single Timestamp Supervision in Untrimmed Videos(动作识别)
作者:Davide Moltisanti, Sanja Fidler, Dima Damen
论文链接:https://arxiv.org/abs/1904.04689
【148】Label Propagation for Deep Semi-supervised Learning
作者:Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Ondrej Chum
论文链接:https://arxiv.org/abs/1904.04717
【149】Cross-Modal Self-Attention Network for Referring Image Segmentation
作者:Linwei Ye, Mrigank Rochan, Zhi Liu, Yang Wang
论文链接:https://arxiv.org/abs/1904.04745
【150】Leveraging the Invariant Side of Generative Zero-Shot Learning
作者:Jingjing Li, Mengmeng Jin, Ke Lu, Zhengming Ding, Lei Zhu, Zi Huang
论文链接:https://arxiv.org/abs/1904.04092
【151】Learning monocular depth estimation infusing traditional stereo knowledge
作者:Fabio Tosi, Filippo Aleotti, Matteo Poggi, Stefano Mattoccia
论文链接:https://arxiv.org/abs/1904.04144
代码链接:https://github.com/fabiotosi92/monoResMatch-Tensorflow
【152】Unsupervised learning of action classes with continuous temporal embedding
作者:Anna Kukleva, Hilde Kuehne, Fadime Sener, Juergen Gall
论文链接:https://arxiv.org/abs/1904.04189
【153】Pushing the Envelope for RGB-based Dense 3D Hand Pose Estimation via Neural Rendering
作者:Seungryul Baek, Kwang In Kim, Tae-Kyun Kim
论文链接:https://arxiv.org/abs/1904.04196
【154】Relational Action Forecasting(oral)
作者:Chen Sun, Abhinav Shrivastava, Carl Vondrick, Rahul Sukthankar, Kevin Murphy, Cordelia Schmid
论文链接:https://arxiv.org/abs/1904.04231
Cascaded Partial Decoder for Fast and Accurate Salient Object Detection
https://arxiv.org/abs/1904.08739
【155】A Theoretically Sound Upper Bound on the Triplet Loss for Improving the Efficiency of Deep Distance Metric Learning
https://arxiv.org/abs/1904.08720
【156】Out-of-Distribution Detection for Generalized Zero-Shot Action Recognition
https://arxiv.org/abs/1904.08703
【157】DDLSTM: Dual-Domain LSTM for Cross-Dataset Action Recognition
https://arxiv.org/abs/1904.08634
【158】Fooling automated surveillance cameras: adversarial patches to attack person detection
https://arxiv.org/abs/1904.08653
【159】Unsupervised Open Domain Recognition by Semantic Discrepancy Minimization
https://arxiv.org/abs/1904.08631
【160】Progressive Attention Memory Network for Movie Story Question Answering
https://arxiv.org/abs/1904.08607
【161】Unsupervised Person Image Generation with Semantic Parsing Transformation
https://arxiv.org/abs/1904.03379
【162】Unsupervised Person Image Generation with Semantic Parsing Transformation
论文链接:https://arxiv.org/abs/1904.03379
项目链接:https://github.com/SijieSong/person_generation_spt
【163】Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit?
论文链接:https://arxiv.org/abs/1806.07550
【164】Self-Supervised GANs via Auxiliary Rotation Loss
论文链接:https://arxiv.org/abs/1811.11212
【165】Multi-Agent Tensor Fusion for Contextual Trajectory Prediction
论文链接:https://arxiv.org/abs/1904.04776
【166】L3-Net: Towards Learning based LiDAR Localization for Autonomous Driving
论文链接:https://songshiyu01.github.io/pdf/L3Net_W.Lu_Y.Zhou_S.Song_CVPR2019.pdf
【167】Deep Convolutional Networks on 3D Point Clouds
论文链接:https://arxiv.org/pdf/1811.07246.pdf
源码链接:https://github.com/DylanWusee/pointconv
【168】CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection
论文链接:https://drive.google.com/open?id=1JcZMHBXEX-7AR1P010OXg_wCCC5HukeZ(需要申请)
源码链接:https://github.com/zhangludl/code-and-dataset-for-CapSal
【169】Segmentation-driven 6D Object Pose Estimation
论文链接:https://arxiv.org/abs/1812.02541
源码链接:https://github.com/cvlab-epfl/segmentation-driven-pose
【170】LBS Autoencoder: Self-supervised Fitting of Articulated Meshes to Point Clouds
论文链接:https://arxiv.org/abs/1904.10037
【171】Learning Actor Relation Graphs for Group Activity Recognition
论文链接:https://arxiv.org/abs/1904.10117
【172】Student Becoming the Master: Knowledge Amalgamation for Joint Scene Parsing, Depth Estimation, and More
论文链接:https://arxiv.org/abs/1904.10167
【173】Attention-guided Network for Ghost-free High Dynamic Range Imaging
论文链接:https://arxiv.org/abs/1904.10293
【174】Data-Driven Neuron Allocation for Scale Aggregation Networks
论文链接:https://arxiv.org/abs/1904.09460
【175】A Simple Pooling-Based Design for Real-Time Salient Object Detection
论文链接:https://arxiv.org/abs/1904.09569
源码链接:http://mmcheng.net/poolnet/
【176】TransGaGa: Geometry-Aware Unsupervised Image-to-Image Translation
论文链接:https://arxiv.org/abs/1904.09571
【177】Deep Metric Learning Beyond Binary Supervision(Oral)
论文链接:https://arxiv.org/abs/1904.09626
【178】Fast User-Guided Video Object Segmentation by Interaction-and-Propagation Networks
论文链接:https://arxiv.org/abs/1904.09791
【179】PCAN: 3D Attention Map Learning Using Contextual Information for Point Cloud Based Retrieval
论文链接:https://arxiv.org/abs/1904.09793
【180】Superquadrics Revisited: Learning 3D Shape Parsing beyond Cuboids
论文链接:https://arxiv.org/abs/1904.09970
源码链接:https://github.com/paschalidoud/superquadric_parsing
【181】ContactDB: Analyzing and Predicting Grasp Contact via Thermal Imaging(Oral)
论文链接:https://contactdb.cc.gatech.edu/contactdb_paper.pdf
源码链接:https://github.com/samarth-robo/contactdb_prediction
【182】Aggregation Cross-Entropy for Sequence Recognition
论文链接:https://arxiv.org/abs/1904.08364
【183】Variational Prototyping-Encoder: One-Shot Learning with Prototypical Images
论文链接:https://arxiv.org/abs/1904.08482
【184】Meta-learning Convolutional Neural Architectures for Multi-target Concrete Defect Classification with the COncrete DEfect BRidge IMage Dataset
论文链接:https://arxiv.org/abs/1904.08486
【185】Machine Vision Guided 3D Medical Image Compression for Efficient Transmission and Accurate Segmentation in the Clouds
论文链接:https://arxiv.org/abs/1904.08487
【186】Few-Shot Learning with Localization in Realistic Settings
论文链接:https://arxiv.org/abs/1904.08502
【187】Progressive Attention Memory Network for Movie Story Question Answering
论文链接:https://arxiv.org/abs/1904.08607
【188】Unsupervised Open Domain Recognition by Semantic Discrepancy Minimization
论文链接:https://arxiv.org/abs/1904.08631
【189】DDLSTM: Dual-Domain LSTM for Cross-Dataset Action Recognition
论文链接:https://arxiv.org/abs/1904.08634
【190】Out-of-Distribution Detection for Generalized Zero-Shot Action Recognition
论文链接:https://arxiv.org/abs/1904.08703
【191】A Theoretically Sound Upper Bound on the Triplet Loss for Improving the Efficiency of Deep Distance Metric Learning
论文链接:https://arxiv.org/abs/1904.08720
【192】Cascaded Partial Decoder for Fast and Accurate Salient Object Detection
论文链接:https://arxiv.org/abs/1904.08739
【193】4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks
论文链接:https://arxiv.org/abs/1904.08755
【194】Attentive Single-Tasking of Multiple Tasks
论文链接:https://arxiv.org/abs/1904.08918
【195】Towards VQA Models that can Read
论文链接:https://arxiv.org/abs/1904.08920
【196】Listen to the Image
论文链接:https://arxiv.org/abs/1904.09115
【197】SelFlow: Self-Supervised Learning of Optical Flow
作者:Pengpeng Liu, Michael Lyu, Irwin King, Jia Xu
论文链接:https://arxiv.org/abs/1904.09117
【198】Visualizing the decision-making process in deep neural decision forest
论文链接:https://arxiv.org/abs/1904.09201
源码链接:https://github.com/Nicholasli1995/VisualizingNDF
【199】STEP: Spatio-Temporal Progressive Learning for Video Action Detection(Oral,视频动作识别)
论文链接:https://arxiv.org/abs/1904.09288
【200】Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving
论文链接:https://arxiv.org/abs/1812.07179
【201】Transferrable Prototypical Networks for Unsupervised Domain Adaptation
论文链接:https://arxiv.org/abs/1904.11227
【202】Exploring Object Relation in Mean Teacher for Cross-Domain Detection
论文链接:https://arxiv.org/abs/1904.11245
CVPR学习(三):CVPR2019-各个方向相关推荐
- 深度学习三(PyTorch物体检测实战)
深度学习三(PyTorch物体检测实战) 文章目录 深度学习三(PyTorch物体检测实战) 1.网络骨架:Backbone 1.1.神经网络基本组成 1.1.1.卷积层 1.1.2.激活函数层 1. ...
- 深度学习的过去、当下与未来!深度学习三巨头发文展望
点击上方"机器学习与生成对抗网络",关注星标 获取有趣.好玩的前沿干货! 来源:ACM 新智元 编辑:Priscilla Emil [导读]2018图灵奖获得者Yoshua Ben ...
- 2020届 AAAI Fellow名单新鲜出炉!!!深度学习三巨头终于齐聚
点击上方"深度学习技术前沿",选择"星标"公众号 资源干货,第一时间送达 AAAI 是国际人工智能领域最权威的学术组织,Fellow 是该学会给予会员的最高荣誉 ...
- 【目录】 软件测试全栈需要学习什么? 软件测试的各个阶段 ,软件测试学习路径,软件测试方向选择,软件测试的薪资待遇。...
关于博主: 博主是一位帅气的美男子,自认为我每次坐地铁的时候看到比我帅的人不多,目前从事于自动化测试工作与云计算方向的研究.就业与某行业国内排行前三的公司.个人认为学习,不仅为了当时学会了,过两天就忘 ...
- OpenCV学习之六: 使用方向梯度直方图估计图像旋转角度
OpenCV学习之六: 使用方向梯度直方图估计图像旋转角度 原文:http://blog.csdn.net/zhjm07054115/article/details/26964275 下面的代码通过计 ...
- 实至名归!ACM宣布深度学习三巨头共同获得图灵奖
昨日晚间,ACM(国际计算机学会)宣布,有"深度学习三巨头"之称的Yoshua Bengio.Yann LeCun.Geoffrey Hinton共同获得了2018年的图灵奖,这是 ...
- 【技术综述】图像与CNN发家简史,集齐深度学习三巨头
文章首发于微信公众号<有三AI> [技术综述]图像与CNN发家简史,集齐深度学习三巨头 没有一个经典的发现会是突然之间横空出世,它总是需要一些积淀. 提起卷积神经网络,我们总会从LeNet ...
- 深度学习三巨头共获 2018 年图灵奖(经典重温)!
整理 | 琥珀 出品 | AI科技大本营(ID:rgznai100) 2019 年 3 月 27 日,ACM 宣布,深度学习三位大牛 Yoshua Bengio.Yann LeCun.Geoffrey ...
- 深度学习三十年创新路
深度学习三十年创新路 编者注:深度学习火了,从任何意义上,大家谈论它的热衷程度,都超乎想象.但是,似乎很少有人提出不同的声音,说深度学习的火热,有可能是过度的繁荣,乃至不理性的盲从.而这次,有不同的想 ...
- 昨日种种已得奖,那深度学习三巨头今天在忙什么?
上周,AI圈最大的事情,没有之一,就是图灵奖,终于终于,终于颁给了深度学习三巨头. 关于Geoffrey Hinton和他的两位学生Yoshua Bengio.Yann LeCun的故事,在消息出来后 ...
最新文章
- Ubuntu 下配置 SSH服务全过程及问题解决
- java简单springboot系统_Springboot系列 3 - 建立简单的用户登录系统
- linux文件类型为ext4怎么扩展,如何扩展ext4分区和文件系统?
- linux 内核编译错误 .size expression for copy_user_generic_c does not evaluate to a constant
- python中国大学排名爬虫写明详细步骤-python中国大学排名爬虫
- 42. Vue、React 等前端项目部署,刷新 404 问题解决方案
- UnicodeEncodeError: 'UCS-2' codec can't encode characters in position 8-8: Non-BMP character not sup
- Xming + PuTTY 在Windows下远程Linux主机使用图形界面的程序
- bootstrap table使用参考
- 0day的NFO文件名的含义大全
- 基本控件Password控件
- 给学习java web新手们的建议和推荐一些书籍
- 用python计算1~100的阶乘之和_在Python中递归函数调用举例and匿名函数lambda求1~100的和及计算阶乘举例...
- mongodb集群分片环境搭建
- 231 · 自动补全
- 虚拟机网络连接模式中桥接模式和NAT模式的区别
- 计算机无法识别建行网银盾,为你修复建行网银盾无法识别
【应对方案】
的详细方案_...
- 腾讯内部深度文章曝光:微信向左 手机QQ向右
- OpenSSL密码库算法笔记——第5章 椭圆曲线
- Java实现微信小程序校验图片是否含有违法违规内容
热门文章
- ShortcutMapper 是应用程序的键盘快捷键
- 的clear会清空内存吗_Python内存分配时有哪些不为你知的小秘密?
- matlab工具箱使用50hz低通滤波器设计 和FFT 变化截取50hz工频信号幅值
- python3学习总结(个人遇到问题后搞明白的知识点总结)
- 现代化多媒体教室的计算机系统,多媒体教室系统建设方案
- 接口是java面向对象的实现机制之一_以下说法正确的是: ()_接口是Java面向对象的实现机制之一,以下说法正确的是:( )...
- vue项目中使用axios发送请求
- uni-app发布为H5页面白屏问题
- linux设置activemq开机启动,Activemq(centos7)开机自启动服务
- python 等差数列list_Python3基础 list range+for 等差数列