This column collects computer vision papers. Date: July 16, 2021. Source: Paper Digest


1, TITLE: Applying The Case Difference Heuristic to Learn Adaptations from Deep Network Features
AUTHORS: Xiaomeng Ye ; Ziwei Zhao ; David Leake ; Xizi Wang ; David Crandall
CATEGORY: cs.AI [cs.AI, cs.CV]
HIGHLIGHT: This paper investigates a two-phase process combining deep learning for feature extraction and neural network based adaptation learning from extracted features.

2, TITLE: Object Retrieval and Localization in Large Art Collections Using Deep Multi-Style Feature Fusion and Iterative Voting
AUTHORS: Nikolai Ufer ; Sabine Lang ; Björn Ommer
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In the following, we introduce an algorithm that allows users to search for image regions containing specific motifs or objects and find similar regions in an extensive dataset, helping art historians to analyze large digitized art collections.

3, TITLE: Surgical Instruction Generation with Transformers
AUTHORS: Jinglu Zhang ; Yinyu Nie ; Jian Chang ; Jian Jun Zhang
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: Inspired by neural machine translation and image captioning tasks in the open domain, we introduce a transformer-backboned encoder-decoder network with self-critical reinforcement learning to generate instructions from surgical images.

4, TITLE: An Efficient and Small Convolutional Neural Network for Pest Recognition -- ExquisiteNet
AUTHORS: Shi-Yao Zhou ; Chung-Yen Su
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this paper, we propose a small and efficient model called ExquisiteNet to complete the task of recognizing the pests and we expect to apply our model on mobile devices.

5, TITLE: Semantic Image Cropping
AUTHORS: Oriol Corcoll
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this thesis, I introduce an additional dimension to the problem of cropping, semantics. To support my argument, I provide a new dataset containing 100 images with at least two different entities per image and four ground truth croppings collected using Amazon Mechanical Turk.

6, TITLE: Level Generation and Style Enhancement -- Deep Learning for Game Development Overview
AUTHORS: Piotr Migdał ; Bartłomiej Olechno ; Błażej Podgórski
CATEGORY: cs.CV [cs.CV, I.2.10; I.4.3; J.5]
HIGHLIGHT: We present practical approaches of using deep learning to create and enhance level maps and textures for video games -- desktop, mobile, and web.

7, TITLE: What and When to Look?: Temporal Span Proposal Network for Video Visual Relation Detection
AUTHORS: Sangmin Woo ; Junhyug Noh ; Kangil Kim
CATEGORY: cs.CV [cs.CV, cs.AI]
HIGHLIGHT: To date, two representative methods have been proposed to tackle Video Visual Relation Detection (VidVRD): segment-based and window-based.

8, TITLE: Training for Temporal Sparsity in Deep Neural Networks, Application in Video Processing
AUTHORS: Amirreza Yousefzadeh ; Manolis Sifalakis
CATEGORY: cs.CV [cs.CV, cs.LG, eess.IV]
HIGHLIGHT: Towards this goal, in this paper we introduce a new DNN layer (called Delta Activation Layer), whose sole purpose is to promote temporal sparsity of activations during training.

9, TITLE: DynaDog+T: A Parametric Animal Model for Synthetic Canine Image Generation
AUTHORS: Jake Deane ; Sinead Kearney ; Kwang In Kim ; Darren Cosker
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: Consequently, we introduce a parametric canine model, DynaDog+T, for generating synthetic canine images and data which we use for a common computer vision task, binary segmentation, which would otherwise be difficult due to the lack of available data.

10, TITLE: Unsupervised Anomaly Instance Segmentation for Baggage Threat Recognition
AUTHORS: Taimur Hassan ; Samet Akcay ; Mohammed Bennamoun ; Salman Khan ; Naoufel Werghi
CATEGORY: cs.CV [cs.CV, eess.IV]
HIGHLIGHT: This paper presents a novel unsupervised anomaly instance segmentation framework that recognizes baggage threats, in X-ray scans, as anomalies without requiring any ground truth labels.

11, TITLE: Deep Learning Based Food Instance Segmentation Using Synthetic Data
AUTHORS: D. Park ; J. Lee ; J. Lee ; K. Lee
CATEGORY: cs.CV [cs.CV, cs.AI]
HIGHLIGHT: To address the difficulty of data collection and annotation, this paper proposes a food segmentation method that applies to the real world through synthetic data. To perform food segmentation on healthcare robot systems, such as a meal assistance robot arm, we generate synthetic data using the open-source 3D graphics software Blender, placing multiple objects on a meal plate, and train Mask R-CNN for instance segmentation.

12, TITLE: Incorporating Lambertian Priors Into Surface Normals Measurement
AUTHORS: Yakun Ju et al.
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this paper, we use the initial normal under the Lambertian assumption as the prior information to refine the normal measurement, instead of solely applying the observed shading cues to deriving the surface normal.

13, TITLE: MeNToS: Tracklets Association with A Space-Time Memory Network
AUTHORS: Mehdi Miah ; Guillaume-Alexandre Bilodeau ; Nicolas Saunier
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: We propose a method for multi-object tracking and segmentation (MOTS) that does not require fine-tuning or per benchmark hyperparameter selection.

14, TITLE: Lidar Light Scattering Augmentation (LISA): Physics-based Simulation of Adverse Weather Conditions for 3D Object Detection
AUTHORS: Velat Kilic et al.
CATEGORY: cs.CV [cs.CV, physics.optics]
HIGHLIGHT: To address this issue, we propose a physics-based approach to simulate lidar point clouds of scenes in adverse weather conditions.

15, TITLE: Potential UAV Landing Sites Detection Through Digital Elevation Models Analysis
AUTHORS: Efstratios Kakaletsis ; Nikos Nikolaidis
CATEGORY: cs.CV [cs.CV, cs.GR, 68W40]
HIGHLIGHT: In this paper, we present a simple technique for detecting potential landing sites for Unmanned Aerial Vehicles (UAVs) that uses terrain information to identify flat areas.

16, TITLE: A Generalized Framework for Edge-preserving and Structure-preserving Image Smoothing
AUTHORS: Wei Liu et al.
CATEGORY: cs.CV [cs.CV, cs.GR, cs.LG]
HIGHLIGHT: A Generalized Framework for Edge-preserving and Structure-preserving Image Smoothing

17, TITLE: Recurrent Parameter Generators
AUTHORS: Jiayun Wang ; Yubei Chen ; Stella X. Yu ; Brian Cheung ; Yann LeCun
CATEGORY: cs.CV [cs.CV, cs.LG]
HIGHLIGHT: We present a generic method for recurrently using the same parameters for many different convolution layers to build a deep network.

18, TITLE: StyleFusion: A Generative Model for Disentangling Spatial Segments
AUTHORS: Omer Kafri ; Or Patashnik ; Yuval Alaluf ; Daniel Cohen-Or
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: We present StyleFusion, a new mapping architecture for StyleGAN, which takes as input a number of latent codes and fuses them into a single style code.

19, TITLE: Neighbor-view Enhanced Model for Vision and Language Navigation
AUTHORS: Dong An et al.
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this work, we propose a multi-module Neighbor-View Enhanced Model (NvEM) to adaptively incorporate visual contexts from neighbor views for better textual-visual matching.

20, TITLE: From Show to Tell: A Survey on Image Captioning
AUTHORS: Matteo Stefanini et al.
CATEGORY: cs.CV [cs.CV, cs.CL]
HIGHLIGHT: The final goal of this work is to serve as a tool for understanding the existing state-of-the-art and highlighting the future directions for an area of research where Computer Vision and Natural Language Processing can find an optimal synergy.

21, TITLE: Training Compact CNNs for Image Classification Using Dynamic-coded Filter Fusion
AUTHORS: Mingbao Lin et al.
CATEGORY: cs.CV [cs.CV, cs.AI]
HIGHLIGHT: In this paper, we present a novel filter pruning method, dubbed dynamic-coded filter fusion (DCFF), to derive compact CNNs in a computation-economical and regularization-free manner for efficient image classification.

22, TITLE: STAR: Sparse Transformer-based Action Recognition
AUTHORS: Feng Shi et al.
CATEGORY: cs.CV [cs.CV, cs.AI]
HIGHLIGHT: This work proposes a novel skeleton-based human action recognition model with sparse attention on the spatial dimension and segmented linear attention on the temporal dimension of data.

23, TITLE: Mutually Improved Endoscopic Image Synthesis and Landmark Detection in Unpaired Image-to-image Translation
AUTHORS: Lalith Sharan et al.
CATEGORY: cs.CV [cs.CV, cs.AI]
HIGHLIGHT: Instead, we propose to use landmark detection on the points when sutures pass into the tissue.

24, TITLE: Deep Automatic Natural Image Matting
AUTHORS: Jizhizi Li ; Jing Zhang ; Dacheng Tao
CATEGORY: cs.CV [cs.CV, cs.AI]
HIGHLIGHT: In this paper, we investigate the difficulties when extending them to natural images with salient transparent/meticulous foregrounds or non-salient foregrounds.

25, TITLE: FetalNet: Multi-task Deep Learning Framework for Fetal Ultrasound Biometric Measurements
AUTHORS: Szymon Płotka et al.
CATEGORY: cs.CV [cs.CV, cs.AI, cs.LG, eess.IV]
HIGHLIGHT: In this paper, we propose an end-to-end multi-task neural network called FetalNet with an attention mechanism and stacked module for spatio-temporal fetal ultrasound scan video analysis.

26, TITLE: Single-image Full-body Human Relighting
AUTHORS: Manuel Lagunas et al.
CATEGORY: cs.CV [cs.CV, cs.GR]
HIGHLIGHT: We present a single-image data-driven method to automatically relight images with full-body humans in them.

27, TITLE: What Image Features Boost Housing Market Predictions?
AUTHORS: Zona Kostic ; Aleksandar Jevremovic
CATEGORY: cs.CV [cs.CV, cs.LG]
HIGHLIGHT: In this paper, we propose a set of techniques for the extraction of visual features for efficient numerical inclusion in modern-day predictive algorithms.

28, TITLE: High Carbon Stock Mapping at Large Scale with Optical Satellite Imagery and Spaceborne LIDAR
AUTHORS: Nico Lang ; Konrad Schindler ; Jan Dirk Wegner
CATEGORY: cs.CV [cs.CV, cs.LG]
HIGHLIGHT: Here, we propose an automated approach that aims to support conservation and sustainable land use planning decisions by mapping tropical landscapes at large scale and high spatial resolution following the High Carbon Stock (HCS) approach.

29, TITLE: Amodal Segmentation Just Like Doing A Jigsaw
AUTHORS: Xunli Zeng ; Jianqin Yin
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: Different from this method, we propose a method of amodal segmentation based on the idea of the jigsaw.

30, TITLE: COAST: COntrollable Arbitrary-Sampling NeTwork for Compressive Sensing
AUTHORS: Di You ; Jian Zhang ; Jingfen Xie ; Bin Chen ; Siwei Ma
CATEGORY: cs.CV [cs.CV, eess.IV]
HIGHLIGHT: In this paper, we propose a novel COntrollable Arbitrary-Sampling neTwork, dubbed COAST, to solve CS problems of arbitrary-sampling matrices (including unseen sampling matrices) with one single model.

31, TITLE: StyleVideoGAN: A Temporal Generative Model Using A Pretrained StyleGAN
AUTHORS: Gereon Fox ; Ayush Tewari ; Mohamed Elgharib ; Christian Theobalt
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this paper, we present a novel approach to the video synthesis problem that helps to greatly improve visual quality and drastically reduce the amount of training data and resources necessary for generating video content.

32, TITLE: Adversarial Attacks on Multi-task Visual Perception for Autonomous Driving
AUTHORS: Ibrahim Sobh ; Ahmed Hamed ; Varun Ravi Kumar ; Senthil Yogamani
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this work, detailed adversarial attacks are applied on a diverse multi-task visual perception deep network across distance estimation, semantic segmentation, motion detection, and object detection.

33, TITLE: Context-Conditional Adaptation for Recognizing Unseen Classes in Unseen Domains
AUTHORS: Puneet Mangla ; Shivam Chandhok ; Vineeth N Balasubramanian ; Fahad Shahbaz Khan
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: Motivated from the success of generative zero-shot approaches, we propose a feature generative framework integrated with a COntext COnditional Adaptive (COCOA) Batch-Normalization to seamlessly integrate class-level semantic and domain-specific information.

34, TITLE: Passive Attention in Artificial Neural Networks Predicts Human Visual Selectivity
AUTHORS: Thomas A. Langlois et al.
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: This work contributes a new approach to evaluating the biological and psychological validity of leading ANNs as models of human vision: by examining their similarities and differences in terms of their visual selectivity to the information contained in images.

35, TITLE: Variational Topic Inference for Chest X-Ray Report Generation
AUTHORS: Ivona Najdenkoska ; Xiantong Zhen ; Marcel Worring ; Ling Shao
CATEGORY: cs.CV [cs.CV, cs.LG, eess.IV]
HIGHLIGHT: The topics are inferred in a conditional variational inference framework, with each topic governing the generation of a sentence in the report. Specifically, we introduce a set of topics as latent variables to guide sentence generation by aligning image and language modalities in a latent space.

36, TITLE: Diff-Net: Image Feature Difference Based High-Definition Map Change Detection
AUTHORS: Lei He ; Shengjie Jiang ; Xiaoqing Liang ; Ning Wang ; Shiyu Song
CATEGORY: cs.CV [cs.CV, cs.RO]
HIGHLIGHT: To achieve constantly updated HD maps, we present a deep neural network (DNN), Diff-Net, to detect changes in them.

37, TITLE: Recommending Best Course of Treatment Based on Similarities of Prognostic Markers
AUTHORS: Sudhanshu ; Narinder Singh Punn ; Sanjay Kumar Sonbhadra ; Sonali Agarwal
CATEGORY: cs.IR [cs.IR, cs.CV]
HIGHLIGHT: In this context, the goal of this paper is to propose a collaborative filtering based recommender system for the healthcare sector that recommends remedies based on the symptoms experienced by patients.

38, TITLE: MultiBench: Multiscale Benchmarks for Multimodal Representation Learning
AUTHORS: Paul Pu Liang et al.
CATEGORY: cs.LG [cs.LG, cs.AI, cs.CL, cs.CV, cs.MM]
HIGHLIGHT: MultiBench provides an automated end-to-end machine learning pipeline that simplifies and standardizes data loading, experimental setup, and model evaluation.

39, TITLE: Learning Sparse Interaction Graphs of Partially Observed Pedestrians for Trajectory Prediction
AUTHORS: Zhe Huang ; Ruohua Li ; Kazuki Shin ; Katherine Driggs-Campbell
CATEGORY: cs.RO [cs.RO, cs.CV]
HIGHLIGHT: Thus, we propose Gumbel Social Transformer, in which an Edge Gumbel Selector samples a sparse interaction graph of partially observed pedestrians at each time step.

40, TITLE: VILENS: Visual, Inertial, Lidar, and Leg Odometry for All-Terrain Legged Robots
AUTHORS: David Wisth ; Marco Camurri ; Maurice Fallon
CATEGORY: cs.RO [cs.RO, cs.CV]
HIGHLIGHT: We present VILENS (Visual Inertial Lidar Legged Navigation System), an odometry system for legged robots based on factor graphs.

41, TITLE: FastSHAP: Real-Time Shapley Value Estimation
AUTHORS: Neil Jethani ; Mukund Sudarshan ; Ian Covert ; Su-In Lee ; Rajesh Ranganath
CATEGORY: stat.ML [stat.ML, cs.CV, cs.LG]
HIGHLIGHT: We introduce FastSHAP, a method for estimating Shapley values in a single forward pass using a learned explainer model.

42, TITLE: A Modular U-Net for Automated Segmentation of X-ray Tomography Images in Composite Materials
AUTHORS: João P C Bertoldo ; Etienne Decencière ; David Ryckelynck ; Henry Proudhon
CATEGORY: eess.IV [eess.IV, cs.CV, 68T07 (Primary) 68T45 (Secondary), I.4.6; I.2.10; I.5.4; J.2]
HIGHLIGHT: In this paper a modular interpretation of UNet (Modular U-Net) is proposed and trained to segment 3D tomography images of a three-phased glass fiber-reinforced Polyamide 66.

43, TITLE: Multi-Channel Auto-Encoders and A Novel Dataset for Learning Domain Invariant Representations of Histopathology Images
AUTHORS: Andrew Moyes et al.
CATEGORY: eess.IV [eess.IV, cs.CV, cs.LG]
HIGHLIGHT: In this work, the Multi-Channel Auto-Encoder (MCAE) model is presented as an extension to DCAE which learns from more than two domains of data.
