[CVPR 2013] Three Trending Computer Vision Research Areas

As I walked through the large poster-filled hall at CVPR 2013, I asked myself, “Quo vadis Computer Vision?" (Where are you going, computer vision?)  I see lots of papers which exploit last year’s ideas, copious amounts of incremental research, and an overabundance of off-the-shelf computational techniques being recombined in seemingly novel ways.  When you are active in computer vision research for several years, it is not rare to find oneself becoming bored by a significant fraction of papers at research conferences.  Right after the main CVPR conference, I felt mentally drained and needed to get a breath of fresh air, so I spent several days checking out the sights in Oregon.  Here is one picture -- proof that the CVPR2013 had more to offer than ideas!

When I returned from sight-seeing, I took a more circumspect look at the field of computer vision.  I immediately noticed that vision research is actually advancing and growing in a healthy way.  (Unfortunately, most junior students have a hard determining which research papers are actually novel and/or significant.)  A handful of new research themes arise each year, and today I’d like to briefly discuss three new computer vision research themes which are likely to rise in popularity in the foreseeable future (2-5 years).
1) RGB-D input data is trending.
Many of this year’s papers take a single 2.5D RGB-D image as input and try to parse the image into its constituent objects.  The number of papers doing this with RGBD data is seemingly infinite.  Some other CVPR 2013 approaches don’t try to parse the image, but instead do something else like: fit cuboids, reason about affordances in 3D, or reason about illumination.  The reason why such inputs are becoming more popular is simple: RGB-D images can be obtained via cheap and readily available sensors such as Microsoft’s Kinect.  Depth measurements used to be obtained by expensive time of flight sensors (in the late 90s and early 00s), but as of 2013, $150 can buy you one these depth sensing bad-boys!  In fact, I had bought a Kinect just because I thought that it might come in handy one day -- and since I’ve joined MIT, I’ve been delving into the RGB-D reconstruction domain on my own.  It is just a matter of time until the newest iPhone has an on-board depth sensor, so the current line of research which relies on RGB-D input is likely to become the norm within a few years.
H. Jiang and J. Xiao. A Linear Approach to Matching Cuboids in RGBD Images. In CVPR 2013. [pdf] [code]


2) Mid-level patch discovery is a hot research topic.
 Saurabh Singh from CMU introduced this idea in his seminal ECCV 2012 paper, and Carl Doersch applied this idea to large-scale Google Street-View imagery in the “What makes Paris look like Paris?” SIGGRAPH 2012 paper.  The idea is to automatically extract mid-level patches (which could be objects, object parts, or just chunks of stuff) from images with the constraint that those are the most informative patches.Regarding the SIGGRAPH paper, see the video below.

Unsupervised Discovery of Mid-Level Discriminative Patches Saurabh Singh, Abhinav Gupta, Alexei A. Efros. In ECCV, 2012.

allowfullscreen="" frameborder="0" height="315" src="http://www.youtube.com/embed/s5-30NKSwo8" width="560">  Carl Doersch, Saurabh Singh, Abhinav Gupta, Josef Sivic, and Alexei A. Efros. What Makes Paris Look like Paris? In SIGGRAPH 2012. [pdf]
At CVPR 2013, it was evident that the idea of "learning mid-level parts for scenes" is being pursued by other top-tier computer vision research groups.  Here are some CVPR 2013 papers which capitalize on this idea:
Blocks that Shout: Distinctive Parts for Scene Classification. Mayank Juneja, Andrea Vedaldi, CV Jawahar, Andrew Zisserman. In CVPR, 2013. [pdf]
Representing Videos using Mid-level Discriminative Patches. Arpit Jain, Abhinav Gupta, Mikel Rodriguez, Larry Davis. CVPR, 2013. [pdf]
Part Discovery from Partial Correspondence. Subhransu Maji, Gregory Shakhnarovich. In CVPR, 2013. [pdf]

3) Deep-learning and feature learning are on the rise within the Computer Vision community.
It seems that everybody at Google Research is working on Deep-learning.  Will it solve all vision problems?  Is it the one computational ring to rule them all?  Personally, I doubt it, but the rising presence of deep learning is forcing every researcher to brush up on their l33t backprop skillz.  In other words, if you don't know who Geoff Hinton is, then you are in trouble.

from: http://www.computervisionblog.com/2013/07/cvpr-2013-three-trending-computer.html

从CVPR 2013看计算机视觉的研究领域和趋势 [CVPR 2013] Three Trending Computer Vision Research Areas相关推荐

  1. 从CVPR 2014看计算机视觉领域的最新热点

    从CVPR 2014看计算机视觉领域的最新热点 编者按:2014年度计算机视觉方向的顶级会议CVPR上月落下帷幕.在这次大会中,微软亚洲研究院共有15篇论文入选.今年的CVPR上有哪些让人眼前一亮的研 ...

  2. 图像处理与计算机视觉基础相关领域的经典书籍以及论文

    原文的链接是http://www.iask.sina.com.cn/u/2252291285/ish. 我非常感谢原作者杨晓冬辛勤地编写本文章,并愿意共享出来.我也希望转载本文的各位朋友,要注明原作者 ...

  3. 模式识别、计算机视觉、机器学习领域的顶级期刊和会议(整理)

    部分AI刊物影响因子05 SCIIF 2005 2004JMLR 4.027 5.952(机器学习)PAMI 3.810 4.352(模式识别) IJCV 3.657 2.914(计算机视觉) TOI ...

  4. 斯坦福 AI Lab 主任 Chris Manning:人工智能研究的最新趋势和挑战

    https://www.infoq.cn/article/NocvJXE0wd4HCMDyJ_Sa 本文为 Robin.ly 授权转载,文章版权归原作者所有,转载请联系原作者. 本期 Robin.ly ...

  5. 计算机视觉Computer Vision网址导航

    1常用网站 20条常用网站网址,更多点此 Google(gfsoso) [直达] 计算机视觉网 [直达] 增强现实资讯 [直达] 开源中国社区oschina [直达] 百度搜索 [直达] 小木虫,学术 ...

  6. 计算机视觉研究那些事 |CVPR 2020 论文分享会

    本文转载自微软学术合作. 在以下链接查看 CVPR 2020 线上论文分享会全程回放: https://space.bilibili.com/110487933/channel/detail?cid= ...

  7. 从CVPR 2021的论文看计算机视觉的现状

    作者丨Georgian 来源丨DeepHub IMBA 编辑丨极市平台 导读 本文根据今年的CVPR录用结果总结出了一些CV领域相关的发展现状. 计算机视觉(Computer Vision, CV)是 ...

  8. 【杂谈】如何学会看arxiv.org才能不错过自己研究领域的最新论文?

    文章首发于微信公众号<有三AI> [杂谈]如何学会看arxiv.org才能不错过自己研究领域的最新论文? 今天介绍一个用于追踪arxiv.org平台上最新论文的工具arxiv-sanity ...

  9. 计算机视觉在农业领域中的应用

    摘要:随着计算机等技术的不断发展,计算机视觉技术被广泛运用到各个领域中.与此同时,随着人口数量的增长.城市化进程导致耕地面积的减少,农业向着高质量.高产量方向的发展成为关键.将计算机视觉技术应用在农业 ...

最新文章

  1. 【CF EDU59 E】 Vasya and Binary String (DP)
  2. 编译报错field has incomplete type
  3. C++ 调试技术:addr2line
  4. Linux系统.xsesion日志文件,linux系统日志
  5. python学习笔记三一 函数学习
  6. java cpu过高排查_论线上如何排查一次CPU100%的情况
  7. uboot启动流程概述_Alibaba Cloud Linux 2 LTS OS 启动优化实践
  8. Unity3D 之UGUI 滑动条(Slider)
  9. IdentityServer4密码模式
  10. java路径在那_Java 路径
  11. python print_Python print()
  12. Matlab - 演化博弈论实现
  13. 计算机打字键盘亮怎么设置,键盘指示灯亮着却不能打字的解决方法
  14. 十一条Python学习路线,推荐收藏
  15. Word学习笔记:P6-文档封面、页眉、页脚设置
  16. CSP 201712-2 游戏
  17. Travis CI 简介
  18. N-vop、S-vop、Packed Bistream
  19. 如何优雅的将Mybatis日志中的Preparing与Parameters转换为可执行SQL
  20. 怎么绕过付费验证获取作文网站上的内容

热门文章

  1. 深度学习在推荐领域的应用
  2. jvm性能调优实战 - 27亿级数据量的实时分析引擎,为啥频繁发生Full GC
  3. Apache Kafka-消费端_顺序消费的实现
  4. Java - String源码解析及常见面试问题
  5. python 选择排序算法
  6. abaqus最大应力准则怎么用_ANSYS与ABAQUS对比,你选择那个?
  7. idea 2019.2 版本更新(最顶部从白色边框变为黑色边框)
  8. 计算机专业的分支,计算机专业分支(转载)
  9. python判断值是否在excel中_python接口自动化测试之根据excel中的期望结果是否存在于请求返回的响应值中来判断用例是否执行成功...
  10. sp MySQL 导入_mysql数据导入redis