Perceptual Generative Adversarial Networks for Small Object Detection

2017CVPR 新鲜出炉的paper,这是针对small object detection的一篇文章,采用PGAN来提升small object detection任务的performance。

最近也没做object detection,只是别人推荐了这篇paper,看了摘要觉得通俗易懂就往下看了。。。最后发现还是没怎么搞懂,只是明白PGAN的模型。如果理解有误的地方,请指出。

言归正传,PGAN为什么对small object有效?具体是这样,small object 不好检测,而large object好检测,那PGAN就让generator 学习一个映射,把small object 的features 映射成 large object 的features,然后就好检测了。PGAN呢,主要就看它的generator。

传统GAN中的generator是学习从随机噪声到图像的映射,也就是generator可以把一个噪声变成图片,而PGAN的思想是让generator把small object 变成 large object,这样就有利于检测了。 来看看文章中的原话都是怎么介绍generator的:

  1. we address the small object detection problem by developing a single architecture that internally lifts representations of small objects to “super-resolved” ones, achieving similar characteristics as large objects
  2. Perceptual Generative Adversarial Network (Perceptual GAN) model that improves small object detection through narrowing representation difference of small objects from the large ones.
  3. generator learns to transfer perceived poor representations of the small objects to super-resolved ones
  4. The Perceptual GAN aims to enhance the representations of small objects to be similar to those of large object
  5. the generator is a deep residual based feature generative model which transforms the original poor features of small objects to highly discriminative ones by introducing fine-grained details from lower-level layers, achieving “super-resolution” on the intermediate representations
    6.传统的generator G represents a generator that learns to map data z from the noise distribution pz(z) to the distribution pdata(x) over data x,而PGAN的generator中 x and z are the representations for large objects and small objects
  6. The generator network aims to generate super-resolved representations for small objects to improve detection accurac
  7. the generator as a deep residual learning network that augments the representations of small objects to super-resolved ones by introducing more fine-grained details absent from the small objects through residual learning

文章在不同地方不断的重复了一个意思,就是generator学习的是一个映射,这个映射就是把假(small object)的变成真(large object)的
来看看generator长什么样子

分两个部分,这里就没看懂是什么意思了,或许和object detection有关了。最终得出的结果是Super-Resolved Features 这个就很像Large Objects Featuresle. 如图,左下角是G生成的,左上角是真实的:

讲完了generator 就到discriminator了,这里的discrimintor和传统的GAN也有不一样的地方。

在这里,加入了一个新的loss,叫做perceptual loss ,PGAN也因此而得名(我猜的,很明显嘛)这个loss我也是没看明白的地方,贴原文大家看看吧(有理解的这部分的同学,请在评论区讲一讲,供大家学习)
1. justify the detection accuracy benefiting from the generated super-resolved features with a perceptual loss

看完paper感觉作者没有很直接说提出PGAN是inspired by哪些文章~不过GAN(2014 Goodfellow)

【文献阅读】Perceptual Generative Adversarial Networks for Small Object Detection –CVPR-2017相关推荐

  1. 文献阅读20期:Transformer Transforms Salient Object Detection and Camouflaged Object Detection

    [ 文献阅读 ] Transformer Transforms Salient Object Detection and Camouflaged Object Detection [1] 表现SOTA ...

  2. [论文阅读] AMPNet: Average- and Max-Pool Networks for Salient Object Detection

    论文地址:https://dx.doi.org/10.1109/TCSVT.2021.3054471 发表于:TCSVT 2021 Abstract 显著目标检测的任务是检测图像中最具视觉特色的物体. ...

  3. Generative Adversarial Networks: An Overview文献阅读笔记

    Generative Adversarial Networks: An Overview笔记 Abstract Generative adversarial networks (GANs) provi ...

  4. 自动驾驶轨迹预测论文阅读(三)Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks

    [略读]GUPTA A, JOHNSON J, FEI-FEI L, et al., 2018. Social GAN: Socially Acceptable Trajectories with G ...

  5. BEGAN: Boundary Equilibrium Generative Adversarial Networks阅读笔记

    BEGAN: Boundary Equilibrium Generative Adversarial Networks阅读笔记 摘要 我们提出了一种新的用于促成训练时生成器和判别器实现均衡(Equil ...

  6. 论文阅读Reconstructing CT from Biplanar X-Rays with Generative Adversarial Networks

    Reconstructing CT from Biplanar X-Rays with Generative Adversarial Networks 基于生成对抗网络的双平面x线重建CT 来源:CV ...

  7. 论文翻译:2019_Bandwidth Extension On Raw Audio Via Generative Adversarial Networks

    论文地址:原始音频的带宽扩展通过生成对抗网络 博客作者:凌逆战 博客地址:https://www.cnblogs.com/LXP-Never/p/10661950.html 摘要 基于神经网络的方法最 ...

  8. ESRGAN - Enhanced Super-Resolution Generative Adversarial Networks论文翻译——中英文对照

    文章作者:Tyan 博客:noahsnail.com  |  CSDN  |  简书 声明:作者翻译论文仅为学习,如有侵权请联系作者删除博文,谢谢! 翻译论文汇总:https://github.com ...

  9. ESRGAN - Enhanced Super-Resolution Generative Adversarial Networks论文翻译——中文版

    文章作者:Tyan 博客:noahsnail.com  |  CSDN  |  简书 声明:作者翻译论文仅为学习,如有侵权请联系作者删除博文,谢谢! 翻译论文汇总:https://github.com ...

最新文章

  1. Jieba分词原理与解析
  2. eyoucms添加内容
  3. C++ Primer 5th笔记(chap 12 动态内存)shared_ptr
  4. 相当全面:推荐系统干货总结
  5. Django 视图的FBV 与 CBV
  6. java开发串口步骤
  7. equals 和 == 的区别?知乎转载
  8. Chrome 上最必不可少的29 款插件,超级提高效率
  9. server2012文件服务器的开始按钮,Windows Server 2012实用技巧集锦
  10. js 直接打开选择文件窗口_打开CAD文件时,怎么快速选择“gbcbig.shx”字体?
  11. Why ASMLIB and why not?
  12. RAD Studio XE (DelphiXE)官网下载地址与注册
  13. hadoop之hdfs及其工作原理
  14. JS学习--用JS读取本地文件
  15. jquery的ajax的使用!
  16. Python Diary - Day 4 运算符
  17. SpringBoot切面应用-输出接口调用日志
  18. activiti 多实例任务
  19. Python 中的 Plotly 是什么?
  20. 你不再需要动态网页——编辑-发布-开发分离

热门文章

  1. 计算机网络学习1-网络层次
  2. 页面修改成套用MasterPage时遇到Invalid postback or callback argument的错误
  3. Path.GetExtension 方法
  4. intelj idea安装和配置
  5. 一个 Java 的 Socket 服务器和客户端通信的例子
  6. webApp 开发技术要点总结
  7. 分享一个SQL文件的合并的小程序
  8. [Teaching] [Silverlight] 30秒快速建立遊戲迴圈 (Game Loop)
  9. [原创] 在XP/2K 下实现 Win+Ctrl+Del 等键的屏蔽的方法,附源码与演示程序下载。...
  10. 关注 Web Client Software Factory [Weekly Drop 08]