一、作者

​ Lv Tang ,Bo Li1,Yijie Zhong ,Shouhong Ding, Mofei Song2, Youtu Lab

二、地址

2.1 原文地址

ICCV 地址

2.2 代码地址

代码

三、摘要

​ Aiming at discovering and locating most distinctive objects from visual scenes, salient object detection (SOD) plays an essential role in various computer vision systems. Coming to the era of high resolution, SOD methods are facing new challenges. The major limitation of previous methods is that they try to identify the salient regions and estimate the accurate objects boundaries simultaneously with a single regression task at low-resolution. This practice ignores the inherent difference between the two difficult problems, resulting in poor detection quality. In this paper, we propose a novel deep learning framework for high-resolution SOD task, which disentangles the task into a low-resolution saliency classification network (LRSCN) and a high-resolution refinement network (HRRN). As apixel-wise classification task, LRSCN is designed to capture sufficient semantics at low-resolution to identify the definite salient(most pixels inside the salient object have the highest saliency value), background( most pixels in the background regions have the lowest salient value) and uncertain image regions(saliency values of the pixels at blurry object boundaries fluctuate between 0 and 1). HRRN is a regression task, which aims at accurately refining the saliency value of pixels in the uncertain region to preserve a clear object boundary at high-resolution with limited GPU memory. It is worth noting that by introducing uncertainty into the training process, our HRRN can well address the high-resolution refinement task without using any high-resolution training data. Extensive experiments on high-resolution saliency datasets as well as some widely used saliency benchmarks show that the method achieves superior performance compared to the state-of -the-art methods.

四、主要内容

4.1 主要工作

  • We provide a new perspective that high-resolution salient object detection should be disentangled into two tasks, and demonstrate that the disentanglement of the two tasks is essential for improving the performance of DNN based SOD models.
  • Motivated by the principle of disentanglement, we propose a novel framework for high-resolution salient object detection, which uses LRSCN to capture sufficient semantics at low-resolution and HRRN for accurate boundary refinement at high-resolution.
  • We make the earliest efforts to introduce the uncertainty into SOD network training, which empowers HRRN to well address the high-resolution refinement task without any high-resolution training datasets.
  • We perform extensive experiments to demonstrate the proposed method refreshes the SOTA performance on high-resolution saliency datasets as well as some widely used saliency benchmarks by a large margin.

4.2 网络结构 (VGG-16)

4.3 MECF and AGA

4.3.1 ME ( based on Global Convolutional Network (GCN))

4.3.2 CF (utilize cross-level feature fusion module)

4.3.3 SGA ( guarantees the alignment of trimap and saliecny map)

4.4 LOSS

4.4.1 LRSCN Loss

$ T^{gt}$ : trimap groundtruth

Tgt(x,y)={2,Tgt(x,y)ϵdefinitesalient0,Tgt(x,y)ϵdefinitebackground1,Tgt(x,y)ϵdefiniteuncertainregionT^{gt}(x, y) = \begin{cases} 2, \ T^{gt}(x,y)\epsilon \ definite\ salient \\ 0,T^{gt}(x,y)\epsilon \ definite\ background \\ 1, T^{gt}(x,y)\epsilon \ definite\ uncertain region \end{cases} Tgt(x,y)=⎩⎪⎨⎪⎧​2, Tgt(x,y)ϵ definite salient0,Tgt(x,y)ϵ definite background1,Tgt(x,y)ϵ definite uncertainregion​

Ltrimap=1N∑i−log(eTi∑jeTj)L_{trimap}=\frac{1}{N}\sum_i-log(\frac{e^{T_i}}{\sum_j e^{T_j}}) Ltrimap​=N1​i∑​−log(∑j​eTj​eTi​​)

LLRSCN=Lsaliency+LtrimapLsaliency:BCE+SSIM+F−measureL_{LRSCN}=L_{saliency}+L_{trimap}\\ L_{saliency}:BCE+SSIM+F-measure LLRSCN​=Lsaliency​+Ltrimap​Lsaliency​:BCE+SSIM+F−measure

4.4.2 HRRN Loss

​ uncertainty loss will make the weight of the loss in the uncertainty region be small and let the network ignore effects from noisy data as much as possible.

L1=1E∑iϵE∣SiH−GiH∣E:numbeofpixelsL_1 = \frac{1}{E} \sum_{i\epsilon E}|S_i^H - G_i^H| \\ E:numbe\ of\ pixels L1​=E1​iϵE∑​∣SiH​−GiH​∣E:numbe of pixels

Luncertainty=1U∑iϵU∣∣SiH−GiH∣∣22σi2+12logσi2U:totalnumberofpixelsinuncertainregionL_{uncertainty} = \frac{1}{U} \sum_{i\epsilon U}\frac{{||S_i^H - G_i^H||}^2}{2\sigma_i^2}+ \frac{1}{2}log\sigma_i^2 \\ U:total\ number\ of\ pixels\ in\ uncertain\ region Luncertainty​=U1​iϵU∑​2σi2​∣∣SiH​−GiH​∣∣2​+21​logσi2​U:total number of pixels in uncertain region

LHRRN=Luncertainty+L1L_{HRRN} = L_{uncertainty} + L_1 LHRRN​=Luncertainty​+L1​

五、评估材料

  • MAE
  • F-measure (FβF_\betaFβ​ and FβmaxF_\beta ^{max}Fβmax​)
  • Structure Measure
  • RP曲线
  • BDE ( Boundary Displacement Error) 边界漂移误差
  • BμB_\muBμ​

六、结论

​ In this paper, we argue that there are two difficult and inherently different problems in high-resolution SOD. From this perspective, we propose a novel deep learning framework to disentangle the high-resolution SOD into two tasks: LRSCN and HRRN. LRSCN can identify the definite salient, background and uncertain regions at low-resolution with sufficient semantics. While HRRN can accurately refining the saliency value of pixels in the uncertain region to preserve a clear object boundary at high-resolution with limited GPU memory. We also make the earliest efforts to introduce the uncertainty into SOD network training, which empower HRRN to learn rich details without using any high-resolution training datasets. Extensive evaluations on high-resolution datasets and popular benchmark datasets not only verify the superiority of our method but also demonstrate the importance of disentanglement for SOD. We believe our novel disentanglement view in this work can contribute to other high-resolution computer vision tasks in the future.

(2021 ICCV) Disentangled High Quality Salient Object Detection (A类)相关推荐

  1. [论文阅读] Disentangled High Quality Salient Object Detection

    论文地址:https://arxiv.org/abs/2108.03551 代码:https://github.com/luckybird1994/HQSOD 发表于:ICCV'21 Abstract ...

  2. [论文阅读] Stereoscopically Attentive Multi-scale Network for Lightweight Salient Object Detection

    论文地址:https://dx.doi.org/10.1109/TIP.2021.3065239 代码:https://mmcheng.net/SAMNet 发表于:TIP 2021 Abstract ...

  3. [论文阅读] Looking for the Detail and Context Devils: High-Resolution Salient Object Detection

    论文地址:https://dx.doi.org/10.1109/TIP.2020.3045624 发表于:TIP 2021 Abstract 近年来,随着大规模基准测试与深度学习技术的成就,显著目标检 ...

  4. Revisiting Salient Object Detection: Simultaneous Detection, Ranking, and Subitizing of Multiple Sal

    问题: 作者认为,显著性目标检测领域迄今为止的工作解决的是一个相当病态的问题.即不同的人对于什么是显著性目标没有一个普遍的一致意见.这意味着一些目标会比另一些目标更加显著,并且不同的显著性目标中存在着 ...

  5. [论文阅读] Boosting Salient Object Detection with Transformer-based Asymmetric Bilateral U-Net

    论文地址:https://arxiv.org/abs/2108.07851 发表于:Arxiv 2021.08 Abstract 现有的显著目标检测(SOD)方法主要依靠基于CNN的U型结构,通过跨层 ...

  6. Complementary Trilateral Decoder for Fast and Accurate Salient Object Detection(速读啊)内含与u-shape的对比

    今天早早起来了 吃完饭就开始干活了 十点开始读论文 所以速读适合没有很长事假你的情况下,你只需要读懂大意就可以了 QAQ,bhys,以后一定精读,好好找找里面的专业名词整理下来呜呜呜 这次策略跟以前差 ...

  7. Weakly Supervised Video Salient Object Detection

    Weakly Supervised Video Salient Object Detection 摘要 1. Introduction 2. Related Work 3. Our Method 3. ...

  8. Dynamic Selective Network for RGB-D Salient Object Detection

    Dynamic Selective Network for RGB-D Salient Object Detection 用于 RGB-D 显着目标检测的动态选择网络 IEEE TRANSACTION ...

  9. 文献阅读20期:Transformer Transforms Salient Object Detection and Camouflaged Object Detection

    [ 文献阅读 ] Transformer Transforms Salient Object Detection and Camouflaged Object Detection [1] 表现SOTA ...

最新文章

  1. Visual C++ 2011-5-20
  2. centos普通用户修改文件权限_用户管理(特殊权限、特殊属性、umask 默认权限 )
  3. JEEWX 使用ngrok将本地Web服务映射到外网
  4. 零火线都带电该怎么处理?
  5. AutoCAD.net: 用于ObjectARX 开发的Visual Studio 智能感知文件
  6. APS背景和入门基础
  7. 前端调用websocket的3种不同写法
  8. 修炼Python基础篇-字典(Dictionary)学习
  9. 数论基础——扩展欧几里得【详细】
  10. CVPR-2020 AAAI2020 CVPR-2019 NIPS-2019 ICCV-2019 IJCAI-2019 论文超级大合集下载,整理好累,拿走不谢
  11. 固态硬盘进阶知识:颗粒篇
  12. vivos9e防抖功能设置教程
  13. 第一个TensorFlow程序
  14. from flask._compat import text_type ModuleNotFoundError: No module named ‘flask._compa‘
  15. 网络渗透测试实验二(网络扫描与网络侦察)
  16. 背景设置透明度字体不透明
  17. 【2022年9月】237条微信内置浏览器UA
  18. 固高控制卡Home回零的使用
  19. CSDN账号修改绑定邮箱
  20. 数据库大批量SQL插入性能优化

热门文章

  1. linux find 更新时间,Linux查找find命令详解
  2. 入驻亚马逊你该知道这些!—跨海汇
  3. asp.net829家乡土特产销售网站#毕业设计
  4. 【新教程】VR博物馆项目实战教程
  5. 8月份各行业头部网站宕机汇总 - Blexb
  6. 35岁转行网络安全来得及吗?有发展空间吗?
  7. jq 移动端网页分享功能_原生javascript实现分享到朋友圈功能 支持ios和android
  8. 我的百度地图位置定位服务器,百度地图手机版如何进行我的位置定位?
  9. 微信官方服务器什么时候恢复,微信上午“罢工” 因服务器故障所致 预计后天恢复...
  10. 杨老师课堂之JavaScript定时器案例的红绿灯设计