论文阅读 [TPAMI-2022] Grid Anchor Based Image Cropping: A New Benchmark and An Efficient Model

论文搜索(studyai.com)

搜索论文: Grid Anchor Based Image Cropping: A New Benchmark and An Efficient Model

搜索论文: http://www.studyai.com/search/whole-site/?q=Grid+Anchor+Based+Image+Cropping:+A+New+Benchmark+and+An+Efficient+Model

关键字(Keywords)

Agriculture; Measurement; Databases; Robustness; Benchmark testing; Training; Image cropping; photo cropping; image aesthetics; deep learning

机器视觉

数据扩增

摘要(Abstract)

Image cropping aims to improve the composition as well as aesthetic quality of an image by removing extraneous content from it.

图像裁剪旨在通过去除图像中的无关内容来改善图像的构图和审美质量。.

Most of the existing image cropping databases provide only one or several human-annotated bounding boxes as the groundtruths, which can hardly reflect the non-uniqueness and flexibility of image cropping in practice.

现有的图像裁剪数据库大多只提供一个或多个人类标注的边界框作为基本事实,这很难反映实际中图像裁剪的非唯一性和灵活性。.

The employed evaluation metrics such as intersection-over-union cannot reliably reflect the real performance of a cropping model, either.

所采用的评估指标(如联合上的交叉)也不能可靠地反映裁剪模型的真实性能。.

This work revisits the problem of image cropping, and presents a grid anchor based formulation by considering the special properties and requirements (e.g., local redundancy, content preservation, aspect ratio) of image cropping.

这项工作重新审视了图像裁剪问题,并通过考虑图像裁剪的特殊属性和要求(例如,局部冗余、内容保留、纵横比),提出了一种基于网格锚的公式。.

Our formulation reduces the searching space of candidate crops from millions to no more than ninety.

我们的公式将候选作物的搜索空间从数百万减少到不超过90。.

Consequently, a grid anchor based cropping benchmark is constructed, where all crops of each image are annotated and more reliable evaluation metrics are defined.

因此,构建了一个基于网格锚的裁剪基准,其中每个图像的所有裁剪都被注释,并定义了更可靠的评估指标。.

To meet the practical demands of robust performance and high efficiency, we also design an effective and lightweight cropping model.

为了满足高性能和高效率的实际需求,我们还设计了一种高效、轻量级的裁剪模型。.

By simultaneously considering the region of interest and region of discard, and leveraging multi-scale information, our model can robustly output visually pleasing crops for images of different scenes.

通过同时考虑感兴趣区域和丢弃区域,并利用多尺度信息,我们的模型可以为不同场景的图像稳健地输出视觉上令人愉悦的作物。.

With less than 2.5M parameters, our model runs at a speed of 200 FPS on one single GTX 1080Ti GPU and 12 FPS on one i7-6800K CPU.

在不到2.5M的参数下,我们的模型在一个GTX 1080Ti GPU上以200 FPS的速度运行,在一个i7-6800K CPU上以12 FPS的速度运行。.

The code is available at: https://github.com/HuiZeng/Grid-Anchor-based-Image-Cropping-Pytorch…

该代码可从以下网址获取:https://github.com/HuiZeng/Grid-Anchor-based-Image-Cropping-Pytorch…

作者(Authors)

[‘Hui Zeng’, ‘Lida Li’, ‘Zisheng Cao’, ‘Lei Zhang’]

论文阅读 [TPAMI-2022] Grid Anchor Based Image Cropping: A New Benchmark and An Efficient Model相关推荐

  1. 论文阅读:Visual Semantic Localization based on HD Map for AutonomousVehicles in Urban Scenarios

    题目:Visual Semantic Localization based on HD Map for Autonomous Vehicles in Urban Scenarios 中文:基于高清地图 ...

  2. 【论文阅读】A2S-Det: Efficiency Anchor Matching in Aerial Image Oriented Object Detection

    A 2 S-Det:航空图像定向目标检测中的高效锚点匹配 论文地址:https://www.mdpi.com/2072-4292/13/1/73/htm 二次阅读笔记也可以看看,下面这篇博客翻译会更准 ...

  3. 论文阅读:Semantic Aware Attention Based Deep Object Co-segmentation(ACCV2018)

    协同分割论文:Semantic Aware Attention Based Deep Object Co-segmentation(ACCV2018) 论文原文     code 目录 1.简介 2. ...

  4. 论文阅读:A Novel Graph based Trajectory Predictor with Pseudo Oracle

    A Novel Graph based Trajectory Predictor with Pseudo Oracle 摘要 1 引言 2 相关工作 3 PROPOSED METHOD IV. EXP ...

  5. 【论文阅读】Siamese Neural Network Based Few-Shot Learning for Anomaly Detection in Industrial Cyber-Physi

    文章目录 Abstract 1. Introduction 2. Related Work 2.1 Anomaly Detection techniques for CPS 2.2 Few-Shot ...

  6. 论文阅读 | Residual Conv-Deconv Grid Network for Semantic Segmentation

    GridNet发表在BMVC2017,用于语义分割,一篇很早期的文章 论文地址:[here] (文章没有给代码地址,但是里面的网络设计讲的很详细,可以自己复现出来,github上也有很多别人复现的代码 ...

  7. (论文阅读)2022年一些图像去雾方法的简单调研

    2022年一些图像去雾方法的简单调研 1. Self-augmented Unpaired Image Dehazing via Density and Depth Decomposition 基于密 ...

  8. 论文阅读笔记:Link Prediction Based on Graph Neural Networks

    文章目录 说明 Abstract 1 Introduction 2 Preliminaries Notations Latent features and explicit features Grap ...

  9. 【论文阅读 NeurIPS 2022】A Large Scale Search Dataset for Unbiased Learning to Rank

    文章目录 前言 Abs Intro 2.Preliminary 2.1.Ubiased Learning to Rank 2.2.Existion ULTR Datasets 3.Dataset De ...

最新文章

  1. 为什么我们需要一门新语言——Go语言
  2. 航“空”、航“天”大不同
  3. IDEA系列(十)--新建一个项目后之前的项目不显示
  4. VCS-bilibili教程篇1-Simulation Basics
  5. linux版本和目录结构
  6. C语言 · 数的读法
  7. OpenGL基础26:Assimp库
  8. 9.企业安全建设入门(基于开源软件打造企业网络安全) --- SOC系统建设
  9. 拓端tecdat|R语言对混合分布中的不可观测与可观测异质性因子分析
  10. HBase学习之路 (四)HBase的API操作
  11. Z-TEK CE usb转串口驱动(win32)
  12. openwrt添加SLM750模块驱动
  13. 通过建设银行外联平台进行转账/提现操作(Java)
  14. var模型的建模步骤python_Python语言之概述建模步骤
  15. ZYNQ学习之路5.扩展PL端串口
  16. 第三模块:面向对象网络编程基础 第1章 面向对象
  17. OpenCV开发笔记(四十四):红胖子8分钟带你深入了解霍夫圆变换(图文并茂+浅显易懂+程序源码)
  18. 前端知识合集【重中之重】,我只看这一篇!
  19. 使用python画出彩虹效果
  20. 主成分分析-简单人脸识别(二)

热门文章

  1. 系统性谈谈软件可靠性——第7讲:家电软件出问题的一些思考
  2. Python调用MMDetection实现AI抠图去背景
  3. 今日有会员反映,金山毒霸对x3报毒
  4. 告别Flashget
  5. 爆音在bilibili 的韵脚语录
  6. 多普达838,在关机状态下硬启的方法.
  7. c语言中的头文件stdlib.h的作用,C语言中你可能不熟悉的头文件(stdlib.h)
  8. 经济型EtherCAT运动控制器(八):轴参数与运动指令
  9. 17、前端开发:CSS知识总结——过渡(transition)
  10. 这绝对是你见过的最全深度学习服务器管理配置手册,学不会你打我