Converting the INRIAPerson dataset to YOLO training format and visualizing the result

Record post: converting the INRIA pedestrian detection dataset into the txt label format that YOLO can train on.

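After unzipping, the INRIA pedestrian detection dataset contains Train and Test folders. The goal is to extract the annotation information stored in them.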
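For orientation, each INRIA annotation file is plain text, and every annotated person has one line that lists the box corners after the marker `(Xmin, Ymin) - (Xmax, Ymax)`. The sketch below parses one such line with a regular expression. The sample line (with made-up coordinates) and the helper name `parse_bbox_line` are assumptions for illustration; they are not part of the script in this post.

```python
import re

# Matches the coordinate part of a line such as:
# Bounding box for object 1 "PASperson" (Xmin, Ymin) - (Xmax, Ymax) : (250, 151) - (299, 294)
BBOX_RE = re.compile(
    r"\(Xmin,\s*Ymin\)\s*-\s*\(Xmax,\s*Ymax\)\s*:\s*"
    r"\((\d+),\s*(\d+)\)\s*-\s*\((\d+),\s*(\d+)\)"
)

def parse_bbox_line(line):
    """Return (xmin, ymin, xmax, ymax) as ints, or None if the line has no box."""
    match = BBOX_RE.search(line)
    if match is None:
        return None
    return tuple(int(v) for v in match.groups())

# Illustrative line; the coordinates are invented for this example.
sample = ('Bounding box for object 1 "PASperson" '
          '(Xmin, Ymin) - (Xmax, Ymax) : (250, 151) - (299, 294)')
print(parse_bbox_line(sample))                      # (250, 151, 299, 294)
print(parse_bbox_line('Image size : 594 x 720'))    # None
```

A regular expression does the same job as the chain of `replace` calls used in the conversion script, but it fails visibly (returns `None`) when a line does not have the expected shape.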

Conversion code

# coding=UTF-8
import os
from PIL import Image

sets = ['train']

# Fill in image_path, annotations_path and full_path (inside text_create) for your machine.
image_path = r"D:\BaiduNetdiskDownload\59_INRIA Person Dataset\shuju1/"  # image directory (fixed path)
annotations_path = r"D:\BaiduNetdiskDownload\59_INRIA Person Dataset\INRIAPerson\Test\annotations/"  # INRIA annotation directory
annotations = os.listdir(annotations_path)  # names of all files in the annotation directory


# Collect the paths of all .jpg images under file_dir.
def get_name(file_dir):
    list_file = []
    for root, dirs, files in os.walk(file_dir):
        for file in files:
            # splitext() splits a path into name + extension,
            # e.g. os.path.splitext("E:/lena.jpg") gives ("E:/lena", ".jpg").
            if os.path.splitext(file)[1] == '.jpg':
                list_file.append(os.path.join(root, file))
    return list_file


# Append one box to the label txt file of the given image.
def text_create(name, bnd):
    full_path = r"D:\BaiduNetdiskDownload\59_INRIA Person Dataset\labels1/%s.txt" % (name)
    size = get_size(name + '.png')
    convert_size = convert(size, bnd)
    # 'with' closes the file after each write; the class id is always 0 (person).
    with open(full_path, 'a') as file:
        file.write('0 ' + ' '.join(str(v) for v in convert_size))
        file.write('\n')


# Return the (w, h) of the source image.
def get_size(image_id):
    im = Image.open(r'D:\BaiduNetdiskDownload\59_INRIA Person Dataset\INRIAPerson\Test\pos/%s' % (image_id))  # source image directory
    w, h = im.size
    return (w, h)


# Convert a pixel box (xmin, ymin, xmax, ymax) to YOLO's normalized (x_center, y_center, w, h).
def convert(size, box):
    dw = 1. / size[0]
    dh = 1. / size[1]
    x = (box[0] + box[2]) / 2.0
    y = (box[1] + box[3]) / 2.0
    w = box[2] - box[0]
    h = box[3] - box[1]
    x = x * dw
    w = w * dw
    y = y * dh
    h = h * dh
    return (x, y, w, h)


# Write the paths of the processed images into a txt file.
for image_set in sets:
    labels_dir = r'D:\BaiduNetdiskDownload\59_INRIA Person Dataset\labels1'  # output directory for the YOLO labels (fixed path)
    if not os.path.exists(labels_dir):
        os.makedirs(labels_dir)
    image_names = get_name(image_path)
    list_file = open('2007_%s.txt' % (image_set), 'w')
    for image_name in image_names:
        list_file.write('%s\n' % (image_name))
    list_file.close()

# Walk through the annotation files and write one YOLO line per bounding box.
for file in annotations:
    str_name = file.replace('.txt', '')
    file_path = os.path.join(annotations_path, file)
    if not os.path.isdir(file_path):  # only open regular files, not sub-directories
        with open(file_path) as f:
            for line in f:  # read the file line by line
                str_XY = "(Xmax, Ymax)"
                if str_XY in line:
                    # Keep what follows the marker and strip the punctuation,
                    # leaving four numbers: xmin ymin xmax ymax.
                    strlist1 = "".join(line.split(str_XY)[1:])
                    for ch in ':-(),':
                        strlist1 = strlist1.replace(ch, '')
                    b = strlist1.split()
                    bnd = (float(b[0]), float(b[1]), float(b[2]), float(b[3]))
                    text_create(str_name, bnd)
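As a worked example of the coordinate conversion used above, the sketch below restates `convert` in a compact form and applies it to a hypothetical 640x480 image with one box. The image size and box values are invented for illustration.

```python
def convert(size, box):
    """size = (img_w, img_h); box = (xmin, ymin, xmax, ymax) in pixels.

    Returns YOLO's normalized (x_center, y_center, width, height).
    """
    img_w, img_h = size
    xmin, ymin, xmax, ymax = box
    cx = (xmin + xmax) / 2.0 / img_w
    cy = (ymin + ymax) / 2.0 / img_h
    w = (xmax - xmin) / img_w
    h = (ymax - ymin) / img_h
    return (cx, cy, w, h)

# Hypothetical 640x480 image with a person box from (160, 120) to (480, 360).
yolo_box = convert((640, 480), (160, 120, 480, 360))
label_line = '0 ' + ' '.join(str(v) for v in yolo_box)
print(yolo_box)    # (0.5, 0.5, 0.5, 0.5)
print(label_line)  # 0 0.5 0.5 0.5 0.5
```

The box is centered in the image and covers half of each dimension, so all four normalized values come out as 0.5. Each line of a label file has this shape: class id first, then the four normalized values.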

Visualization

To check that the conversion is correct, I wrote a short visualization script.


import os
import cv2

img_path = r'D:\BaiduNetdiskDownload\59_INRIA Person Dataset\INRIAPerson\Train\pos/'
label_path = r'D:\BaiduNetdiskDownload\59_INRIA Person Dataset\labels/'
f = os.listdir(img_path)


def paint(label_file, img_file):
    # Read the image
    img = cv2.imread(img_file)
    img_h, img_w, _ = img.shape
    with open(label_file, 'r') as f:
        # Skip blank lines so they do not break the unpacking below
        obj_lines = [l.strip() for l in f.readlines() if l.strip()]
    for obj_line in obj_lines:
        cls, cx, cy, nw, nh = [float(item) for item in obj_line.split()]
        color = (0, 0, 255) if cls == 0.0 else (0, 255, 0)
        # Convert the normalized center/size back to pixel corners
        x_min = int((cx - (nw / 2.0)) * img_w)
        y_min = int((cy - (nh / 2.0)) * img_h)
        x_max = int((cx + (nw / 2.0)) * img_w)
        y_max = int((cy + (nh / 2.0)) * img_h)
        cv2.rectangle(img, (x_min, y_min), (x_max, y_max), color, 2)
    # Show the image with all boxes drawn; press any key for the next one
    cv2.imshow('Ima', img)
    cv2.waitKey(0)


for i in f:
    # Swap only the extension, so 'png' elsewhere in the name is left alone
    label_path_name = label_path + os.path.splitext(i)[0] + '.txt'
    img_path_name = img_path + i
    print(label_path_name)
    print(img_path_name)
    paint(label_path_name, img_path_name)
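The same check can also be done without opening a window. The sketch below converts a label line back to pixel corners, using the same arithmetic as `paint`, and tests that the four box values are normalized. The helper names `yolo_to_pixels` and `is_normalized` and the sample values are assumptions for illustration.

```python
def yolo_to_pixels(label_line, img_w, img_h):
    """Convert one 'cls cx cy w h' label line to (cls, xmin, ymin, xmax, ymax) in pixels."""
    cls, cx, cy, nw, nh = [float(v) for v in label_line.split()]
    x_min = int((cx - nw / 2.0) * img_w)
    y_min = int((cy - nh / 2.0) * img_h)
    x_max = int((cx + nw / 2.0) * img_w)
    y_max = int((cy + nh / 2.0) * img_h)
    return (int(cls), x_min, y_min, x_max, y_max)

def is_normalized(label_line):
    """True if the line has four box values and all of them lie in [0, 1]."""
    values = [float(v) for v in label_line.split()[1:]]
    return len(values) == 4 and all(0.0 <= v <= 1.0 for v in values)

# Hypothetical label line for a 640x480 image.
line = "0 0.5 0.5 0.5 0.5"
print(yolo_to_pixels(line, 640, 480))  # (0, 160, 120, 480, 360)
print(is_normalized(line))             # True
```

If `is_normalized` returns False for a label line, the conversion most likely read the wrong image size or mixed up the corner order.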

The visualization shows that in multi-person scenes this dataset annotates only a few of the people.
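One way to see how sparse the annotations are is to count the boxes in each generated label file. The sketch below does this for a directory of YOLO txt files. The helper name `count_boxes` and the throwaway demo directory with made-up labels are assumptions for illustration; point it at your own label directory in practice.

```python
import os
import tempfile

def count_boxes(label_dir):
    """Map each .txt label file name to the number of non-empty lines (boxes) in it."""
    counts = {}
    for name in sorted(os.listdir(label_dir)):
        if not name.endswith('.txt'):
            continue
        with open(os.path.join(label_dir, name)) as f:
            counts[name] = sum(1 for line in f if line.strip())
    return counts

# Demo on a temporary directory with two made-up label files.
demo_dir = tempfile.mkdtemp()
with open(os.path.join(demo_dir, 'a.txt'), 'w') as f:
    f.write('0 0.5 0.5 0.2 0.6\n0 0.3 0.5 0.1 0.4\n')
with open(os.path.join(demo_dir, 'b.txt'), 'w') as f:
    f.write('0 0.5 0.5 0.5 0.5\n')

demo_counts = count_boxes(demo_dir)
print(demo_counts)  # {'a.txt': 2, 'b.txt': 1}
```

Comparing these counts with what is visible in the images gives a rough idea of how many people are left unlabeled.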
