python：VOC格式数据集转换为YOLO数据集格式

57 阅读 0 评论 0 点赞

在计算机视觉领域，数据集的格式统筹决定了模型训练的效率和效果。常见的目标检测数据集格式有Pascal VOC格式和YOLO格式。虽说这两者都是为了标注图像中的物体，但它们的文件结构和标注方式却有所不同。本文将详细介绍如何将Pascal VOC格式的数据集转换为YOLO格式，并附上相应的Python代码示例。

Pascal VOC格式简介

Pascal VOC（视觉对象类别挑战）是一个广泛使用的图像数据集格式。其数据集中的标注文件通常为XML格式，包含了图像中每个物体的类别及其在图像中的位置信息（如边界框的坐标）。标注文件的结构大致如下所示：

<annotation>
    <folder>images</folder>
    <filename>image1.jpg</filename>
    <size>
        <width>400</width>
        <height>400</height>
        <depth>3</depth>
    </size>
    <object>
        <name>cat</name>
        <bndbox>
            <xmin>50</xmin>
            <ymin>50</ymin>
            <xmax>150</xmax>
            <ymax>150</ymax>
        </bndbox>
    </object>
</annotation>

YOLO格式简介

YOLO（You Only Look Once）格式则使用纯文本文件来标注每张图片，其每一行包含一个目标的类别和其边界框的归一化坐标（x_center, y_center, width, height）。对于每个图像，标注文件的格式如下：

0 0.32 0.32 0.25 0.25

这里的数字代表： - 0：物体的类别（从0开始计数） - 0.32：物体中心点的x坐标（相对于图像宽度） - 0.32：物体中心点的y坐标（相对于图像高度） - 0.25：物体框宽度（相对于图像宽度） - 0.25：物体框高度（相对于图像高度）

转换步骤

要将Pascal VOC格式转换为YOLO格式，主要的步骤有： 1. 读取Pascal VOC的XML标注文件。 2. 提取每个物体的类别和边界框信息。 3. 将边界框的坐标转换为YOLO格式的归一化坐标。 4. 将转换后的数据保存为YOLO格式的文本文件。

Python代码示例

以下是一个将Pascal VOC格式转换为YOLO格式的Python示例代码：

import os
import xml.etree.ElementTree as ET

def voc_to_yolo(voc_annotation_path, yolo_annotation_path, classes):
    """
    将Pascal VOC格式转换为YOLO格式
    :param voc_annotation_path: VOC格式XML文件目录
    :param yolo_annotation_path: YOLO格式TXT文件保存目录
    :param classes: 类别名称列表
    """
    if not os.path.exists(yolo_annotation_path):
        os.makedirs(yolo_annotation_path)

    for xml_file in os.listdir(voc_annotation_path):
        if xml_file.endswith('.xml'):
            tree = ET.parse(os.path.join(voc_annotation_path, xml_file))
            root = tree.getroot()

            filename = root.find('filename').text
            width = int(root.find('size/width').text)
            height = int(root.find('size/height').text)

            yolo_labels = []

            for obj in root.iter('object'):
                class_name = obj.find('name').text
                class_id = classes.index(class_name)  # 获取类别索引

                bndbox = obj.find('bndbox')
                xmin = int(bndbox.find('xmin').text)
                ymin = int(bndbox.find('ymin').text)
                xmax = int(bndbox.find('xmax').text)
                ymax = int(bndbox.find('ymax').text)

                # 计算YOLO格式
                x_center = (xmin + xmax) / 2 / width
                y_center = (ymin + ymax) / 2 / height
                obj_width = (xmax - xmin) / width
                obj_height = (ymax - ymin) / height

                yolo_labels.append(f"{class_id} {x_center} {y_center} {obj_width} {obj_height}")

            # 保存到YOLO格式文件
            with open(os.path.join(yolo_annotation_path, filename.replace('.jpg', '.txt')), 'w') as f:
                for label in yolo_labels:
                    f.write(label + '\n')

# 使用示例
voc_annotation_folder = 'path/to/voc/annotations'
yolo_annotation_folder = 'path/to/yolo/annotations'
classes = ['cat', 'dog', 'person']  # 根据实际类别调整
voc_to_yolo(voc_annotation_folder, yolo_annotation_folder, classes)

总结

本文介绍了Pascal VOC格式与YOLO格式的基本结构以及如何实现格式转换。通过Python脚本，用户可以快速高效地将数据集从一个格式转换为另一个格式，以满足不同深度学习框架的需求。希望这对你在目标检测任务中有所帮助！

点赞(0) 打赏

本文分类：后端
本文标签：目标跟踪人工智能 YOLO
浏览次数：57 次浏览
发布日期：2024-09-21 22:02:40
本文链接：http://makehui.com/houduan/339.html

python：VOC格式数据集转换为YOLO数据集格式

Pascal VOC格式简介

YOLO格式简介

转换步骤

Python代码示例

总结

微信扫一扫：分享

Python 3.12 环境搭建（Windows版）

Python 列表全方位解析：创建、操作、删除与遍历的全面指南

C++探索之旅：打造高效二叉搜索树的奥秘与实践

换脸插件升级导致SDWebUI无法启动cannot import name ‘Undefined‘ from ‘pydantic.fields‘

微信扫一扫：分享