ultralytics / yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
https://docs.ultralytics.com
GNU Affero General Public License v3.0
50.89k stars 16.38k forks source link

can we split dataset using python if there are some background images inside the dataset ? #9437

Closed yjackboo closed 2 years ago

yjackboo commented 2 years ago

Search before asking

Question

Hi Thank you so much for this great repository.

I am a very beginner to coding and yolo algorithm. I know we can add some background images for our dataset in yolov5 without adding any annotation text file. I know we can divide our dataset manually for train and validation. but I have a problem how to divide a dataset using python script if there are some images they don't have any annotation txt files. is it ok to create empty text files for those images and split the dataset ?. or how can we do that ?

Thank you so much for any help.

Additional

No response

github-actions[bot] commented 2 years ago

👋 Hello @yjackboo, thank you for your interest in YOLOv5 🚀! Please visit our ⭐️ Tutorials to get started, where you can find quickstart guides for simple tasks like Custom Data Training all the way to advanced concepts like Hyperparameter Evolution.

If this is a 🐛 Bug Report, please provide screenshots and minimum viable code to reproduce your issue, otherwise we can not help you.

If this is a custom training ❓ Question, please provide as much information as possible, including dataset images, training logs, screenshots, and a public link to online W&B logging if available.

For business inquiries or professional support requests please visit https://ultralytics.com or email support@ultralytics.com.

Requirements

Python>=3.7.0 with all requirements.txt installed including PyTorch>=1.7. To get started:

git clone https://github.com/ultralytics/yolov5  # clone
cd yolov5
pip install -r requirements.txt  # install

Environments

YOLOv5 may be run in any of the following up-to-date verified environments (with all dependencies including CUDA/CUDNN, Python and PyTorch preinstalled):

Status

CI CPU testing

If this badge is green, all YOLOv5 GitHub Actions Continuous Integration (CI) tests are currently passing. CI tests verify correct operation of YOLOv5 training (train.py), validation (val.py), inference (detect.py) and export (export.py) on macOS, Windows, and Ubuntu every 24 hours and on every commit.

glenn-jocher commented 2 years ago

@yjackboo background images do not need any txt label files. You can also autosplit datasets:

https://github.com/ultralytics/yolov5/blob/c7a2d6bcf4f7e88db53f3d09a8484391dac7bc89/utils/dataloaders.py#L878-L902

yjackboo commented 2 years ago

@glenn-jocher thank you so much sir. I will try above

glenn-jocher commented 1 year ago

@yjackboo you're welcome! If you have any more questions, feel free to ask. Good luck with your project!