Open Rajaro72 opened 1 month ago
Give me your dataset and code you try to run. We'll solve it
Thank you! I tried two attempts, one was to change the VOCannotationsclass to read the CSV files and another attempt which converted the dataset into xml files for each image. I will attach the code files to this comment and here is the link to the dataset:https://www.kaggle.com/datasets/robikscube/textocr-text-extraction-from-images-dataset?select=train_val_images XML_attempt.zip CSV_attempt.zip
I have changed my dataset into xml files for each image, however I have multiple
Has anyone found a way to train the model for multiple bounding boxes?