voxel51 / fiftyone

Refine high-quality datasets and visual AI models
https://fiftyone.ai
Apache License 2.0
8.85k stars 558 forks source link

[?] What would be the dataset_type for image captioning dataset? #2149

Open nahidalam opened 2 years ago

nahidalam commented 2 years ago

URL(s) with the issue:

Is there a way to specify image captioning datasets here with voxel51? For example, if I want to load the flickr8k dataset with fiftyone similar to below, what would be the dataset_type field?

import fiftyone as fo

name = "flickr8k"
dataset_dir = "path_to_dataset"

# Create the dataset
dataset = fo.Dataset.from_dir(
    dataset_dir=dataset_dir,
    #dataset_type=fo.types.WHAT_WOULD_GO_HERE,
    name=name,
)
session = fo.launch_app(dataset)
session.wait()

Description of proposal (what needs changing):

Please add documentation on what would be the dataset_type for image captioning dataset

Willingness to contribute

The FiftyOne Community encourages documentation contributions. Would you or another member of your organization be willing to contribute a fix for this documentation issue to the FiftyOne codebase?

brimoor commented 2 years ago

Hi! If the flickr8k dataset isn't formatted in any of the formats described on this page, then I would recommend just writing a simple Python loop to load the data, as demonstrated here: https://voxel51.com/docs/fiftyone/user_guide/dataset_creation/index.html#custom-formats