Open Conchylicultor opened 4 years ago
@Conchylicultor: I am working on translation-type datasets.
Edit: I have made the changes but not yet sent a PR. Could you please review the show_examples output for translation-type datasets in Colab: https://colab.research.google.com/drive/1LDXsE2tAxbn8qnhqpxUnXVpzltpSiYts I am just printing the texts, since they can't really be visualized in a figure the way images can.
@Conchylicultor: Please review PR https://github.com/tensorflow/datasets/pull/1547
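For readers wondering what "just printing the texts" could look like: translation examples in TFDS are dictionaries keyed by language code, usually holding UTF-8 byte strings. The sketch below is only an illustration under that assumption, not the code from the Colab or the PR; the helper name format_translation_examples is hypothetical.

```python
def format_translation_examples(examples, languages):
    """Render translation examples as plain text, one block per example.

    `examples` is assumed to be a list of dicts mapping a language code
    to a str or UTF-8 encoded bytes value (hypothetical input shape).
    """
    lines = []
    for index, example in enumerate(examples):
        lines.append(f"Example {index}:")
        for lang in languages:
            text = example[lang]
            # Byte strings are decoded so the output is readable text.
            if isinstance(text, bytes):
                text = text.decode("utf-8")
            lines.append(f"  {lang}: {text}")
    return "\n".join(lines)


demo_examples = [
    {"en": b"Hello", "fr": b"Bonjour"},
    {"en": "Thanks", "fr": "Merci"},
]
demo_output = format_translation_examples(demo_examples, ["en", "fr"])
print(demo_output)
```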
Hello, I would like to contribute to this issue. This is my first time, so could you please guide me? As I understand it, I only need to add visualization for these formats by making some updates in visualization_test.py?
@Conchylicultor: I am working on the COCO object detection dataset, but it was too big to load in my Colab. Can I use a dataset from outside TensorFlow Datasets, since the TensorFlow object detection datasets are too big?
@vinayvr11 Why don't you have a look at our catalog, which displays the download size? For instance, voc/2007 is about 1GB: https://www.tensorflow.org/datasets/catalog/voc
Hi, I was writing code to support object detection datasets, but I got stuck on the values of the BBox feature. They are all between 0 and 1, and I checked in the source that this is intended. I am just confused about what scale I should use when plotting the bounding boxes.
I figured it out. Here is the notebook link. @Conchylicultor Please review the changes.
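For anyone with the same question about scale: TFDS bounding boxes are stored as fractions of the image size, in the order ymin, xmin, ymax, xmax, so each coordinate is multiplied by the image height or width before plotting. A minimal sketch follows; it is not the code from the notebook above, and the BBox tuple and to_pixel_rect helper are hypothetical names used only for illustration.

```python
from typing import NamedTuple, Tuple


class BBox(NamedTuple):
    """Normalized box; every value is a fraction of the image size in [0, 1]."""
    ymin: float
    xmin: float
    ymax: float
    xmax: float


def to_pixel_rect(bbox: BBox, height: int, width: int) -> Tuple[float, float, float, float]:
    """Convert a normalized box to (left, top, box_width, box_height) in pixels."""
    left = bbox.xmin * width    # x values scale with the image width
    top = bbox.ymin * height    # y values scale with the image height
    box_width = (bbox.xmax - bbox.xmin) * width
    box_height = (bbox.ymax - bbox.ymin) * height
    return (left, top, box_width, box_height)


rect = to_pixel_rect(BBox(ymin=0.25, xmin=0.5, ymax=0.75, xmax=1.0), height=200, width=400)
print(rect)
```

The (left, top, width, height) form matches the anchor point and size that matplotlib.patches.Rectangle takes, which is one way to draw the box over the image.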
@VaranRohila Nice, this looks great. Could you send a PR so I can see and review the code?
Edit: Oops, I missed the one you sent. Thank you!
I just did! Link
Just saw this. Thank you!
Hi, is anyone currently working on audio data visualization, e.g. groove? If not, I would be grateful for any guidance on what the visualization should include (should the output be a few random samples of audio, or a visual representation of the audio dataset's diversity, such as classes, frequency, range of audio, etc.)?
Edit: both ljspeech and librispeech seem to be inaccessible owing to still being in development phase.
@harshitadd Thank you for looking into this! I don't think anyone is working on audio yet.
For the output, I think both image and audio representation could be helpful, but you can start with anything. IPython.display.Audio might be helpful to display audio.
Also have a look at my comment in: https://github.com/tensorflow/datasets/pull/1639#discussion_r391848285 to try to factorise this new feature in independent classes.
"both ljspeech and librispeech seem to be inaccessible owing to still being in development phase"
What do you mean by "development phase"? The dataset statistics are available on our website, which seems to indicate that the data were generated successfully: https://www.tensorflow.org/datasets/catalog/librispeech. Are you using the latest TFDS version? If there is an issue with those datasets, please report a bug.
@Conchylicultor Thank you for your prompt reply. Regarding the latter comment: I am running tfds version 2.0.0, and passing librispeech to tfds.load() gives the error "Dataset librispeech is under active development and is not available yet". 'ljspeech' simply returns "dataset not found". Kindly advise. If there are no other alternatives I can use to load them, I will report it as a bug.
As for the former comment, I shall try to include a Colab link for code review as soon as possible.
Thanks!
Did you try with tfds 2.1.0 or tfds-nightly ?
Thanks a lot. Both ljspeech and librispeech work with tfds-nightly (though not with tfds 2.1.0).
Here is the first draft, which covers the visual and audio displays of the passed dataset. Kindly review the generated output format. If this is what is required, I shall further optimize the code and submit it for review.
A few questions/notes regarding this output:
Thanks!
@harshitadd, thank you for the quick implementation. This looks nice. Can you send a PR so I can comment on the code ?
[...] tfds.show_examples(). You can assume some fixed sample rate if not known.
@Conchylicultor I am working on showing examples for video datasets.
Thanks a lot for the input! I have made a PR for this: https://github.com/tensorflow/datasets/pull/1683 Truncating the audio to a certain length definitely makes the file size quite manageable, so I have added a basic implementation of that.
Edit: I couldn't fix the LJspeech bug. I would be very grateful if you could point out where the code is flawed in that respect too.
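To illustrate why truncation keeps the output small, here is a rough sketch using only the Python standard library. It is not the implementation from the PR: the helper names, the 16 kHz sample rate (a fixed rate assumed because the real one may be unknown, as suggested earlier in the thread), and the 2-second limit are all assumptions made for the example.

```python
import io
import math
import struct
import wave

ASSUMED_SAMPLE_RATE = 16000  # assumption: fixed rate used when the real one is unknown


def truncate_audio(samples, sample_rate, max_seconds):
    """Keep only the first `max_seconds` of a mono sample sequence."""
    max_len = int(sample_rate * max_seconds)
    return samples[:max_len]


def to_wav_bytes(samples, sample_rate):
    """Encode 16-bit mono PCM samples as an in-memory WAV file."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav_file:
        wav_file.setnchannels(1)   # mono
        wav_file.setsampwidth(2)   # 2 bytes per sample (16-bit)
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(struct.pack("<%dh" % len(samples), *samples))
    return buffer.getvalue()


# Five seconds of a 440 Hz tone stands in for a real dataset example.
full_clip = [
    int(10000 * math.sin(2 * math.pi * 440 * n / ASSUMED_SAMPLE_RATE))
    for n in range(5 * ASSUMED_SAMPLE_RATE)
]
short_clip = truncate_audio(full_clip, ASSUMED_SAMPLE_RATE, max_seconds=2)
full_wav = to_wav_bytes(full_clip, ASSUMED_SAMPLE_RATE)
short_wav = to_wav_bytes(short_clip, ASSUMED_SAMPLE_RATE)
print(len(full_wav), len(short_wav))
```

In a notebook, the resulting bytes could then be passed to IPython.display.Audio through its data argument to get a playable widget.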
Task: Currently tfds.show_examples() only works for supervised image datasets. It would be good to extend the heuristic to more dataset types, like:
[...] (div2k, cityscapes)
Instructions:
[...] show_examples [...] visualization.py to support new dataset types of your choice.
[...] visualization_test.py with dummy data to test the visualization of the new dataset.
As multiple data types exist, multiple people can work on this issue at the same time.