Will 'fn.readers.video' support reading visual content and audio content at the same time?

NVIDIA / DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

Apache License 2.0


Is this a new feature, an improvement, or a change to existing functionality?

New Feature

How would you describe the priority of this feature request?

Must have (e.g. DALI adoption is impossible due to lack in functionality).

Please provide a clear description of the problem this feature solves

As a researcher in audio-visual cross-modal learning, I would like DALI to support loading audio and video frames at the same time.

Feature Description

Support loading audio and video frames at the same time, so that both modalities are available for each sample.

Describe your ideal solution

# Desired behavior: nvidia.dali.fn.decoders.video returns the audio along with the frames and label
audio, images, label = nvidia.dali.fn.decoders.video(...)
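To make the requested return value more concrete, here is a minimal pure-Python sketch of how an audio slice could be paired with a clip of frames. It does not use DALI at all: `audio_span_for_clip`, the frame rate, and the sample rate are hypothetical names and values chosen for illustration, and it assumes both streams start at the same time and run at constant rates. An actual operator might align the streams differently.

```python
def audio_span_for_clip(first_frame, num_frames, fps, sample_rate):
    """Return (start, stop) audio sample indices covering a clip of video frames.

    Hypothetical helper, not part of DALI. Assumes audio and video share a
    common start time and that both rates are constant.
    """
    if fps <= 0 or sample_rate <= 0:
        raise ValueError("fps and sample_rate must be positive")
    # Convert frame boundaries to seconds, then to audio sample indices.
    start = round(first_frame * sample_rate / fps)
    stop = round((first_frame + num_frames) * sample_rate / fps)
    return start, stop


# Example: a 16-frame clip starting at frame 30, 30 fps video, 16 kHz audio.
start, stop = audio_span_for_clip(first_frame=30, num_frames=16, fps=30, sample_rate=16000)
```

With a mapping like this, the `audio` output for a sample would be the slice `[start:stop]` of the decoded audio track, returned alongside the `images` for the same clip.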

Describe any alternatives you have considered

No response

Additional context

No response

Check for duplicates

[X] I have searched the open bugs/issues and have found no duplicates for this bug report
