AlKun25 / WeHear

MIT License

Create a Data Loader for Lightning Pipeline #1

Open AlKun25 opened 10 months ago

AlKun25 commented 10 months ago

Problem Statement:

As part of our PyTorch-based pipeline, we require a data loader that can efficiently load and preprocess our dataset. Currently, we lack a dedicated data loader for this purpose.

Key Features and Requirements:

Pass condition:

The baseline runs end to end on the toy split of the dataset.

Additional Context:

This should be implemented as modular code in a separate file that can later be imported wherever required. It would be great if the loader also reported the data dimensions as output. We also need to understand whether the data loader differs between the regression and match-mismatch tasks.
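As a starting point for discussion, here is a minimal sketch of what such a module could look like. Everything in it is an assumption rather than the organiser's implementation: the class and function names (`WindowedRegressionDataset`, `build_loader`, `describe_batch`) are hypothetical, the toy tensors are random stand-ins for the real toy split, and the shapes (64 input channels, 1 target dimension, 320-sample windows) are placeholders. It uses plain `torch.utils.data` so it can later be wrapped in a Lightning data module.

```python
import torch
from torch.utils.data import DataLoader, Dataset


class WindowedRegressionDataset(Dataset):
    """Hypothetical dataset that cuts a paired recording into fixed-length windows."""

    def __init__(self, inputs, targets, window_length, hop_length):
        # Assumed layout: inputs is (time, channels), targets is (time, target_dims).
        if inputs.shape[0] != targets.shape[0]:
            raise ValueError("inputs and targets must share the time axis")
        self.inputs = inputs
        self.targets = targets
        self.window_length = window_length
        # Start index of every full window; a trailing partial window is dropped.
        last_start = inputs.shape[0] - window_length
        self.starts = list(range(0, last_start + 1, hop_length))

    def __len__(self):
        return len(self.starts)

    def __getitem__(self, index):
        start = self.starts[index]
        stop = start + self.window_length
        return self.inputs[start:stop], self.targets[start:stop]


def build_loader(dataset, batch_size, shuffle=False):
    """Wrap a dataset in a standard PyTorch DataLoader."""
    return DataLoader(dataset, batch_size=batch_size, shuffle=shuffle)


def describe_batch(loader):
    """Return the shapes of the first batch so the data dimensions can be checked."""
    batch_inputs, batch_targets = next(iter(loader))
    return {"input": tuple(batch_inputs.shape), "target": tuple(batch_targets.shape)}


# Random stand-in for the toy split (placeholder sizes, not the real data).
toy_inputs = torch.randn(1000, 64)
toy_targets = torch.randn(1000, 1)
toy_dataset = WindowedRegressionDataset(
    toy_inputs, toy_targets, window_length=320, hop_length=160
)
toy_loader = build_loader(toy_dataset, batch_size=2)
batch_shapes = describe_batch(toy_loader)
print(batch_shapes)
```

Keeping the windowing in the dataset and the batching in `build_loader` means a match-mismatch variant would probably only need a different `__getitem__` (for example, returning candidate segments plus a label), while the loader and the shape report could stay the same. That is a guess until we compare against the organiser's code.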

References:

You can have a look at the implementation by the competition organiser in their repository.

AlKun25 commented 9 months ago

I will focus on the Task 2 (regression) implementation first.