Closed Shashankss1205 closed 3 months ago
Hi @Shashankss1205, thank you for taking the first step. We will take some time to review it and get back to you.
I'd like to suggest a few things to keep in mind. The dataset we will be analysing consists of data collected from social networks relevant to the Indian context, so analysis of musical features and genre classification is not an intended use case.
I would also like to propose two changes to the notebook:
Ideally, we'd like to mount our own Google Drive in your notebook and run it against our dataset. Could you document the directory structure and naming convention your code expects, so that it's easy for us to run it against our dataset?
While I see that you have implemented some clustering in your notebook, it would be nice if you added some code so we could preview/hear the audio files in a cluster within the notebook itself. This would help us evaluate how good the clustering is for a particular use case.
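The two requests above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the notebook's actual code: the helper names (`list_audio_files`, `group_by_cluster`, `preview_cluster`), the set of audio extensions, and the idea that the clustering step yields one label per file are all hypothetical. The inline player relies on `IPython.display.Audio`, which is available in Colab and Jupyter.

```python
from collections import defaultdict
from pathlib import Path

# Assumed set of extensions; adjust to whatever the dataset actually contains.
AUDIO_EXTENSIONS = {".wav", ".mp3", ".ogg", ".flac", ".m4a"}


def list_audio_files(root):
    """Return sorted paths of audio files found anywhere under `root`."""
    return sorted(
        p for p in Path(root).rglob("*")
        if p.is_file() and p.suffix.lower() in AUDIO_EXTENSIONS
    )


def group_by_cluster(paths, labels):
    """Map each cluster label to the list of files assigned to it."""
    if len(paths) != len(labels):
        raise ValueError("paths and labels must have the same length")
    clusters = defaultdict(list)
    for path, label in zip(paths, labels):
        clusters[label].append(path)
    return dict(clusters)


def preview_cluster(clusters, label, limit=5):
    """Show inline audio players for up to `limit` files in one cluster."""
    # Imported lazily so the helpers above also work outside a notebook.
    from IPython.display import Audio, display

    for path in clusters[label][:limit]:
        print(path)
        display(Audio(filename=str(path)))
```

In Colab, one would first run `from google.colab import drive` and `drive.mount("/content/drive")`, then pass the dataset folder under the mount point to `list_audio_files`. With `labels` coming from whichever clustering step the notebook uses, `preview_cluster(group_by_cluster(paths, labels), 0)` would let a reviewer listen to the first few files in cluster 0.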
Thank you, @dennyabrain, for your valuable feedback. I will look into the changes you suggested and update my code.
Hello @dennyabrain, sorry I wasn't available because of my end-semester exams. From what I have seen in the comments on the issue, we have to classify the dataset based on contextual references rather than tone, music, etc. I therefore think there is no use in updating my previous notebook, which was intended for a different use case.
I am creating a new pipeline that takes audio as input, transcribes the speech in it to text, and then applies a clustering algorithm to divide the transcripts into the intended classes. If you think this is a viable way forward, I would like to start working on it.
Thanks and regards, Shashank Shekhar Singh
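A minimal sketch of the transcribe-then-cluster pipeline described in the comment above, under loud assumptions: the speech-to-text step is left as a caller-supplied `transcribe` callable (no particular model is implied by the thread), and the clustering shown here is a simple greedy grouping on bag-of-words cosine similarity, swapped in only to keep the example self-contained. It is not the method the author went on to build. Whitespace tokenisation is crude and the `threshold` value is arbitrary.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Lowercase the text and count whitespace-separated tokens."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def cluster_transcripts(transcripts, threshold=0.3):
    """Greedy grouping: join the first cluster whose seed is similar enough."""
    seeds = []   # one word-count vector per cluster (its first member)
    labels = []  # cluster label for each transcript, in input order
    for text in transcripts:
        vector = bag_of_words(text)
        for label, seed in enumerate(seeds):
            if cosine_similarity(vector, seed) >= threshold:
                labels.append(label)
                break
        else:
            # No existing cluster matched, so this transcript starts a new one.
            seeds.append(vector)
            labels.append(len(seeds) - 1)
    return labels


def run_pipeline(audio_paths, transcribe, threshold=0.3):
    """Transcribe each file with the supplied callable, then cluster the text."""
    transcripts = [transcribe(path) for path in audio_paths]
    return cluster_transcripts(transcripts, threshold)
```

For a quick check without a real speech model, `transcribe` can be a dictionary lookup such as `fake.get`, where `fake` maps file names to hand-written transcripts. In practice the bag-of-words step would likely be replaced with multilingual text embeddings and a standard clustering algorithm.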
Closing this issue because the DMP program has started.
This issue is related to Issue #82 and proposes a concrete idea based on the third approach listed there. I am Shashank Shekhar Singh, a sophomore at IIT BHU, India, with an interest in machine learning model development and deployment.
Approach chosen (among the three mentioned):
Link to my first approach: My Colab Notebook
Background: Reference
Future trial work on this approach:
Please share any feedback on this work so that I can explore it further. @dennyabrain @duggalsu