abhisheks008 / ML-Crate

ML-Crate stands as the ultimate hub for a multitude of exciting ML projects, serving as the go-to resource haven for passionate and dedicated ML enthusiasts!πŸŒŸπŸ’« Devfolio URL, https://devfolio.co/projects/mlcrate-98f9
https://quine.sh/repo/abhisheks008-ML-Crate-409463050
MIT License
205 stars 215 forks source link

German Italian Speech Analysis #297

Open abhisheks008 opened 2 years ago

abhisheks008 commented 2 years ago

ML-Crate Repository (Proposing new issue)

:red_circle: Project Title : German Italian Speech Analysis :red_circle: Aim : Analyse and visualize different aspects of the German and Italian language. :red_circle: Dataset : https://www.kaggle.com/datasets/mitishaagarwal/german-italian-speech-analysis :red_circle: Approach : Try to use 3-4 algorithms to implement the models and compare all the algorithms to find out the best fitted algorithm for the model by checking the accuracy scores. Also do not forget to do a exploratory data analysis before creating any model.


πŸ“ Follow the Guidelines to Contribute in the Project :


:red_circle::yellow_circle: Points to Note :


:white_check_mark: To be Mentioned while taking the issue :


Happy Contributing πŸš€

All the best. Enjoy your open source journey ahead. 😎

guneshm11 commented 2 years ago

Hello I would like to work on this project. Please assign it to me Full name : Gunesh Mahajan GitHub Profile Link : https://github.com/guneshm11 Participant ID (If not, then put NA) : NA Approach for this Project : Try to use 3-4 algorithms to implement the models and compare all the algorithms to find out the best fitted algorithm for the model by checking the accuracy scores. Also do not forget to do a exploratory data analysis before creating any model. What is your participant role? - Contributor

abhisheks008 commented 2 years ago

Work on one issue at a time.

nkhanna94 commented 6 months ago

Full name: Niharika Khanna GitHub Profile Link: nkhanna94 Participant ID (If not, then put NA): NA

Approach for this Project:

Exploratory Data Analysis (EDA): Overview: Understand dataset structure and language distribution. Feature Extraction: Extract relevant speech features. Visualization: Explore differences in speech characteristics between German and Italian.

Preprocessing: Cleaning: Address missing values, outliers, and noise. Normalization: Normalize extracted features for consistency.

Model Implementation: Algorithm Selection: Employ 3-4 algorithms like CNNs, RNNs, SVMs, and GBMs. Training: Train models using extracted features and language labels.

Model Evaluation: Metrics: Assess model performance using accuracy and cross-validation. Comparison: Identify best algorithm for language classification. Interpretation: Analyze key features contributing to classification.

Participant Role: SSOC S3

abhisheks008 commented 6 months ago

Implement 5-6 models for this project.

Assigned @nkhanna94