Recode-Hive / Scrape-ML

For new data generation in the Semi-supervised-sequence-learning-Project, we have written a Python script that fetches data from the IMDb website and converts it into txt files.
https://scrape-ml.streamlit.app/
MIT License
80 stars, 117 forks

Improvement of Feature: Missing Data Preprocessing and Visualization in Sentiment_Analysis.ipynb #157

Closed. Harshitmishra001 closed this issue 3 weeks ago.

Harshitmishra001 commented 3 weeks ago

Bug: Missing Data Preprocessing and Visualization

The current script lacks crucial data preprocessing (cleaning, stemming) and visualizations to understand model performance. This leads to potential inaccuracies and difficulty interpreting results.

Fix: This commit (or issue) adds:

Data Preprocessing: Removes duplicates, handles missing values, cleans text (including stemming).

Visualizations: Includes a confusion matrix and a rating distribution plot for better understanding.

This improves model accuracy, reliability, and interpretability.
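
The steps listed above could look roughly like the following. This is a minimal standard-library sketch, not the code from the PR: the sample rows, the function names (`naive_stem`, `clean_text`, `preprocess`, `confusion_counts`, `rating_distribution`) and the (text, rating, label) row layout are all assumptions made for illustration. The suffix stripper is only a stand-in for a real stemmer, and the two counting helpers produce the numbers that a confusion matrix and a rating distribution plot would display rather than drawing the plots themselves.

```python
import re
from collections import Counter

# Hypothetical sample rows: (review_text, rating, label). Not taken from the project.
RAW_REVIEWS = [
    ("Loved it! Amazing acting.", 9, "positive"),
    ("Loved it! Amazing acting.", 9, "positive"),  # exact duplicate
    (None, 4, "negative"),                          # missing review text
    ("Boring and <b>predictable</b> plot...", 3, "negative"),
    ("Watching it again was rewarding", 8, "positive"),
]


def naive_stem(word):
    """Crude suffix stripper; a stand-in for a real stemmer, not an equivalent."""
    for suffix in ("ing", "ed", "ly", "s"):
        # Keep at least three characters so short words like "was" survive.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def clean_text(text):
    """Lowercase, drop HTML tags and non-letters, then stem each token."""
    text = re.sub(r"<[^>]+>", " ", text.lower())
    text = re.sub(r"[^a-z\s]", " ", text)
    return " ".join(naive_stem(token) for token in text.split())


def preprocess(rows):
    """Skip rows with missing fields and exact duplicates, then clean the text."""
    seen, cleaned = set(), []
    for row in rows:
        if any(field is None for field in row) or row in seen:
            continue
        seen.add(row)
        text, rating, label = row
        cleaned.append((clean_text(text), rating, label))
    return cleaned


def confusion_counts(y_true, y_pred):
    """Count (actual, predicted) pairs; these are the confusion-matrix cells."""
    return Counter(zip(y_true, y_pred))


def rating_distribution(rows):
    """Count reviews per rating; these are the bar heights of the plot."""
    return Counter(rating for _, rating, _ in rows)


if __name__ == "__main__":
    data = preprocess(RAW_REVIEWS)
    for text, rating, label in data:
        print(rating, label, text)
    print(sorted(rating_distribution(data).items()))
```

In a notebook, the same steps would more commonly be written with pandas (`drop_duplicates`, `dropna`), NLTK's `PorterStemmer`, and scikit-learn's `confusion_matrix`, with the resulting counts passed to a plotting library.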

github-actions[bot] commented 3 weeks ago

Thank you for raising an issue. We hope you are enjoying open source. We will try to reply or assign it as soon as possible. Connect with a mentor.

Harshitmishra001 commented 3 weeks ago

@sanjay-kv I am done writing the code for this. Can you assign it to me so that I can submit a PR?

Shouryabhardwajj commented 3 weeks ago

I am interested in this issue. Please assign it to me.

Himanshi11045 commented 3 weeks ago

Please assign this issue to me.

sanjay-kv commented 3 weeks ago

This has already been assigned. Feel free to create new issues.

github-actions[bot] commented 3 weeks ago

Hello @Harshitmishra001! Your issue #157 has been closed. Thank you for your contribution!