recodehive / Scrape-ML

For new data generation in the Semi-supervised-sequence-learning-Project, we have written a Python script to fetch 📊 data from the 💻 IMDb website 🌐 and convert it into txt files.
https://scrape-ml.streamlit.app/
MIT License
105 stars 137 forks

Clustering movie data from IMDb (issue #153)

Closed Roverbk closed 5 months ago

Roverbk commented 5 months ago

1. We scrape IMDb data, including movie titles, ratings, and summaries.
2. We use TF-IDF vectorization to convert the movie summaries into numerical feature vectors.
3. We apply K-means clustering to group the movies into clusters based on their summaries.
4. We use PCA for dimensionality reduction to visualize the clusters in a 2D space.
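
For readers following along, here is a minimal sketch of steps 2 to 4 (TF-IDF, K-means, PCA). It is not the code from the PR: it uses only the Python standard library so it runs without extra packages, whereas a real implementation would more likely rely on a library such as scikit-learn. The `SUMMARIES` list is made-up placeholder text standing in for scraped IMDb summaries, and the function names `tfidf`, `kmeans`, and `pca_2d` are hypothetical helpers introduced only for this illustration.

```python
import math
from collections import Counter

# Placeholder summaries (invented for this sketch, not scraped from IMDb).
SUMMARIES = [
    "a detective hunts a killer in the city",
    "a detective investigates a murder in the city",
    "astronauts travel through space to a distant planet",
    "a crew of astronauts explores a distant planet in space",
]


def tfidf(docs):
    """Return (vocabulary, L2-normalised TF-IDF row vectors)."""
    tokenised = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenised for tok in toks})
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(tok for toks in tokenised for tok in set(toks))
    # Smoothed IDF, so a term found in every document still gets weight 1.
    idf = {t: math.log((1 + n) / (1 + df[t])) + 1 for t in vocab}
    rows = []
    for toks in tokenised:
        tf = Counter(toks)
        row = [tf[t] * idf[t] for t in vocab]
        norm = math.sqrt(sum(x * x for x in row)) or 1.0
        rows.append([x / norm for x in row])
    return vocab, rows


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(rows, k, iters=20):
    """Lloyd's algorithm with deterministic farthest-point seeding."""
    centroids = [list(rows[0])]
    while len(centroids) < k:
        far = max(rows, key=lambda r: min(sq_dist(r, c) for c in centroids))
        centroids.append(list(far))
    labels = [0] * len(rows)
    for _ in range(iters):
        # Assignment step: attach each row to its nearest centroid.
        labels = [min(range(k), key=lambda c: sq_dist(r, centroids[c]))
                  for r in rows]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [r for r, lab in zip(rows, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members)
                                for col in zip(*members)]
    return labels


def pca_2d(rows, iters=100):
    """Project rows onto their top two principal components.

    Components are found by power iteration on X^T X with deflation.
    """
    n, d = len(rows), len(rows[0])
    means = [sum(col) / n for col in zip(*rows)]
    X = [[x - m for x, m in zip(r, means)] for r in rows]
    components = []
    for i in range(2):
        v = [1.0 / (j + 1 + i) for j in range(d)]  # arbitrary non-zero start
        for _ in range(iters):
            # Remove the directions that were already found.
            for c in components:
                proj = sum(a * b for a, b in zip(v, c))
                v = [a - proj * b for a, b in zip(v, c)]
            scores = [sum(a * b for a, b in zip(r, v)) for r in X]  # X v
            v = [sum(s * r[j] for s, r in zip(scores, X))
                 for j in range(d)]  # X^T (X v)
            norm = math.sqrt(sum(a * a for a in v)) or 1.0
            v = [a / norm for a in v]
        components.append(v)
    return [[sum(a * b for a, b in zip(r, c)) for c in components] for r in X]


vocab, rows = tfidf(SUMMARIES)
labels = kmeans(rows, k=2)
coords = pca_2d(rows)
for summary, label, (x, y) in zip(SUMMARIES, labels, coords):
    print(f"cluster {label}  ({x:+.2f}, {y:+.2f})  {summary}")
```

The farthest-point seeding is a simplification chosen to keep the sketch deterministic; random or k-means++ initialisation is the more common choice. The 2D coordinates in `coords` can be passed to any plotting tool, coloured by `labels`, to get the cluster visualisation described in step 4.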

Roverbk commented 5 months ago

https://github.com/Recode-Hive/Scrape-ML/issues/64

github-actions[bot] commented 5 months ago

Thank you for raising an issue. We hope you are enjoying open source. We will try to reply or assign it as soon as possible. Connect with a mentor.

Roverbk commented 5 months ago

@sanjay-kv

Roverbk commented 5 months ago

@sanjay-kv please merge the PR

github-actions[bot] commented 5 months ago

Hello @Roverbk! Your issue #153 has been closed. Thank you for your contribution!