Recode-Hive / Scrape-ML

For new data generation in the Semi-supervised-sequence-learning-Project, we have written a Python script to fetch 📊 data from the 💻 IMDb website 🌐 and convert it into txt files.
https://scrape-ml.streamlit.app/
MIT License
80 stars 117 forks

Make a full test on dataset based on BERT #188

Open Soubeer opened 2 weeks ago

Soubeer commented 2 weeks ago

I would like to propose a new method of analyzing the IMDb movie review dataset: using BERT (Bidirectional Encoder Representations from Transformers). BERT has revolutionized natural language processing (NLP) with its deep bidirectional understanding of text, which makes it well suited to tasks such as sentiment analysis and review classification.

The task should cover an introduction to BERT, data preprocessing, model training and fine-tuning, evaluation and results, and usage instructions. This will provide a clear, hands-on guide for users to understand and replicate our BERT-based sentiment analysis and classification on the IMDb dataset.
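
As a rough illustration of the data preprocessing step, here is a minimal sketch using only the Python standard library. The helper names `clean_review` and `encode_label` are hypothetical and not part of the repository, and the `pos`/`neg` label names are an assumption about how the scraped reviews would be labelled. IMDb review text commonly contains `<br />` tags, so the sketch strips markup before the text would be handed to a BERT tokenizer.

```python
import html
import re

# Hypothetical helpers for illustration; not taken from the Scrape-ML code.
_TAG_RE = re.compile(r"<[^>]+>")   # matches HTML tags such as <br />
_WS_RE = re.compile(r"\s+")        # matches runs of whitespace


def clean_review(text: str) -> str:
    """Remove HTML tags, decode HTML entities, and collapse whitespace."""
    text = _TAG_RE.sub(" ", text)   # replace each tag with a space
    text = html.unescape(text)      # e.g. "&amp;" becomes "&"
    return _WS_RE.sub(" ", text).strip()


def encode_label(label: str) -> int:
    """Map a sentiment label to an integer (assumed names: 'neg', 'pos')."""
    mapping = {"neg": 0, "pos": 1}
    return mapping[label.strip().lower()]
```

Casing is left untouched here on purpose, since whether to lowercase depends on which BERT checkpoint (cased or uncased) ends up being used.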

This will also include a baseline-versus-BERT comparison on a common scale, to showcase the difference between the two approaches.
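
One possible way to put the baseline and BERT on the same scale is to score both sets of predictions against the same test labels and report the difference. The sketch below is only an assumption about how that could look: the function names (`accuracy`, `f1_score`, `compare_models`) are hypothetical, the choice of accuracy and F1 as metrics is mine, and no real results are implied.

```python
from typing import Dict, Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("cannot score an empty label list")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def f1_score(y_true: Sequence[int], y_pred: Sequence[int], positive: int = 1) -> float:
    """F1 for the positive class; returns 0.0 when it is undefined."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    tp = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    fp = sum(t != positive and p == positive for t, p in zip(y_true, y_pred))
    fn = sum(t == positive and p != positive for t, p in zip(y_true, y_pred))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def compare_models(
    y_true: Sequence[int],
    baseline_pred: Sequence[int],
    bert_pred: Sequence[int],
) -> Dict[str, Dict[str, float]]:
    """Score both models on the same labels and report BERT minus baseline."""
    baseline = {
        "accuracy": accuracy(y_true, baseline_pred),
        "f1": f1_score(y_true, baseline_pred),
    }
    bert = {
        "accuracy": accuracy(y_true, bert_pred),
        "f1": f1_score(y_true, bert_pred),
    }
    # Positive delta means BERT scored higher than the baseline.
    delta = {name: bert[name] - baseline[name] for name in baseline}
    return {"baseline": baseline, "bert": bert, "delta": delta}
```

Calling `compare_models(y_true, baseline_pred, bert_pred)` with the test labels and each model's predictions would return the per-model scores plus a `delta` entry, which could serve as the "difference scale" in the final write-up.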

I would like to take this task under GSSOC'24.

Thank You

sanjay-kv commented 2 weeks ago

I can't assign any label atm, as I need to see the code first. You can start working on it.