Recode-Hive / Scrape-ML

For new data generation Semi-supervised-sequence-learning-Project we have writtern a python script to fetch📊, data from the 💻, imdb website 🌐 and converted into txt files.
https://scrape-ml.streamlit.app/
MIT License
80 stars 117 forks source link

Added BERT v/s Raw Performance #191

Closed Soubeer closed 1 day ago

Soubeer commented 2 weeks ago

Related Issue

This pull request tackles the need for a detailed comparison of different text classification methods on the IMDB dataset. By comparing raw text processing, pre-trained BERT, and fine-tuned BERT models, I aim to highlight their performance differences. This analysis will help us understand which approach works best for sentiment analysis and guide future enhancements. It provides valuable insights into the strengths and weaknesses of each method, helping to make informed decisions for upcoming projects in the repository.

About Issue #188

Description

In this pull request, I've conducted a comparative study of text classification methods on the IMDB dataset: raw text processing, pre-trained BERT, and fine-tuned BERT models. BERT, known for its deep bidirectional representations, is evaluated both in its pre-trained state and after fine-tuning on the IMDB dataset. Fine-tuning involves further training BERT on our specific dataset, enhancing its performance by capturing domain-specific nuances. The analysis includes training, validation, and performance metrics for each method, highlighting the strengths and weaknesses of each approach. This comparison aims to guide future development and optimization efforts in our repository, ensuring the use of the most effective text classification methods.

Type of PR

Screenshots / videos (if applicable)

[Attach any relevant screenshots or videos demonstrating the changes]

(Not really required,already present in notebook file)

Checklist:

Additional context:

I have not added a READme and have made the whole thing inside a new folder named raw_bert_finetune..should I change it or add something?

github-actions[bot] commented 1 day ago

This PR has been automatically closed due to inactivity from the owner for 15 days.