instadeepai / tunbert

TunBERT is the first release of a pre-trained BERT model for the Tunisian dialect using a Tunisian Common-Crawl-based dataset. TunBERT was applied to three NLP downstream tasks: Sentiment Analysis (SA), Tunisian Dialect Identification (TDI) and Reading Comprehension Question-Answering (RCQA)
MIT License
107 stars 37 forks source link

The prediction function skips a row when outputting "test_results.tsv" results #1

Closed Alaa-1 closed 3 years ago

Alaa-1 commented 3 years ago

After training the module using finetuning_sa_tdid.sh , then running predictions on the test.tsv, the output of the prediction which gets written in test_results.tsv I noticed that the test file and the test_results file don't have same number of rows.

for example the tunbert/dev-data/sentiment_analysis_tsac/test.tsv file that comes with your repo has 5 sentences/rows. When I run the predictions the tunbert/finetuning_tsac/test_results.tsv file outputs only 4 sentences/rows.

my_issue

Update: After spending more time tinkering with the model, It truns out that I didn't spot a minor issue when I was dealing with headers. Sorry for the inconvenience and the false issue. Great work, I really enjoyed it ^^

hassibatej commented 3 years ago

Amazing !