gesistsa / grafzahl

🧛 fine-tuning Transformers for text data from within R
https://gesistsa.github.io/grafzahl/
GNU General Public License v3.0
41 stars 2 forks source link

Multinomial classification does not work #36

Open chainsawriot opened 5 months ago

chainsawriot commented 5 months ago

simpletransformers 0.7

require(grafzahl)
require(quanteda)
download.file("https://huggingface.co/datasets/israel/Amharic-News-Text-classification-Dataset/resolve/main/train.csv", destfile = "am_train.csv")
input <- read.csv("am_train.csv")

input_corpus <- corpus(input, text_field = "article") %>% corpus_subset(category != "")
model <- model <- grafzahl(x = input_corpus, y = "category", model_name = "castorini/afriberta_base")

Error

Target is multiclass but average='binary'. Please choose another average setting, one of [None, 'micro', 'macro', 'weighted'].Traceback:

1. grafzahl(x = input_corpus, y = "category", model_name = "castorini/afriberta_base")

2. grafzahl.corpus(x = input_corpus, y = "category", model_name = "castorini/afriberta_base")

3. py_train(data = input_data, num_labels = num_labels, output_dir = output_dir, 
 .     best_model_dir = best_model_dir, cache_dir = cache_dir, model_type = model_type, 
 .     model_name = model_name, num_train_epochs = num_train_epochs, 
 .     train_size = train_size, manual_seed = manual_seed, regression = regression, 
 .     verbose = verbose)

4. py_call_impl(callable, call_args$unnamed, call_args$named)
chainsawriot commented 5 months ago

It worked previously.

chainsawriot commented 5 months ago

ThilinaRajapakse/simpletransformers#1566