-
I am hoping to classify sentence pairs in two different languages (English A and Chinese B). Do I just use the multi-lingual BERT model?
Another possible classification task I am considering is cla…
-
#### Issue Description
The accuracy of ParagraphVectors is low on the 20 Newsgroups dataset.
Dataset: http://qwone.com/~jason/20Newsgroups/20news-bydate.tar.gz
#### Version Information
0.8.0, using…
tvanh updated
6 years ago
-
Thank you very much for your excellent work! We have already run the model using the demo and found that the Theia model's feature extraction visualization was not as good as individ…
-
## Description
I am engaged in research with Kosmos-2, aiming to replicate the Zero-Shot Image Classification with Descriptions task as detailed in Section 4.7 of the Kosmos-1 paper (figure). Unfortu…
-
While using a cross-encoder (CrossEncoder('distilroberta-base', num_labels=1)), I'm getting an F1 score of 1 on the validation set with labels 0 and 1 after fine-tuning. Although fine-tuning is not happenin…
-
Thank you for sharing the source code of VLMO recently.
We took a stab at it and pretrained a large (1024 hidden dim) multiway transformer with MIM loss, MLM loss, and contrastive loss.
BEIT3 pret…
-
Original Issue -
https://code.google.com/p/alageospatialportal/issues/detail?id=304
Project Member Reported by leebel...@gmail.com, Nov 22, 2010
Code and documentation received from Glenn Man…
-
- [ ] [argilla/magpie-ultra-v0.1 · Datasets at Hugging Face](https://huggingface.co/datasets/argilla/magpie-ultra-v0.1)
# Dataset Card for magpie-ultra-v0.1
## Dataset Summary
`magpie-ultra` is a s…
-
In a second step we could extract the "scientific" medical documents from our positive set.
-
Hi,
I would like to create my own domain-specific "stsb" dataset to further improve performance.
I have a 500 GB domain-specific text corpus and want to use and label some of the sentence pairs.
Do …