-
**Describe the bug**
I tried to optimize BERT model with bert_ptq_cpu.json but it gave 7 output models.
It there any ways or change the config to get only one output model?
```
[2024-10-25 10:54:59,1…
-
Hello!
Thanks for the great re-implementation of GroundingDino. I am trying to understand you code.
In the [usage.md](https://github.com/open-mmlab/mmdetection/blob/main/configs/mm_grounding_din…
-
Cool package.
Wanted to try this with better and newer models
-
**Describe the Feature**
Add BERTScore as additional evaluation metric scorer for context-precision and context-recall.
**Why is the feature important for you?**
As a RAGAS user trying to eva…
-
Confirm valid implementation
References:
> Loss? Loss is:
> Total span extraction loss is the sum of a Cross-Entropy for the start and end positions.
https://huggingface.co/transformers/v4…
-
### Issue type
Support
### Have you reproduced the bug with TensorFlow Nightly?
Yes
### Source
source
### TensorFlow version
2.17
### Custom code
Yes
### OS platform and distribution
ubuntu…
-
Hi there. Thanks for the great library!
I have one issue regarding the usage of Bert-based models. I trained different models finetuning them on my custom dataset (roberta, luke, deberta, xlm-rober…
-
### System Info
```shell
transformers.js@main
```
### Who can help?
@xenova
It is mentioned in that [wav2vec2-bert](https://huggingface.co/docs/transformers.js/main/en/index#models:~:…
-
https://github.com/tensorflow/tensorflow/issues/77826
TensorFlow version: 2.17.0
Transformers version: 4.46.0.dev0
Keras version: 3.6.0
This problem can be solved by using TensorFlow version 2…
-
We are trying to use a LongFormer and Bert model for multi-label classification of different documents.
When we use the BERT model (BertForSequenceClassification) with max length 512 (batch size 8…