-
Hi, I'm curious about the training data of xlm-r models finetuned on conll ner datasets (e.g. xlm-roberta-large-finetuned-conll03-german, xlm-roberta-large-finetuned-conll03-english), are the models…
-
### Describe the issue
First of all, thank you for your great contributions.
I have a similar question to the [issue 146](https://github.com/microsoft/LLMLingua/issues/146), I cannot reproduce the…
-
**Describe the bug**
Cannot export the model.
**To Reproduce**
```
import keras
from keras_nlp.models import XLMRobertaPreprocessor, XLMRobertaBackbone
import tensorflow as tf
preprocessor …
-
### Feature request
I encountered a KeyError while loading the phi3-v vision model into Optimum Huggingface. The error message states:
```
KeyError: 'phi3-v model type is not supported yet in Nor…
-
### Question
Hi, I have data in BIO format (not BIOES). I am training a sequence tagger model with transformer embedding but consistently get 0 f1-score for every epoch for XLM-ROBERTA-LARGE, but for…
-
## ❓ Questions and Help
### Before asking:
1. Search for similar [issues](https://github.com/Unbabel/COMET/issues).
3. Search the [docs](https://unbabel.github.io/COMET/html/index.html).
…
-
Hello,
First, thanks for these great models! I was wondering if I could use these models for zero-shots classification, especially for emotion detection (Ekman). While doing so, I encountered this …
-
When I tried to run the training, it raises an error for missing `model_type` key in config.json.
The training script I used is the one in the `run-ccp.sh`:
``` bash
outdir=runs/CCP
model_cfg=da…
-
Interesting paper. Regarding pretrained model I'm wondering - are they Roberta based or R-XLM? Did you evaluate performance wrt mDeberta as base model?
And finally - how would one use such/this mod…
zidsi updated
6 months ago
-
https://lmsys.org/blog/2023-06-29-longchat/
https://arxiv.org/abs/2305.07185
https://www.reddit.com/r/LocalLLaMA/comments/14fgjqj/a_simple_way_to_extending_context_to_8k/
https://github.com/epfml…