-
@NielsRogge Dear Author, really appreciate that you have created great notebooks to guide us on how to perform relation extraction task using XFUN dataset by LayoutXLM model.
With your notebook, I…
-
We were running into the following error while trying to run the fine-tuning code for Indic-English
``` size mismatch for encoder.embed_tokens.weight: copying a param with shape torch.Size([806, 25…
-
Hi,
I want to reproduce your result on IWSLT14 De-En, but I can't get 35.78. My best result is 34.25. Here I want to ask some detailed setup:
1. Do you use share embedding? I don't use. If yes, ho…
-
**RTKLIB version: 2.4.2 p13**
**Using Language: C**
Hello, I've identified a potential issue in the RTCM3 decoding functions that may lead to misunderstandings and unexpected behavior. The pro…
-
I am trying to train Sinhala Merlin voice with the data that we have open sourced [0]. It consists of 2064 prompts that were recorded (with multiple speakers with similar acoustics), phonology, transc…
-
The following query keeps returning NaN
```
SELECT extract(epoch from now() - pg_last_xact_replay_timestamp())::int AS last_replay_time
```
I'm able to run this query as a non-superuser in the dat…
-
Recently, I have experimented DPO training for Vietnamese. I start with a strong SFT model, which is [vinai/PhoGPT-4B-Chat](https://huggingface.co/vinai/PhoGPT-4B-Chat), and follow the method describe…
-
I thought I had read something about this before, but couldn't find an issue about it, so apologies if this is a duplicate.
When defining a table with a foreign key, it sometimes is desired to rest…
-
@tnylea
- I just pushed the 'js' branch. it's not ready for PR yet but I want you to see this database setup to start thinking.
- You were right – JSON was the wrong direction.
- we'll be ab…
-
I want to fine-tune BERT-base-uncased for the language model, according to my custom dataset. It consists of around 80M tweets. I'm a bit puzzled about how many training steps I should set so it is tr…