-
Posting for @awadalaa
We are blocked on experimenting with a new Tensorflow model in production because it fails to inference with this error:
**tensorflow.python.framework.errors_impl.FailedPr…
-
I'm very new to the concept of neural networks so correct me if I'm getting something wrong but don't LSTMs have short term memory so you can call inputs in a sequence and get different results depend…
-
## Motivation
Whenever a script wants to load model weights, there are different variations of it that could be loaded depending on which script we are referring to:
1. A lit model weights file …
-
I'm evaluating whether to implement an online account feature (self-hostable) where you can upload your lists and thus edit them on your PC in your browser. This eliminates the import/export busywork …
-
Hi, it's me again. The training is working great but when it comes to saving the checkpoint, I got this bug. Any ideas?
```
[rank0]: File "/workspace/train.py", line 230, in
[rank0]: train…
-
问题描述:
使用peft微调llama3 8b,训练代码基本是按照样例稍作修改,在训练的时候 前10个steps,loss稍高,后面输出的loss,一直都是0.0了
微调代码:
```python
import torch
from datasets import Dataset
import pandas as pd
from transformers impo…
-
**Describe the bug**
I\m training a seq2seq transformer model in librispeech_100 egs2 dir.
I ahve set ctc_weight =0.0 to disable ctc in model training and I expect that to hold for decoding too.
…
-
I ran light lda over a corpus with 1000 doc and approx 22000 vocabulary size , i used text2libsvm api present in lightlda/example to convert UCI data to libsvm and get dictionary too.
I am getting th…
-
As a user of vocascan, I'd like to have the feature of importing and exporting vocabularies as csv files.
I can imagine that many starts to learn by creating tables and have it already there and wo…
-
Implement minimum risk trainer as described in http://arxiv.org/abs/1512.02433.
- sampling is done by computing top-k translations (beam search is already done)