-
Just ran into this in the midst of training. I assume the epoch ended and it tried to save something to disk, although that is only a few MB in size.
```
2023-05-01 13:03:54,309 INFO [trainer.py:757] E…
-
This is a "living issue". Editing is appreciated.
### Context:
- Most prominent benchmark for embedding models: https://huggingface.co/spaces/mteb/leaderboard
- We can choose to index the pdf dat…
-
IBM Plex Sans JP is slightly bigger (or not hinted properly?) than other fonts.
![image](https://user-images.githubusercontent.com/17811025/126870103-67ee1237-7d13-480a-b5d2-d060e4a4a4dc.png)
All text…
-
Hey, I read your blog post about `profanity-check`, so I've seen the code there, but I'm wondering whether you have a separate file for training? And/or one for validation or "benchmarking"?…
-
See this reply: https://github.com/kaegi/MorphMan/pull/221#issuecomment-754379723
-
### Describe the feature or improvement you're requesting
Hi, this is not a suggestion, but rather a question.
I have been working on new ideas for evals lately, but none seem to have been reviewed. …
-
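# LLMs from an industry perspective
The Road to AGI: Technical Essentials of Large Language Models (LLMs)
# Which large models exist
https://zhuanlan.zhihu.com/p/611403556
# Model architecture
Why are today's LLMs all decoder-only architectures?
The low-rank perspective
# How to train
[Ladder Side-Tuning: a "wall-climbing ladder" for pretrained models](https://kexue.f…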
-
In regular situations, `word-break: normal` is expected to pick the right kind of word breaking for various scripts, keeping letters of a word together in languages that have word-based line breaking,…
-
On a Colab GPU instance, I set up the m4t runtime env and tried an s2tt task. It produces poor-quality transcription, as follows, compared to Whisper. I wonder if I have been doing something wrong in setting up seamless …