-
### ❓ The question
Hi all,
Thanks so much for this amazing repo. I'm training a 1B model from scratch and am just wondering what the final loss converged to and what the final perplexity is. Than…
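For reference, the two numbers asked about are tied together: when the training loss is the mean per-token cross-entropy, perplexity is its exponential. The sketch below assumes the loss is reported in nats (natural log), which is the usual default in deep learning frameworks but is an assumption here, since the repo's logging is not shown. The function names and the example value are hypothetical and are not results from this repo.

```python
import math


def perplexity_from_loss(mean_ce_loss_nats: float) -> float:
    """Convert a mean per-token cross-entropy loss in nats to perplexity."""
    return math.exp(mean_ce_loss_nats)


def perplexity_from_loss_bits(mean_ce_loss_bits: float) -> float:
    """Same conversion when the loss is reported in bits (log base 2)."""
    return 2.0 ** mean_ce_loss_bits


if __name__ == "__main__":
    # Illustrative value only; not a measured loss from any model.
    example_loss = 2.5
    ppl = perplexity_from_loss(example_loss)
    print(f"loss={example_loss:.2f} nats -> perplexity={ppl:.2f}")
```

If the logged loss includes extra terms (for example an auxiliary regularizer), only the cross-entropy part should be exponentiated.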
-
### 🐛 Describe the bug
I tried resuming training from a previous unsharded checkpoint at step 1k. Training resumed with no initial issue; however, when it tried to save the sharded checkpoint …
-
### Potential sources
- ResearchGate or other online bodies of published research (tailored, search-based profile)
- Reddit/Twitter (searching through communities/posts)
- QuiverQuant (look at ins…
-
* anthropic
* openai
* claude
* perplexity
* gemini
Prepare a suite of questions, tune prompts, compare results
Allow us to pick the one that performs the best
Maybe consider ethical issu…
-
Alveolar macrophage would be a good test case for cleaning up cell surface marker assertions.
The basic term can, I think, be defined very simply - these are macrophages that reside on the luminal …
-
When I run predict.py, the error message `No module named 'perplexity'` appears. How can I download this library?
-
**Is your feature request related to a problem? Please describe.**
`_gen_json_object` dictates key order.
**Describe the solution you'd like**
Instead, allow the LLM to dictate it, reducing perpl…
wjn0 updated
3 months ago
-
Measuring perplexity takes forever. Is it working?
-
Does the TSNE algorithm implemented in cuML have a maximum perplexity? I used the following code to create a TSNE object, but the perplexity parameter seems to have no impact on the result.
from cuml.m…
-
I need to get ppl per sentence for millions of lines. Splitting them into files that each contain one sentence would be time-consuming. Is it possible to achieve this by modifying the dataloader? For exampl…
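One way to picture per-sentence ppl without one file per sentence is to keep each line's token log-probabilities separate and reduce them line by line. The sketch below is a minimal illustration, assuming some model has already produced natural-log token probabilities for every line; it does not use this repo's dataloader, and the names `sentence_perplexity` and `perplexities_per_line` are hypothetical.

```python
import math
from typing import Iterable, Iterator, Sequence


def sentence_perplexity(token_logprobs: Sequence[float]) -> float:
    """Perplexity of one sentence from its per-token natural-log probabilities."""
    if not token_logprobs:
        # An empty line has no tokens to score, so its perplexity is undefined.
        return float("nan")
    mean_nll = -sum(token_logprobs) / len(token_logprobs)
    return math.exp(mean_nll)


def perplexities_per_line(
    logprobs_per_line: Iterable[Sequence[float]],
) -> Iterator[float]:
    """Lazily yield one perplexity per input line, so millions of lines
    can be streamed without holding all results in memory."""
    for token_logprobs in logprobs_per_line:
        yield sentence_perplexity(token_logprobs)


if __name__ == "__main__":
    # Toy log-probabilities standing in for real model output.
    toy_lines = [
        [math.log(0.25)] * 3,
        [math.log(0.5), math.log(0.125)],
    ]
    for line_no, ppl in enumerate(perplexities_per_line(toy_lines), start=1):
        print(f"line {line_no}: ppl={ppl:.3f}")
```

In a batched setup the same reduction applies per row, provided padding tokens are masked out before averaging.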