-
The *2Vec models have an underdocumented implementation limit in their Cython paths: any single text passed to training that's more than 10000 tokens is silently truncated to 10000 tokens, discarding …
-
I have read all README.md files,and all papers.But i can not find any instructions or tutorials to build an easy application(even i know for some general steps:Questions analysis,Answer Producers,..).…
-
from collections
* https://zenodo.org/communities/empirical-software-engineering/
* https://zenodo.org/communities/msr/
-
```
tansell@tansell:~/tmp/verible$ bazel --bazelrc=/opt/kythe-v0.0.38/extractors.bazelrc build --override_repository kythe_release=/opt/kythe-v0.0.38 --define=kythe_corpus=github.com/google/verible -…
-
Your treatment of entropy per letter in the attributes (and as explained in your blogs) is interesting, but I think that to ensure a passphrase will not be too short, one must consider the number of w…
-
- This issue focuses on the technical courses we take about LLM, we'll put the paper part in
https://github.com/xp1632/DFKI_working_log/issues/70
---
1. **ChainForge** https://chainforge.ai/ …
-
Ideas for analysis please.
Current plans:
Bar graph of albums:
--the 4 types of phrases
--length (number of lines)
Bar graph comparing artists:
-variety of phrases (different profanities)
…
-
First, write down three intuitions you have about broad content patterns you will discover in your data. Plan an asterisk next to the one you expect most firmly, and a plus next to the one that, if tr…
-
There are several things that vet can do to support native fuzzing. This issue tracks all of the potential checks that could be added.
Vet should fail if...
- [ ] the inputs to any `f.Add` ca…
-
We should display things we look at often in W&B. Final merged corpus size after deduplication is something I look at periodically to understand how aggressive the cleaning is overall. We can also dis…