-
Today I was going to train a gpt3_124m model, when I noticed that the max_seq_len is hardcoded [here](https://github.com/karpathy/llm.c/blob/d396cd18b71367f79cbaab8f8203e64e578f9ee8/train_gpt2.cu#L653…
-
Write an SOP page for the data models and exchange tab of the L3. The concepts of actors, sequence diagrams, and transactions are taken from IHE. Reuse IHE language where possible.
This tab is envisioned al…
-
**What Needs to be Done?**
The decision to load-balance in the tempered algorithm is currently constrained by a hard-coded minimum bound on the average load (set at 1e-10).
**Is your feature reques…
-
# Context
Regarding the 2024 plans, we discussed gathering SCI use cases and linking them back to the Patterns & Principles to demonstrate their real-world usage.
@tmcclell shared one that ha…
-
+ Testing
+ Incremental
+ Principles
+ Patterns
-
Can you refactor our legacy project? We need to use the SOLID, KISS, YAGNI, and DRY principles, as well as clean code.
The `IUserCreditService` and `IUserCreditServiceChannel` interfaces and the `UserC…
-
For the site: These tutorials are primarily intended to help data scientists new to Julia familiarise themselves with various software interfaces. They are not an attempt to teach fundamental data science p…
-
To show the principles correctly on the Korean website, we need a translation. `README.md` should contain the English version, and the Korean translation could be placed in `localization/ko/README.md`.
…
-
I propose implementing the "Graph Transitivity" coefficient for undirected graphs as presented in the book Complex Networks: Principles, Methods and Applications by Latora, V., et al.
Would this …
gerdm updated
3 years ago
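For readers unfamiliar with the coefficient named in this proposal, here is a minimal sketch of graph transitivity under its commonly used definition: three times the number of triangles divided by the number of connected triples (paths of length two). The exact formulation in the cited book is not visible in this snippet, so that definition is an assumption, and the function name `transitivity` and the adjacency-dict input format are hypothetical choices for illustration only.

```python
from itertools import combinations


def transitivity(adj):
    """Transitivity of an undirected simple graph.

    adj: dict mapping each node to the set of its neighbours
         (hypothetical input format; assumes no self-loops and
         that every edge appears in both endpoints' sets).
    """
    closed = 0   # triples whose two end nodes are also connected
    triples = 0  # all connected triples, counted once per centre node
    for node, nbrs in adj.items():
        for u, v in combinations(nbrs, 2):
            triples += 1
            if v in adj[u]:
                closed += 1
    # Each triangle is seen from all three of its nodes, so `closed`
    # already equals 3 * (number of triangles).
    return closed / triples if triples else 0.0


# A triangle (0, 1, 2) with one pendant node 3 attached to node 2.
example = {0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3}, 3: {2}}
print(transitivity(example))
```

Returning 0.0 for a graph with no connected triples is a convention chosen here to avoid a division by zero; an actual implementation might prefer to raise or return NaN instead.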
-
Discuss https://w3ctag.github.io/design-principles/