learned-tokenization Search Results

279 results
for learned-tokenization

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

varnamproject/libvarnam #171

Alternate Approach

Varnam has a tokenizer that converts Malayalam (or any other Indian lang) text to manglish patterns. While learning, Varnam makes a database of such patterns -> word : ``` Pattern | Word ID | Learne…

subins2000 updated 3 years ago
2
iLanguage/ilanguagelab #25

Implement Javascript functions turn rules into unknown word …

``` Purpose of implementation request: To try something very challenging at the interface of X-bar theory and Finite State Machines. When implementing the request, please focus on these steps/functi…

GoogleCodeExporter updated 9 years ago
1
LambdaLabsML/examples #57

ValueError: Input is not valid. Should be a string, a list/t…

Hello, I'm following the SD fine tuning tutorial. I ran with the Pokemon dataset and all was well, so I formatted my own dataset, edited the .yaml, forked the repo and am having this issue with my cod…

pimentoliver updated 1 year ago
3
irthomasthomas/undecidability #882

“Emergent” abilities in LLMs actually develop gradually and …

- [ ] [“Emergent” abilities in LLMs actually develop gradually and predictably – study | Hacker News](https://news.ycombinator.com/item?id=39811155) # "Emergent" abilities in LLMs actually develop gr…

ShellLM updated 3 months ago
1
lllyasviel/stable-diffusion-webui-forge #1008

The xformers problem also exists in the webui_forge_cu121_to…

Python 3.10.6 (tags/v3.10.6:9c7b4bd, Aug 1 2022, 21:53:49) [MSC v.1932 64 bit (AMD64)] Version: f2.0.1v1.10.1-previous-240-g294416ed Commit hash: 294416ed55cad69eb8a01393854457e35207a2d4 Launching…

Firetheft updated 3 months ago
2
karpathy/minbpe #50

Alternative to bpe

Maybe I am completely wrong, but to me using something like bpe to build an encoding for text feels stupid. Sure, it is a fairly easy way and it will build an encoding that is efficient in terms of se…

marcov-dart updated 8 months ago
16
ziglang/zig #10761

parse inline assembly syntax according to a set of dialects;…

Currently we have this situation: * stage1: Inline assembly is a comptime-known string that can be built with expressions such as `++`. * stage2: Inline assembly must be string literals. This is i…

andrewrk updated 2 months ago
11
golang/vscode-go #2286

ui: enable semantic tokens by default

We can set this default before gopls switches its default (https://github.com/golang/go/issues/45313) Semantic tokens fixes many issues TextMate-based syntax highlighting has (incorrect highlighting,…

hyangah updated 1 year ago
4
mistralai/mistral-inference #138

Training code

Hi Where can I find the code needed to train the initial model and produce the model files?

sartimo updated 7 months ago
1
AlexisTercero55/AI-Research #14

Byte Pair Encoding on MWT 14 EN2DE

# BPE as input tokens of the transformer model The transformer model proposed by "_Attention is all you need_" encodes the 4.5M sentence input data into a small vocabulary generated by learning sha…

AlexisTercero55 updated 8 months ago
5

上一页 1...1 2 3 4 5 6 7...28 下一页

279 results for learned-tokenization

279 results
for learned-tokenization