-
As the original iteration of this core showed, most of what we have here can be achieved by using a custom tokeniser.
We initially avoided doing this, since we wanted to avoid duplicati…
-
This is what I had expected:
```
Standing Rock-vuosttaldemiin @U.Cap.Obl@Standing Rock+CmpNP/First+N+Prop+Sem/Plc@U.Cap.Obl@+Cmp/SgNom@P.CmpFrst.FALSE@@P.CmpPref.FALSE@@D.CmpLast.TRUE@@D.CmpNone.T…
-
(sorry for the long post)
When developing spelling dictionaries intended for use in a graphical environment like OOo and others, it would be very helpful to be able to spell check texts in exactly…
-
For example here:
![image](https://user-images.githubusercontent.com/449545/109909263-b4cd5f80-7c9d-11eb-8d2c-68d52ece0e35.png)
The original text is:
```
common_voice_th_23657260.mp3 สองอันเท…
-
When tokenization is disabled on long lines, such as when I am reading a minified file (or, yes, the line really is very long for a reason), I am plagued with messages telling me that Token…
-
I'm really looking forward to your paper on FT-TabPFN and the release of the source code! The novel Feature Tokenisation layer sounds like a significant enhancement for handling categorical featur…
-
Closed issue #45 indicates that udpipe was used, and `__main__.py` suggests that you use the expanded form for CoNLL-U multiword tokens, e.g. the 2 tokens "de le" instead of "du" in French. The readme should…
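To make the distinction concrete, here is a minimal sketch of how a CoNLL-U multiword token expands (the example sentence and helper names are illustrative assumptions, not code from this repo). In CoNLL-U, a range ID like `2-3` carries the surface form ("du"), while the single-ID lines that follow carry the syntactic words ("de", "le"); the expanded form keeps only the latter.

```python
# Illustrative (ID, FORM) pairs for French "Merci du cadeau",
# where the multiword token "du" expands to "de" + "le".
conllu_ids_and_forms = [
    ("1", "Merci"),
    ("2-3", "du"),   # multiword token: surface form, range ID
    ("2", "de"),     # first syntactic word
    ("3", "le"),     # second syntactic word
    ("4", "cadeau"),
]

def expanded_tokens(rows):
    """Return the syntactic words, skipping multiword-token range lines."""
    return [form for tok_id, form in rows if "-" not in tok_id]

def surface_tokens(rows):
    """Return surface tokens, preferring range lines over their parts."""
    out, skip = [], set()
    for tok_id, form in rows:
        if "-" in tok_id:
            lo, hi = tok_id.split("-")
            skip.update(str(i) for i in range(int(lo), int(hi) + 1))
            out.append(form)
        elif tok_id not in skip:
            out.append(form)
    return out

print(expanded_tokens(conllu_ids_and_forms))  # ['Merci', 'de', 'le', 'cadeau']
print(surface_tokens(conllu_ids_and_forms))   # ['Merci', 'du', 'cadeau']
```

The readme could simply state which of these two views the tool emits, since downstream token counts differ between them.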
-
Azure DevOps reports the following warning when running against a hosted agent.
##[warning]Task 'Tokenization' (2.10.0) is using deprecated task execution handler. The task should use the supported…
-
For non-Network Division BERT benchmarks, the dataset is tokenised outside of the benchmark run, but in the inference rules, the phrase "Text in. No compression allowed." implies that for Network Di…
-
Add functionality to the existing tokenisation routines (#2) so that tokens can be split into subtokens and adjacent subtokens can be merged.
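A minimal list-based sketch of what such an API could look like (the function names and token representation are assumptions, not the existing routines from #2):

```python
def split_token(tokens, index, parts):
    """Replace tokens[index] with the given subtokens.

    The subtokens must concatenate back to the original token text,
    so splitting is lossless.
    """
    assert "".join(parts) == tokens[index]
    return tokens[:index] + list(parts) + tokens[index + 1:]

def merge_tokens(tokens, start, count):
    """Merge `count` adjacent (sub)tokens starting at `start` into one."""
    merged = "".join(tokens[start:start + count])
    return tokens[:start] + [merged] + tokens[start + count:]

tokens = ["auto-tokeniser", "works"]
tokens = split_token(tokens, 0, ["auto", "-", "tokeniser"])
# tokens == ["auto", "-", "tokeniser", "works"]
tokens = merge_tokens(tokens, 0, 3)
# tokens == ["auto-tokeniser", "works"]
```

The invariant that splitting then merging round-trips to the original token is what makes the two operations safe to compose.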