tokenizing Search Results

1000+ results for tokenizing

Best match


neelsmith/virgapes #10

Interference with NeumeReader when tokenizing unclear elemen…

For example: h009.h05.1.6.0

neelsmith updated 5 years ago
1
dotnet/csharp-tmLanguage #155

VSCode freezes tokenizing specific line of C#

From https://github.com/microsoft/vscode/issues/75355 ## Details From @kiranjulapalli: I spent some more time and here are the steps to repro: Open a vscode window -> New file -> ctrl+shift+…

alexr00 updated 2 years ago
1
andreeaiana/newsreclib #11

Dealing with cold start users click history

Hello! I'm currently handling a dataset where the `histories` column might initially be empty, especially for users who are accessing the system for the first time. Given this context, I'm seeking …

igor17400 updated 7 months ago
6
skeskinen/bert.cpp #1

implement do_handle_chinese_characters in tokenizing

As of yet I haven't tried what happens with Chinese/Japanese characters in tokenization. Some special handling is required since these languages don't have spaces between words. It should be relati…

skeskinen updated 1 year ago
1
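
The special handling mentioned in this issue can be sketched in a few lines. The following is a minimal illustration, not the bert.cpp implementation: it pads every CJK ideograph with spaces so that a later whitespace split treats each character as its own token, similar in spirit to what BERT-style basic tokenizers do. The function names `is_cjk_codepoint`, `space_out_cjk`, and `pre_tokenize` are hypothetical, and the code point ranges cover only the CJK Unified Ideographs and compatibility blocks (Japanese kana, for instance, are not included).

```python
def is_cjk_codepoint(cp: int) -> bool:
    """Return True if cp lies in a CJK ideograph block (assumed ranges)."""
    return (
        0x4E00 <= cp <= 0x9FFF        # CJK Unified Ideographs
        or 0x3400 <= cp <= 0x4DBF     # Extension A
        or 0x20000 <= cp <= 0x2A6DF   # Extension B
        or 0x2A700 <= cp <= 0x2B73F   # Extension C
        or 0x2B740 <= cp <= 0x2B81F   # Extension D
        or 0x2B820 <= cp <= 0x2CEAF   # Extension E
        or 0xF900 <= cp <= 0xFAFF     # CJK Compatibility Ideographs
        or 0x2F800 <= cp <= 0x2FA1F   # Compatibility Ideographs Supplement
    )


def space_out_cjk(text: str) -> str:
    """Surround each CJK ideograph with spaces; leave other characters as-is."""
    out = []
    for ch in text:
        if is_cjk_codepoint(ord(ch)):
            out.append(f" {ch} ")
        else:
            out.append(ch)
    return "".join(out)


def pre_tokenize(text: str) -> list[str]:
    """Whitespace-split after spacing out CJK ideographs."""
    return space_out_cjk(text).split()
```

With this sketch, each ideograph becomes a separate token while space-delimited words are left intact; a subword step such as WordPiece would then run on each token.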
ewafula/MetaClassifier #1

Only works with Pandas version 1.2.2

The README states that pandas newer than or equal to 1.2.2 will work, but newer versions give the error: File "pandas/_libs/parsers.pyx", line 805, in pandas._libs.parsers.TextReader.read_low_…

MKeao updated 1 week ago
1
Chocobozzz/PeerTube #3526

Improve Thai search results with splitting (tokenizing) word…

**Describe the problem to be solved** Thai sentences don't have spaces between words. They are usually spaced between sentences, which might result in fewer search results being displayed than wha…

ppnplus updated 11 months ago
7
python/cpython #54178

tokenize: add support for tokenizing 'str' objects

BPO: [9969](https://bugs.python.org/issue9969) · Nosy: @ncoghlan, @vstinner, @voidspace, @meadori, @takluyver, @vadmium · Files: [issue9969.patch](https://bugs.python.org/file23099/issue9969…

meadori updated 2 years ago
11
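
For context on this request: in current CPython, `tokenize.tokenize()` expects a `readline` callable that returns bytes, while `tokenize.generate_tokens()` accepts one that returns `str`. A minimal sketch of tokenizing a `str` that way is shown below; the wrapper name `tokenize_str` is hypothetical and is not the API proposed in the issue.

```python
import io
import tokenize


def tokenize_str(source: str):
    """Yield (token type name, token string) pairs for Python source held in a str."""
    # generate_tokens() wants a readline callable returning str lines,
    # which io.StringIO provides directly.
    readline = io.StringIO(source).readline
    for tok in tokenize.generate_tokens(readline):
        yield tokenize.tok_name[tok.type], tok.string
```

Calling `list(tokenize_str("x = 1\n"))` yields pairs starting with `("NAME", "x")` and ending with an `ENDMARKER` entry.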
jankovicsandras/plpgsql_bm25 #3

Incremental Index Updates

Hello, very nice work! I am currently using paradedb for bm25 pg search, and I found this repo via Google when checking whether there is an alternative implementation. You say that creating bm25 index from ta…

magaton updated 4 days ago
1
42-Ikole-Systems/TMK-SH #7

Lexer implementation

Support tokenizing all tokens as listed in the standard from a read line. https://pubs.opengroup.org/onlinepubs/009695399/utilities/xcu_chap02.html#tag_02_10 Out of scope: here doc reading and to…

mraasvel updated 1 year ago
5
c2nes/javalang #99

Faulty unicode escape handling leads to tokenizing failure

It seems that javalang replaces unicode escapes back to the raw form (as pointed out in issue #58) in the `pre_tokenize` method before tokenizing. I don't get why this replacement is necessary (`pre_to…

xmcp updated 3 years ago
1

