korean-tokenizer Search Results

373 results
for korean-tokenizer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

apache/lucene #9677

How Nori Tokenizer can deal with Longest-Matching [LUCENE-86…

I think... Nori tokenizer has one issue. I don’t understand why “Longest-Matching” is NOT working to Nori tokenizer via config mode (config mode: Here is an example for explaining what is longe…

asfimport updated 2 years ago
5
mikemccand/stargazers-migration-test #630

How Nori Tokenizer can deal with Longest-Matching [LUCENE-86…

I think... Nori tokenizer has one issue. I don’t understand why “Longest-Matching” is NOT working to Nori tokenizer via config mode (config mode: Here is an example for explaining what is longe…

mikemccand updated 2 years ago
5
JohnSnowLabs/spark-nlp #7163

Lemmatization performance on Universal Dependency Treebanks

I am comparing the performance of the most popular lemmatization tools. I have found benchmark results for [Stanza](https://stanfordnlp.github.io/stanza/v100performance.html), [Trankit](https://tranki…

abdullah-alnahas updated 2 years ago
1
bigscience-workshop/multilingual-modeling #2

Incrementally adding new languages to pre-trained models

# Experiments design Follow discussion [here](https://docs.google.com/document/d/110tlidAcpiNteKnA27tR5KPS_VahNqYKqCeJlu1MWww/edit#heading=h.wmf5tyes1tfk) ## pointers to code and datasets ### …

hadyelsahar updated 2 years ago
9
toriving/KoEDA #4

Checklist for v0.0.4

- [ ] Improve WordNet / Synonyms (NIKLex, KorLex) - NIKLex : https://corpus.korean.go.kr/ - KorLex : http://korlex.pusan.ac.kr/ - [x] Improve README.md ~~docstring~~ - [x] requirements.txt ve…

toriving updated 2 years ago
2
huggingface/transformers #14559

Question about an error occurring while running hf_argparser…

## Environment info - `transformers` version: 4.3.2 - Platform: Google Colab - Python version: Python 3.7.12 - PyTorch version (GPU?): 1.8.0+cu111 (using Colab Pro - GPU/high-RAM) - Tensorflo…

seuly1203 updated 2 years ago
2
JohnSnowLabs/spark-nlp #7485

How to train Linear Chain CRF for word segmentation?

**Name of the Spark NLP feature whose docs need improvement:** Linear Chain CRF **What you think the docs should say:** Hi, I want to thank you for this great NLP project first. I am new to N…

jackieair updated 2 years ago
12
OpenKore/openkore #628

~~

pRO will be launching today for CBT let's all be updated for the new 3rd party programs we can use.... FF

weeetitit updated 2 years ago
2499
boostcampaitech2/mrc-level2-nlp-14 #3

[Dev] Retrieve Module 개발노트

Special Mission 3에서 언급된 `SparseRetrieval`와 `DenseRetrieval` 객체를 추상화시켜서 확장성있는 코드로 개발. 제공된 retrieval.py에는 boilerplate가 너무 많습니다. 이를 추상화시켜서 나중에 새로운 Retrieval를 추가해도 동작하도록 개발합니다. + End2End도 고민!

jinmang2 updated 2 years ago
8
woowacourse/prolog #277

검색 인프라를 구축한다.

- [x] ec2 배포 - [x] 인덱스 매핑 - [x] 벌크로 더미 데이터 구축 - [x] 검색 테스트 - [ ] 데이터 싱크 전략 - #355 - [ ] 보안그룹 설정 - 특정 ip만 허용하도록 하고 추후 고도화 하기

gracefulBrown updated 3 years ago
4

上一页 1...21 22 23 24 25 26 27...38 下一页

373 results for korean-tokenizer

373 results
for korean-tokenizer