tokenizing Search Results

1000+ results for tokenizing

gimseng/99-ML-Learning-Projects #148

[EXE] Determine which Naive Bayes algorithm is the Best to …

**Learning Goals** - To determine which of the five Naive Bayes algorithms (Gaussian, Categorical, Complement, etc.) performs best for sentiment analysis - The basics of NLP such as token…

daretechie updated 1 month ago
1
tensorflow/tensor2tensor #1498

How to use encode_without_tokenizing in text_encoder.py

### Description I ran into some trouble generating training data using t2t-datagen. The tokens are messy. ![image](https://user-images.githubusercontent.com/32641072/54520210-2756af0…

yourSylvia updated 5 years ago
6
otakustay/react-diff-view #210

The empty line is lost when copy after tokenize

I can't copy the empty line after tokenizing my diff: ![image](https://github.com/otakustay/react-diff-view/assets/13199771/deed6d5c-fe0f-4dad-8f25-ec9c01787049)

ensorrow updated 11 months ago
3
CosmicHorrorDev/rust_text_classifier #6

Try out different tokenizers?

The current tokenizer is pretty unaware of the structure of the text. Situations to improve upon would be ## tokenizing links Something like `http://www.google.com/useless/junk` gets transformed…

CosmicHorrorDev updated 3 years ago
2
boostcamp-5th-NLP05/level1_semantictextsimilarity-nlp-05 #15

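Spell-check preprocessing (hanspell) (2023/04/18)

# Spell-check preprocessing 📌 Hypothesis - Compare training results on data without spell correction against data with spell checking applied. Confirmed that train_loss decreased after applying hanspell ```python def correct_spell(self, text): spelled_sent = hanspell.spell_checker.check(t…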

yunjinchoidev updated 1 year ago
1
bowersd/textAnalysis #10

tokenizer punctuation

‘ (right/left curly single quote) is not split off from words when tokenizing

bowersd updated 1 year ago
1
dask/dask #10465

ParserError: Error tokenizing data. C error: EOF inside stri…

Hello. Testing out Dask to help me deal with over 46M rows of data. I'm loading it like so: `dask_df = dd.read_csv(FILE_PATH)` and when I, for example, look at the head I see the head of the…

duarteharris updated 1 year ago
1
patrik-piskay/react-truncate-markup #94

Flex layout and character tokenizing don't interact as expec…

I'm using a button with a flex layout whose label is a comma-delimited list of other labels. I'm using the characters tokenize strategy, but the only way to get the word-wrap to correctly truncate th…

meisterz39 updated 2 years ago
1
stylelint/css-parser #1

List of CSS Tokenizers

_This is a list of CSS Tokenizers._ _This issue is not intended for in depth discussion about any individual tokenizer or any aspect of CSS tokenizing._ - [`csslex`](https://github.com/keithamus/c…

romainmenke updated 7 months ago
4
pytorch/data #1270

Enable Append Mode in SaverIterDataPipe

### 🚀 The feature Currently, Saver only allows write mode and only lets users choose between byte and text mode. It might be useful to allow the flexibility to append to an existing file. ### Motivation, pitch…

rravu3 updated 5 months ago
1
