large-vocabulary Search Results

1000+ results
for large-vocabulary

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

deeplearning4j/deeplearning4j #5961

Using ParagraphVectors without labels

@raver119 I'm using `ParagraphVectors` in an unsupervised (unlabeled way). Rather than having a neural net with say ~50k words / NLP features, I'm constructing ParagraphVectors of 200-1000d in len…

apatzer updated 5 years ago
1
creativecommons/reversionary-rights #32

[Feature] Integrate Vocabulary Design System Across the Site

## Problem The vocabulary design system is not implemented across the site and that makes the site inconsistent with Creative Commons design system. This will help create visual uniformity and acces…

Silvia-Wachira updated 1 month ago
2
hassonlab/247-pickling #157

glove tokenizer

consider using the Stanford Tokenizer for glove. in their paper they say "We tokenize and lowercase each corpus with the Stanford tokenizer, build a vocabulary of the 400,000 most frequent words" and …

zkokaja updated 1 year ago
2
openrewrite/rewrite #4571

Support maximum line length / column limits formatting

## What problem are you trying to solve? Many style guides such as Google's impost maximum column limits (https://google.github.io/styleguide/javaguide.html#s4.4-column-limit). There is tooling such…

blipper updated 1 month ago
2
huggingface/transformers.js #803

Decoding Tokens added by the user for Whisper models

### Feature request Support decoding user defined added tokens that get added to end of the tokenizer's vocabulary for Whisper based models. This requires modifying the if statement in [_decode_asr](…

aravindMahadevan updated 5 months ago
2
kpu/kenlm #192

Filtering: the format of the target vocabulary

Hi, thanks for this tool. I have a very large language model and want to filter it according to a target vocabulary, is there a specific format for the vocabulary? If I have a test set, how to mat…

Amber819 updated 3 years ago
7
AILab-CVC/YOLO-World #484

Can large-scale pretraining achieve real open-vocabulary? 预训…

Recent works like YOLO-World and GroundingDINO mainly use Object365 and GoldG for pretraining. These methods do not use CLIP image encoder as the backbone (unlike some open-vocabulary detection method…

wangzishuo029 updated 3 weeks ago
2
dottxt-ai/outlines #795

Accelerate the index construction process

### What behavior of the library made you think about the improvement? My understanding of the index construction process is that for each state in the FSM, we need to iterate through all tokens in…

aeft updated 7 months ago
3
JuergenFleiss/atrain_core #24

Issues downloading models

Issues downloading models. all and base fail, but large-v3 works (Using conda env). Error: (atrain_core_env) C:\>aTrain_core load --model all C:\anaconda3\envs\atrain_core_env\lib\site-packages\aT…

aereimer updated 1 day ago
1
cf-convention/cf-convention.github.io #547

(Where to) include visualisations of the standard names on t…

At a [Hackathon session](https://cfconventions.org/Meetings/2024-Workshop.html) of the recent CF Workshop 2024, I began updating the code I created in 2020 to produce some visualisations of the standa…

sadielbartholomew updated 1 month ago
4

上一页 1...4 5 6 7 8 9 10...100 下一页

1000+ results for large-vocabulary

1000+ results
for large-vocabulary