corpus-analysis Search Results

1000+ results
for corpus-analysis

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

stanfordnlp/stanza #485

Add Abkhaz

How can we add the abkhazian language? There are a few resources like https://gitlab.com/Bachstelze/alp and https://github.com/danielinux7/Multilingual-Parallel-Corpus . Can we port those models to …

Bachstelze updated 3 years ago
4
embeddings-benchmark/mteb #905

Integrate InstructIR with MTEB

I am interested in integrating InstructIR into MTEB. Currently, the dataset for InstructIR is only available on GitHub (https://github.com/kaistAI/InstructIR) and not on Hugging Face. Could you advise…

henilp105 updated 3 months ago
10
bigscience-workshop/lam #41

Add dataset: royal_society_corpus

### A URL for this dataset https://fedora.clarin-d.uni-saarland.de/rsc/ ### Dataset description > The Royal Society Corpus (RSC) is based on the first two centuries of the [Philosophical Transactio…

davanstrien updated 2 years ago
5
nltk/nltk #2241

Add words to the Portuguese stopword list

A few very common words in Portuguese are not included in the stopword list. At least `é`, `ser` and `ter` should be included for consistency, since these are verbs whose other inflected forms are alr…

erickrf updated 5 years ago
2
mtpearce/idyom #42

Database Management/Upload

Hello, I've just downloaded IDyOM onto my laptop to use as part of my master's thesis, however after following the database management procedure as seen in the wiki section, IDyOM seems ubale to re…

ecm2021 updated 3 years ago
6
Computational-Content-Analysis-2020/Readings-Responses-Spring #1

Measuring Meaning & Counting Words - Orientation

Post questions here for one or both of week's orienting readings: Evans, James and Pedro Aceves. 2016. [“Machine Translation: Mining Text for Social Theory”](https://www.annualreviews.org/doi/abs/…

jamesallenevans updated 4 years ago
16
knowledgetechnologyuhh/OMGEmotionChallenge #9

How many unique speakers are there in data set ?

ajinkyakulkarni14 updated 5 years ago
2
datacamp/Brand-Analysis-using-Social-Media-Data-in-R-Live-Training #3

Notebook Review

Hi @vivekv73y and Sowmya :wave: Please read the key below to understand _how_ to respond to the feedback provided. Some items will require you to take action while others only need some tho…

adelnehme updated 4 years ago
10
ScandEval/ScandEval #435

[BENCHMARK DATASET REQUEST] NorBench

### Dataset name NorBench ### Dataset link https://github.com/ltgoslo/norbench ### Dataset languages - [ ] Danish - [ ] Swedish - [X] Norwegian (Bokmål or Nynorsk) - [ ] Icelandic - [ ] Faroese -…

Mikeriess updated 4 months ago
5
psal/JStylo-Anonymouth #1

Takes an extremely long time to run

I'm trying to process a set of 10,000 files using JStylo (with 12 possible authors), and it takes an extremely long time to generate the features (using the WriteLimits set). I've had it running for o…

dan-blanchard updated 12 years ago
11

上一页 1...19 20 21 22 23 24 25...100 下一页

1000+ results for corpus-analysis

1000+ results
for corpus-analysis