legal-corpus Search Results

583 results
for legal-corpus

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

UniversalDependencies/docs #412

Paragraph and document boundaries

UD guidelines currently do not specify how to mark document and paragraph boundaries and for many treebanks such information is not available (original text gone, sentences shuffled etc.) But where it…

dan-zeman updated 4 years ago
20
deepset-ai/FARM #244

What kind of text file require for fine-tuning the model?

Hello, I want to finetune the language model on domain-specific tasks. Could anyone tell me what kind of custom text file require for fine-tuning the model? Will it be okay if I put all sentences…

ankush20m updated 4 years ago
8
explosion/spaCy #4605

Questions and Directives for Pretraining

## My Environment * **spaCy version:** 2.2.2 * **Platform:** Linux-5.0.0-25-generic-x86_64-with-Ubuntu-18.04-bionic * **Python version:** 3.6.8 * **Machine:** AMD Ryzen Threadripper 2950X 16-Core…

pvcastro updated 4 years ago
3
TEIC/TEI #1776

add <w> to att.lexicographic

Just as is the case with well established usages of attributes native to att.lexicographic within the dictionary module, there are identical use-cases for these attributes that arise in the developmen…

iljackb updated 4 years ago
18
civio/verba #16

Usar frases como unidad de búsqueda / análisis

Hasta ahora estamos usando los fragmentos de los ficheros de entrada, `.vtt`. O sea, el texto de las 1-2 líneas que aparecen en un determinado momento en pantalla. Lo hacíamos así porque era lo más se…

dcabo updated 4 years ago
4
statedecoded/statedecoded #245

Create a CourtListener interface

[CourtListener](http://www.courtlistener.com/) is starting to add state Supreme Court decisions to their offerings, and intends to add all fifty states. Consequently, it is sensible to bake support fo…

waldoj updated 4 years ago
99
huggingface/transformers #1272

How long does it take? (BERT Model Finetuning using Masked M…

I am about to finetune a multilingual BERT model using English and Chinese text from the legal domain. My corpus is around 27GB, how long should I expect to train 3 epochs (default parameters) us…

echan00 updated 4 years ago
1
piskvorky/gensim #1701

Doc2Vec training hangs

#### Description Hi, I tried training a model, with ``` from gensim.models import Doc2Vec model = Doc2Vec(min_count=1, window=10, size=100, sample=1e-4, negative=5, workers=7) model.…

tarun-t updated 4 years ago
4
common-voice/common-voice #2236

Request to add `Karakalpak` language to the list

I wanted to start contributing with our local students to [Karakalpak](https://en.wikipedia.org/wiki/Karakalpak_language) corpus. Thanks in advance!

beknazar updated 4 years ago
10
harvard-lil/perma #2634

Excessively long form-encoded data in index results in CDXEx…

``` Jun 19 19:26:53 ip-172-31-58-70 docker-compose[7160]: app_1 | wr.io: 2019-06-19 19:26:53: [ERROR]: 500 (Internal Server Error) raised by https://wr.perma-archives.org/public/n3ry-mj6…

rebeccacremona updated 4 years ago
6

上一页 1...43 44 45 46 47 48 49...59 下一页

583 results for legal-corpus

583 results
for legal-corpus