issues
search
PolMine
/
bignlp
Tools to process large corpora line-by-line and in parallel mode
1
stars
1
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
`corenlp_parse_conll()` throws warning
#41
ablaette
opened
1 year ago
0
Unknown untokenizable character
#40
ablaette
closed
1 year ago
0
Untokenizable token not previously encountered
#39
ablaette
closed
1 year ago
1
Explain installation of CoreNLP in README
#38
ablaette
opened
1 year ago
0
Update to StanfordCoreMLP 4.5.2
#37
ablaette
opened
1 year ago
0
corenlp_annotate() | creating new nodes removes potential attributes
#36
ChristophLeonhardt
closed
1 year ago
1
Minor remarks for the documentation of corenlp_annotate()
#35
ChristophLeonhardt
opened
2 years ago
0
Potential integer column type of parsed ConLL output
#34
ablaette
closed
2 years ago
2
Parsing conll output breaks if input/output includes "#"
#33
ablaette
opened
3 years ago
1
Show Java encoding upon initialization
#32
ablaette
closed
2 years ago
1
Windows CI
#31
ablaette
opened
3 years ago
0
Reconsider dependency on DT and webshot
#30
ablaette
opened
3 years ago
0
Vignette | Minor Remarks
#29
ChristophLeonhardt
opened
3 years ago
0
License of REUTERS data
#28
ablaette
opened
3 years ago
0
corenlp_parse_conll | Hashtags in input cause trouble in read.table
#27
ChristophLeonhardt
opened
3 years ago
0
Erroneous error messages and warnings
#26
ChristophLeonhardt
opened
3 years ago
0
AnnotationList$as.data.table() results in Illegal reflective access warning
#25
ablaette
closed
2 years ago
2
Input in AnnotationList in dev branch
#24
ChristophLeonhardt
opened
3 years ago
0
No colnames of CoNLL output format
#23
ablaette
closed
3 years ago
1
Setting output_format twice necessary?
#22
ablaette
closed
3 years ago
1
Setting the _Output format
#21
ablaette
closed
3 years ago
0
Different results of byline and in-memory processing
#20
ablaette
closed
3 years ago
2
Parallel processing using AnnotatorPipeline$annotate()
#19
ablaette
closed
3 years ago
1
pad output names of segment()
#18
ChristophLeonhardt
closed
3 years ago
1
Process data that has already been tokenized
#17
ablaette
opened
3 years ago
3
Hard coded setting of JVM heap space
#16
ablaette
closed
3 years ago
1
Effect of 'prettyPrint'
#15
ablaette
closed
3 years ago
1
Use CoreNLP parallelization
#14
ablaette
closed
3 years ago
4
Stanford CoreNLP 4.0.0 | German NER model leads to broken json
#13
ChristophLeonhardt
closed
3 years ago
6
Stanford NLP installation paths lead nowhere in versions older than 4.0.0
#12
ChristophLeonhardt
closed
4 years ago
1
corenlp_annotate on logging branch without (or with irritating) progress indication
#11
ChristophLeonhardt
closed
3 years ago
1
Use Java Parallelization
#10
ablaette
closed
3 years ago
2
Coping with the rJava installation
#9
ablaette
closed
3 years ago
1
corenlp_annotate,character-method: Argument progress ineffective
#8
PolMine
closed
3 years ago
1
Loading bignlp: Warning "Registered S3 method overwritten"
#7
PolMine
closed
3 years ago
1
superfluous quotation marks | remove for parallelized option as well
#6
ChristophLeonhardt
opened
4 years ago
0
Suggestion | corenlp.R install French
#5
ChristophLeonhardt
opened
4 years ago
0
No Output for corenlp_parse_ndjson
#4
ChristophLeonhardt
closed
3 years ago
2
Memory Usage with large data
#3
ChristophLeonhardt
closed
3 years ago
4
Output files in chunk_table_split
#2
ChristophLeonhardt
closed
3 years ago
1
"NA TOLL" error
#1
ChristophLeonhardt
closed
3 years ago
1