ropensci/textreuse
Detect text reuse and document similarity
https://docs.ropensci.org/textreuse
197 stars, 33 forks
issues
New Maintainer Welcome :-)
#97
maelle
closed
6 months ago
0
Deprecated feature reported in lsh-function
#96
Lrantala
opened
1 year ago
1
Punctuation align_local
#95
EtienneFerrandi
opened
3 years ago
0
align_local documentation
#94
under-score
closed
4 years ago
2
Adding new documents to a LSH object
#93
retrography
opened
4 years ago
0
Parallel lsh_compare
#92
retrography
closed
4 years ago
0
Move "lsh_buckets" class to the left
#91
romainfrancois
closed
3 months ago
1
Inconsistent skipping behavior in TextReuseCorpus
#90
tylerandrewscott
opened
4 years ago
3
Added encoding argument to TextReuseCorpus and TextReuseTextDocument
#89
davidfuhry
opened
4 years ago
1
Short documents and skip_grams assertion do not match
#88
awagner-mainz
opened
5 years ago
2
Add official docs url to description
#87
jeroen
closed
3 months ago
1
jaccard_similarity result?
#86
bihappywater
closed
5 years ago
1
record linkage using textreuse
#85
bihappywater
closed
5 years ago
1
Apparently `min(bitwXor(h, i)` is fed a double at times when it wants integers
#84
ghost
closed
6 years ago
6
Appveyor webhook
#83
maelle
closed
6 years ago
4
Error when calculating local_alignment
#82
ManuelBurghardt
opened
6 years ago
1
Added rOpenSci review badge
#81
karthik
closed
7 years ago
1
Operation with unevaluated n_call
#80
quartin
closed
5 years ago
2
Database backends
#79
lmullen
opened
7 years ago
1
Implement a method like the one described in Smith, Cordell, Mullen
#78
lmullen
opened
7 years ago
0
Add Rcpp interrupts
#77
lmullen
opened
7 years ago
0
Typo: search for "two few words"
#76
lmullen
opened
7 years ago
0
Error when using functions from tokenizers package
#75
mdlincoln
closed
7 years ago
2
Merging of corpora
#74
Ninoninoninonino
closed
7 years ago
2
can this packages support chinese corpus
#73
AlexYoung757
closed
7 years ago
1
Question: Reuse minhash functions
#72
iainmwallace
closed
7 years ago
2
Implement earth mover distances
#71
lmullen
opened
8 years ago
0
Depend on LSHR package
#70
lmullen
opened
8 years ago
0
Parallelize lsh_compare()
#69
lmullen
opened
9 years ago
1
Extra newline for print method for local alignments
#68
lmullen
opened
9 years ago
0
Redo matrix methods
#67
lmullen
opened
9 years ago
0
Some problem with lsh() function and data_frame?
#66
vmustafa
closed
9 years ago
6
Problem with converting to matrix
#65
lmullen
closed
9 years ago
1
Try to build a Corpus from character vector and got an error
#64
pommedeterresautee
closed
9 years ago
3
Set interactive = FALSE in all vignettes
#63
lmullen
closed
9 years ago
0
Re-documents imported/exported functions with roxygen 5.0
#62
lmullen
closed
9 years ago
0
switch from CharacterVector to a string vector
#61
Ironholds
closed
9 years ago
1
Parallelize wordcount.TextReuseCorpus?
#60
lmullen
closed
9 years ago
1
Define variables in sw_matrix only once
#59
noamross
closed
9 years ago
1
Function to query potential matches for just one (or more) documents from buckets
#58
lmullen
closed
9 years ago
0
TextReuseCorpus does not always emit warnings when skipping short documents
#57
lmullen
closed
9 years ago
0
Add citation to original lsh/minhash paper
#56
lmullen
closed
9 years ago
0
Fix bug with blank ID in skipped documents
#55
lmullen
closed
9 years ago
0
Parallelize text reuse corpus?
#54
lmullen
closed
9 years ago
1
Keep more information in alignment objects
#53
lmullen
closed
9 years ago
0
Function to write alignment object to a file
#52
lmullen
closed
9 years ago
0
Add a minhashes element to a document/corpus
#51
lmullen
closed
9 years ago
1
Add vignette pipeable
#50
lmullen
closed
9 years ago
0
Implement Smith-Waterman local sequence alignment
#49
lmullen
closed
9 years ago
1
Performance regression in skipping documents?
#48
lmullen
closed
9 years ago
1