ropensci / textreuse

Detect text reuse and document similarity
https://docs.ropensci.org/textreuse

Redo matrix methods #67

Open lmullen opened 9 years ago

lmullen commented 9 years ago

There should be a way to get both a dense matrix and a sparse matrix out of the textreuse candidates data frame. The sparse matrix should be in the format that apcluster expects.
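
A rough sketch of what this conversion could look like, not a proposal for the final API. It assumes the candidates data frame has the columns `a`, `b`, and `score`, that the Matrix package is available, and that apcluster accepts a sparse similarity matrix of class `dgTMatrix`. The sample data and the variable names `ids`, `sparse_sim`, and `dense_sim` are made up for illustration.

```r
library(Matrix)

# Hypothetical candidates data frame; the columns a, b, score are assumed.
candidates <- data.frame(
  a = c("doc1", "doc1", "doc2"),
  b = c("doc2", "doc3", "doc3"),
  score = c(0.8, 0.1, 0.4),
  stringsAsFactors = FALSE
)

# Map document ids to row/column positions.
ids <- sort(unique(c(candidates$a, candidates$b)))
i <- match(candidates$a, ids)
j <- match(candidates$b, ids)

# Fill both triangles so the similarity matrix is symmetric.
sparse_sim <- sparseMatrix(
  i = c(i, j),
  j = c(j, i),
  x = c(candidates$score, candidates$score),
  dims = c(length(ids), length(ids)),
  dimnames = list(ids, ids)
)

# Convert to triplet form (assumed to be what apcluster wants for sparse input).
sparse_sim <- as(sparse_sim, "TsparseMatrix")

# Dense version; pairs that are not candidates become 0.
dense_sim <- as.matrix(sparse_sim)
```

Note that in the dense version, pairs that never appeared as candidates are indistinguishable from pairs with a score of 0, and the diagonal is left at 0. Whether that is acceptable depends on how the matrix is used downstream.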