-
The gallery was flooded with spam and inappropriate messages. I have removed the worst content and am now attempting to write some code to bulk remove the repeated sketches.
If anyone can offer any…
-
I'd like to run the application on huge picture collection (1 million images). That would require 10^12 comparisions. I'd like to propose improvement which would require 10^7 (N*logN) comparisions.
T…
-
Hi Maarten,
I'm having an issue with some important words not appearing within CountVectorizer when using `min_df` even though they are well above the set threshold. My understanding of `min_df` i…
-
## Project description
Sometimes one needs to extract a part of another project, which unfortunately is a monorepo, into an own repo.
Often there may be multiple forks of the same project in the f…
-
As per title. Are we including scene markers in this?
I've been hesitant to add things like duration and resolution to the scene schema, since there may be multiple files pointing to the same scene…
-
I tested the Mastiff Docker image on a Fedora 23 64-bit VM and ran into some errors. It looks like the script is setting the log file to the work directory (/home/nonroot/workdir in my Docker contain…
-
**Is your feature request related to a problem? Please describe.**
On some controversial subjects, authors who are faced with notes that contradict them simply delete their tweet and republish it a…
-
It may become very useful to some to provide approximate substring matching. This reports the smallest edit distance of the `needle` in all possible substrings of `haystack`. Here are some examples:
…
-
Hi, I have a question about large-scale LSH index. If I have billions of documents, I suppose even 1T RAM is not enough to do in-memory LSH, is there any recommended way to use datasketch for this sce…
-
No, this is not the title of a book written by Stephen Hawking. This issue aims to outline a proposal for future developments to the `UTCDateTime` object and to focus discussion on the topic that has …