-
Greetings,
Just wanted to know what technique/implementation you suggest for finding similar sentences over a very large corpus? 31M sentences, for instance!
Worth mentioning that my question i…
-
We have some exciting proposed changes that will use `BINARY` doc values, e.g. [using block compression in the default codec](https://github.com/apache/lucene-solr/pull/1234#pullrequestreview-35307318…
-
```
$ cargo build
Updating crates.io index
error: failed to select a version for the requirement `ahash = "^0.1.18"`
candidate versions found which didn't match: 0.8.0, 0.7.6, 0.7.5, ...
loca…
-
https://fullstackdeeplearning.com/llm-bootcamp/spring-2023/
- [Learn to Spell: Prompt Engineering](https://fullstackdeeplearning.com/llm-bootcamp/spring-2023/#learn-to-spell-prompt-engineering)
- …
-
**Summary**
## Tasks
- [x] https://github.com/datafuselabs/databend/pull/10737
- [ ] #10769
- [x] https://github.com/datafuselabs/databend/issues/10775
- [x] Implement `ai_embedding_vector()` t…
-
We can split ANN algorithms into three distinct categories: trees, hashes, and graphs.
The following represent possible algorithmic approaches. For each approach there are typically variants. Note …
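
To make the "hashes" category concrete, here is a minimal sketch of random-hyperplane locality-sensitive hashing in pure Python. It is one common hashing variant, not necessarily one of the approaches discussed in this issue. The function names (`make_hyperplanes`, `signature`, `build_index`, `query`) and the parameters `n_bits` and `seed` are hypothetical, chosen only for illustration.

```python
import math
import random
from collections import defaultdict


def make_hyperplanes(dim, n_bits, seed=0):
    """Draw n_bits random hyperplane normals (illustrative parameters)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]


def signature(vec, planes):
    """One bit per hyperplane: which side of the plane the vector falls on."""
    return tuple(
        sum(p * v for p, v in zip(plane, vec)) >= 0.0 for plane in planes
    )


def build_index(vectors, planes):
    """Group vector ids into buckets keyed by their bit signature."""
    buckets = defaultdict(list)
    for i, vec in enumerate(vectors):
        buckets[signature(vec, planes)].append(i)
    return buckets


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


def query(vec, vectors, planes, buckets):
    """Return ids from the query's bucket, ranked by exact cosine similarity."""
    candidates = buckets.get(signature(vec, planes), [])
    return sorted(candidates, key=lambda i: cosine(vec, vectors[i]), reverse=True)


# Usage: vectors pointing the same way share a bucket; opposite ones do not.
vectors = [[1.0, 0.5, -0.2], [2.0, 1.0, -0.4], [-1.0, -0.5, 0.2]]
planes = make_hyperplanes(dim=3, n_bits=8)
buckets = build_index(vectors, planes)
neighbours = query(vectors[0], vectors, planes, buckets)
```

Only the vectors in the query's bucket are compared exactly, which is where the speed-up over a full scan comes from. The trade-off is recall: a true neighbour that lands on the other side of any one hyperplane is missed, so practical systems typically use several independent hash tables.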
-
* face_recognition version: Last
* Python version: 3
* Operating System: Ubuntu
Hello!
I want to collect pre-calculated encodings of more than 1 million faces in a database (MySQL, PostgreSQL...?) w…
-
How do we want the top-level functions?
Even in the two use cases in the gallery, we have one with test data (the naive Bayes classifier) and one without. Should we always have a `Rcpp::Nullable` t…
-
Let me start the discussion here.
I'm planning to add a new method for feature matching, BDH (Bucket Distance Hashing), which was announced at ICCV 2013
https://github.com/opu-imp/BDH
paper…
-
https://github.com/granne/granne
Saw this from https://0x65.dev/blog/2019-12-07/indexing-billions-of-text-vectors.html