-
- [ ] [Introducing the next generation of Claude \ Anthropic](https://www.anthropic.com/news/claude-3-family)
# Introducing the next generation of Claude \ Anthropic
**DESCRIPTION:**
"A new standa…
-
Hi,
I would like to create my own domain-specific "stsb" datset to further improve performance.
I have a 500 GB domain specific text corpus and want to use / label some of the sentence pairs.
Do …
-
#### Summary
Achieve self repairing uploads by checking for lost chunks and regenerating original data chunks / parity chunks for reupload for erasure coded uploads.
If some chunks from an era…
-
Some newer embedding models such as https://huggingface.co/intfloat/e5-mistral-7b-instruct require a one-sentence instruction that describes the retrieval task in addition to the content that should b…
-
Hi,
I experienced issues when working with the colbert example.
I trained the model as per: https://github.com/texttron/tevatron/tree/main/examples/colbert
I then encoded the corpus and queri…
-
After spending half of the day to fix the build, which finally worked, turned out that most of tests fail on the same spot:
```
/opt/local/var/macports/build/_opt_PPCSnowLeopardPorts_databases_til…
-
**Idea**
- Use mobb.ninja/docs as a datasource to create [QA over docs](https://python.langchain.com/en/latest/use_cases/question_answering.html)
**Inspiration**
- [ChatLangChain - Chatbot speci…
-
**Proposal**: Assess critical variables affecting semantic search quality on legal corpus before implementing semantic search on CL.
**Key Variables**:
- Embedding model
- Chunking strategy
- I…
-
This is the issue where I report my progress on the project.
-
Expected:
When I enable hybrid and im using Elasticsearch v8.14+, I should be able to retrieve documents from both a bm25 based search and NN search using dense/sparse.
Actual:
Cannot perform the se…