-
Compare
http://localhost:8044/swagger-ui/#/segmentation-controller/getCharacterTextSplitterUsingPOST
with
https://js.langchain.com/docs/modules/data_connection/document_transformers/
https:/…
-
We have a simulated MSv2 data which we use for testing purposes on our workstations. The data has following dimensions:
Time: 120
Baseline: 1,30,816
Channels: 150
Polarizations: 1 (XX)
The ov…
-
I'm noticing that GELF does have a chunking part of its protocol too, for multi-part messages when using UDP:
```
Prepend the following structure to your GELF message to make it chunked:
Chunke…
-
### 🚀 The feature, motivation and pitch
We want to support various alignment and distillation loss functions.
Refer this PR on ORPO: #362
## Progress
### Alignment
- [x] ORPO https://gith…
-
# Tokenizer Import Error When Using Ollama Models
## Description
When attempting to use Ollama models (llama3, llama3.1, mistral), the application fails due to a tokenizer import error. The error …
-
open software for large data analysis tasks leads to problems of compute strategy and large data conveyance how does your project deal with this?
[Session Notes](https://docs.google.com/document…
-
Hi,
Thanks for releasing MaskGCT! Are there any plans to support long-form speech synthesis besides using chunking?
Thanks!
-
I really don't want to promote llm/genai approaches, but if I have to, it may be interesting to use notebooks as the corpus in a RAG system.
As well as interestingness in the RAG chunking strategie…
-
I could be doing something wrong, but I've come across what appears to be a bug in the initialisation of `generate_contexts` method.
Each time this is called, contexts (and others) are being initiali…
-
Currently ADRIA runs are stored in a Zarr data store, chunking data on a _per scenario_ basis.
This means there are potentially $n$ files created, where $n$ is equal to the number of scenarios.
Th…