-
### Describe the enhancement requested
There has been recent work to move the [ChunkResolver](https://github.com/apache/arrow/issues/34535) to the public API.
`ChunkResolver` uses `O(log(num_chunks))…
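To illustrate the kind of logarithmic lookup mentioned above, here is a minimal sketch of resolving a logical index to a chunk with a binary search over cumulative offsets. It is not Arrow's implementation; `build_offsets` and `resolve` are hypothetical names used only for this example.

```python
from bisect import bisect_right
from itertools import accumulate


def build_offsets(chunk_lengths):
    """Return cumulative start offsets, with the total length as the last entry."""
    return [0, *accumulate(chunk_lengths)]


def resolve(offsets, index):
    """Map a logical index to (chunk_index, index_in_chunk) in O(log(num_chunks))."""
    if not 0 <= index < offsets[-1]:
        raise IndexError(index)
    # bisect_right skips over empty chunks that share the same start offset.
    chunk = bisect_right(offsets, index) - 1
    return chunk, index - offsets[chunk]


offsets = build_offsets([3, 0, 2])  # three chunks, the middle one empty
print(resolve(offsets, 3))          # first element of the last chunk
```

The offsets are computed once per chunked array, so each lookup only pays for the binary search.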
-
```python
from crawl4ai import WebCrawler
from crawl4ai.chunking_strategy import SlidingWindowChunking
from crawl4ai.extraction_strategy import LLMExtractionStrategy
crawler = WebCrawler()
…
```
-
300 wps, 3 words per chunk
Words that are too long for the chunk are split across 2 lines, and that interferes with the reading "flow".
My ideas for this issue: 1) downsize the font to match the window size 2) display less word…
-
### What
- We use redoc to render the API documentation. This is cool, but it is a bit hard to see the structure of the data there, and not easy to split into chunks.
- We split some of the JSON s…
-
## What do you want
Instead of just sending a text file when charCount > 2000, I would like cordless to split the text into multiple chunks, each ending with (currentChunk/chunkCount).
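A rough sketch of the requested behaviour, written in Python purely for illustration (cordless itself is not written in Python). The function name `split_with_counter` and the exact suffix format `" (i/n)"` are assumptions, and the split is done on raw character counts without regard to word boundaries.

```python
def split_with_counter(text, limit=2000):
    """Split text into parts of at most `limit` characters, each ending with (i/n)."""
    if len(text) <= limit:
        return [text]
    digits = 1
    while True:
        # Reserve room for the longest possible suffix, e.g. " (12/12)".
        suffix_len = len(" (/)") + 2 * digits
        body_len = limit - suffix_len
        if body_len <= 0:
            raise ValueError("limit too small for the counter suffix")
        count = -(-len(text) // body_len)  # ceiling division
        if len(str(count)) <= digits:
            break
        # The chunk count needs more digits than reserved, so try again.
        digits += 1
    return [
        f"{text[i * body_len:(i + 1) * body_len]} ({i + 1}/{count})"
        for i in range(count)
    ]


parts = split_with_counter("x" * 4500)
print(len(parts), [len(p) for p in parts])
```

The counter is part of each message, so its length has to be reserved before the chunk count is known. The loop widens the reservation until the count fits.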
## Why
…
-
This is the first revision of the Chunked Exports proposal. All changes made as a result of this discussion can
be tracked in the repo: https://github.com/CommonMCOBJ/CommonMCOBJ/blob/main/V2_Propos…
-
### Describe your problem
How to make overlapping chunking for docx, pdf or txt? The chunking methods seemed to split contents with no overlapping. This way, if the answer appears in the middle of tw…
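As a generic illustration of what overlapping chunking means, here is a minimal sliding-window sketch over plain text that has already been extracted from the document. It is not the project's built-in chunker; `overlapping_chunks`, `chunk_size`, and `overlap` are hypothetical names, and sizes are counted in characters rather than tokens.

```python
def overlapping_chunks(text, chunk_size=200, overlap=50):
    """Return chunks of up to `chunk_size` characters, each sharing
    `overlap` characters with the previous chunk."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the window has reached the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks


print(overlapping_chunks("abcdefghij", chunk_size=4, overlap=2))
```

With an overlap, a passage that straddles a chunk boundary still appears whole in at least one chunk, provided it is no longer than `overlap + 1` characters.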
-
## Context
Bee 2.1.0-rc2 and earlier
### Summary
The Sepolia testnet is in a state where many neighborhoods have split from storage radius 4 to 5. There are some (5 out of the original …
-
Currently there is no CI running the tests. This is probably mostly due to the tests relying on testnet. It would be great if the tests could run against a regtest node so that a CI can be set up to r…
-
./exo-cli-3.1-70b.sh hello
Go for :
#!/bin/bash
/usr/bin/curl --progress-bar --connect-timeout 1800 --max-time 1800 http://edgenode2:8000/v1/chat/completions \
-H "Content-Type: application/json…