This PR modifies _get_sub_docs to use the separator passed into the LlamaParse constructor. I'm making this change as the string \n---\n occurs occasionally in our documents. If pagination is important, we need to use a separator less likely to occur in our documents such as \n$$$$$$$$\n.
Testing
CLI
Automated Tests passed
% export LLAMA_CLOUD_API_KEY=llx-[...]
% make test
pytest tests
====================================================================== test session starts =======================================================================
platform darwin -- Python 3.11.7, pytest-8.2.2, pluggy-1.5.0
rootdir: /Users/areichert/Documents/llama_parse
configfile: pyproject.toml
plugins: anyio-4.4.0
collected 3 items
tests/test_reader.py ... [100%]
======================================================================= 3 passed in 15.01s =======================================================================
Test Script
We parsed this two page document, which has a \n---\n where the background color changes.
[...]
**TO HELP ENGAGE EMPLOYEES**
---
Fulkrum has been providing inspection, [...]
Summary
This PR modifies
_get_sub_docs
to use the separator passed into theLlamaParse
constructor. I'm making this change as the string\n---\n
occurs occasionally in our documents. If pagination is important, we need to use a separator less likely to occur in our documents such as\n$$$$$$$$\n
.Testing
CLI
Automated Tests passed
Test Script
We parsed this two page document, which has a
\n---\n
where the background color changes.Test Script
Results