timpaul / form-extractor-prototype

A prototype of a tool that generates web forms from document forms
MIT License
376 stars, 59 forks

Claude can only parse a few pages before the output length limit is reached #30

Open timpaul opened 4 months ago

timpaul commented 4 months ago

The maximum output length is 4096 tokens. That seems to be enough only for the JSON representing 2 or 3 pages' worth of questions.

I think the way to go is to process each page of the document in a separate call to the API.
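A rough sketch of the idea, in Python for brevity. Nothing here is existing project code: `extract_page` is a hypothetical stand-in for whatever function sends a single page to the API and returns the parsed JSON for it, and the `questions` key and the page-as-bytes representation are assumptions too.

```python
from typing import Any, Callable, Dict, List

Page = bytes                 # assumption: each page is available as image bytes
PageResult = Dict[str, Any]  # assumption: parsed JSON returned for one page


def extract_form(
    pages: List[Page],
    extract_page: Callable[[Page, int], PageResult],
) -> Dict[str, Any]:
    """Call the (hypothetical) per-page extractor once per page and merge the results."""
    merged: Dict[str, Any] = {"pages": []}
    for number, page in enumerate(pages, start=1):
        # One API call per page, so each response only has to fit
        # a single page's questions within the output limit.
        result = extract_page(page, number)
        merged["pages"].append(
            {"page": number, "questions": result.get("questions", [])}
        )
    return merged
```

Because each call only returns one page's JSON, the 4096-token limit applies per page rather than to the whole document.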

This would also allow us to return results much sooner. You could process the first page, and then either continue processing the others in the background or process them only when the user requests them.
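The "only when requested" variant could look something like the sketch below, again in Python and again with made-up names. `LazyFormExtractor` and `extract_page` are hypothetical; `extract_page` stands for whatever function makes one API call for one page.

```python
from typing import Any, Callable, Dict, List


class LazyFormExtractor:
    """Processes a page only when it is first asked for, then caches the result."""

    def __init__(
        self,
        pages: List[bytes],
        extract_page: Callable[[bytes, int], Dict[str, Any]],
    ) -> None:
        self._pages = pages
        self._extract_page = extract_page  # hypothetical per-page API call
        self._cache: Dict[int, Dict[str, Any]] = {}

    def get_page(self, number: int) -> Dict[str, Any]:
        """Return the result for a 1-based page number, calling the API at most once."""
        if not 1 <= number <= len(self._pages):
            raise IndexError("page %d out of range" % number)
        if number not in self._cache:
            self._cache[number] = self._extract_page(self._pages[number - 1], number)
        return self._cache[number]

    def process_remaining(self) -> None:
        """Fill in every page that has not been processed yet."""
        for number in range(1, len(self._pages) + 1):
            self.get_page(number)
```

The first page can be shown as soon as `get_page(1)` returns. `process_remaining()` could then be run from a background job for the first option, or skipped entirely for the second, leaving later pages to be fetched as the user reaches them.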