nlmatics / nlm-ingestor

This repo provides the server-side code that the llmsherpa API connects to. It includes parsers for various file formats.
Apache License 2.0

JSON Decode error when #84

Open shumin018 opened 1 month ago

shumin018 commented 1 month ago

Hello, I've followed the instructions to host llmsherpa on my own resources, but when I try to access it via an external URL, I get a JSON decode error:

JSONDecodeError: Expecting value: line 1 column 1 (char 0)

Any help, please? Thanks!

code -

from llmsherpa.readers import LayoutPDFReader
llmsherpa_api_url = ''
pdf_url = ""

pdf_reader = LayoutPDFReader(llmsherpa_api_url)
doc = pdf_reader.read_pdf(pdf_url)

shubhampatwa commented 1 month ago

Check whether the nlm-ingestor server is actually running.

JSONDecodeError: Expecting value: line 1 column 1 (char 0) arises when the server is not working: the client tries to parse a response body that is empty or not JSON.
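
To see why the message always points at line 1 column 1, here is a minimal sketch using only the standard `json` module. The helper name `diagnose_body` and the sample bodies are hypothetical, made up for illustration. They are not part of llmsherpa or nlm-ingestor.

```python
import json


def diagnose_body(status_code: int, body: str) -> str:
    """Hypothetical helper: report whether a response body parses as JSON."""
    try:
        json.loads(body)
    except json.JSONDecodeError as exc:
        # An empty body or an HTML error page fails at the very first character.
        return f"HTTP {status_code}: body is not JSON ({exc})"
    return f"HTTP {status_code}: body is valid JSON"


# Illustrative inputs only: an HTML error page and a small JSON document.
print(diagnose_body(404, "<!doctype html><title>404 Not Found</title>"))
print(diagnose_body(200, '{"status": "ok"}'))
```

So the error says nothing about the PDF itself. It only means that whatever came back from the URL was not JSON, which is why checking the HTTP status and the raw body is a useful first step.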

emilyweber35 commented 1 month ago

I have gotten the same error. My URL says the service is running.

Quang-elec44 commented 2 weeks ago

I'm having the same issue too. Here is my code:

from llmsherpa.readers import LayoutPDFReader

llmsherpa_api_url = "http://localhost:5010/api/document/developer/parseDocument?renderFormat=all"
pdf_url = "myfile.pdf" # also allowed is a file path e.g. /home/downloads/xyz.pdf
pdf_reader = LayoutPDFReader(llmsherpa_api_url)
doc = pdf_reader.read_pdf(pdf_url)

In the container, here is the log:

 * Serving Flask app '__main__'
 * Debug mode: off
WARNING: This is a development server. Do not use it in a production deployment. Use a production WSGI server instead.
 * Running on all addresses (
 * Running on
 * Running on
Press CTRL+C to quit
 - - [10/Sep/2024 04:11:06] "POST /api/document/developer/parseDocument?renderFormat=all HTTP/1.1" 404 -
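
The 404 in the log means the Flask app has no route at that path, so the client most likely receives an HTML error page and then fails to decode it as JSON. A quick way to confirm this is to send a bare POST and look at the status before llmsherpa tries to parse anything. The sketch below uses only the standard library, and the helper name `probe` is hypothetical. As far as I know, the README's self-hosted example uses `/api/parseDocument?renderFormat=all` rather than `/api/document/developer/parseDocument`, but treat that as an assumption to check against the README.

```python
import urllib.error
import urllib.request


def probe(url: str, timeout: float = 10.0) -> tuple[int, str]:
    """Hypothetical helper: POST an empty body and return (status, body preview)."""
    request = urllib.request.Request(url, data=b"", method="POST")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status, response.read(200).decode("utf-8", "replace")
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses are raised as exceptions but still carry a status and body.
        return err.code, err.read(200).decode("utf-8", "replace")


# Example usage (needs the container to be reachable, so it is left commented out):
# print(probe("http://localhost:5010/api/document/developer/parseDocument?renderFormat=all"))
# print(probe("http://localhost:5010/api/parseDocument?renderFormat=all"))  # assumed path
```

A 404 from the first URL and any other status from the second would suggest the problem is the path in `llmsherpa_api_url`, not the server being down. If the server is not reachable at all, `urlopen` raises `urllib.error.URLError` instead, which this sketch deliberately does not catch.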