Closed Minxiangliu closed 3 months ago
@Minxiangliu Does it relate to the tokenization mechanism?
Hi @fishfree ,
Thank you for your comment, but I'm not sure if this is the issue.
I noticed in the ollama serve log that there's a HTTP/1.1 503 Service Unavailable error
—perhaps it's due to an overwhelming number of server requests?
However, I'm unsure how to configure it.
Is this the correct approach?
export OLLAMA_MAX_QUEUE=1024
Interestingly, after restarting the computer and the container, I was able to build the knowledge graph (KG) again. Upon closer inspection, I also noticed some 500 errors in the ollama serve log. Do you have any insights on this?
[GIN] 2024/08/12 - 02:07:06 | 200 | 2m30s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:07:08 | 200 | 2m32s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:07:32 | 200 | 2m56s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:07:37 | 200 | 3m2s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:07:40 | 200 | 3m4s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:07:50 | 200 | 3m14s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:08:05 | 500 | 3m30s | 127.0.0.1 | POST "/v1/chat/completions"
time=2024-08-12T02:08:05.722Z level=ERROR source=server.go:719 msg="Failed to acquire semaphore" error="context canceled"
[GIN] 2024/08/12 - 02:08:05 | 500 | 3m30s | 127.0.0.1 | POST "/v1/chat/completions"
time=2024-08-12T02:08:05.723Z level=ERROR source=server.go:719 msg="Failed to acquire semaphore" error="context canceled"
[GIN] 2024/08/12 - 02:08:05 | 500 | 3m30s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:08:05 | 500 | 3m30s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:08:05 | 500 | 3m30s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:08:05 | 500 | 3m30s | 127.0.0.1 | POST "/v1/chat/completions"
time=2024-08-12T02:08:05.722Z level=ERROR source=server.go:719 msg="Failed to acquire semaphore" error="context canceled"
[GIN] 2024/08/12 - 02:08:05 | 500 | 3m30s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:08:05 | 500 | 3m30s | 127.0.0.1 | POST "/v1/chat/completions"
time=2024-08-12T02:08:05.723Z level=ERROR source=server.go:719 msg="Failed to acquire semaphore" error="context canceled"
[GIN] 2024/08/12 - 02:08:05 | 500 | 3m30s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:08:29 | 500 | 3m30s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:08:35 | 200 | 3m9s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:08:43 | 500 | 3m30s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:08:48 | 500 | 3m30s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:09:09 | 200 | 3m21s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:09:14 | 200 | 3m21s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:09:23 | 200 | 3m23s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:09:32 | 200 | 3m2s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:09:53 | 200 | 1m46s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:09:54 | 200 | 1m47s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:10:05 | 200 | 1m58s | 127.0.0.1 | POST "/v1/chat/completions"
[GIN] 2024/08/12 - 02:10:16 | 200 | 2m9s | 127.0.0.1 | POST "/v1/chat/completions"
@Minxiangliu Sorry to reply late. Did you solve your problem? If so, could you pls share the solution?
@fishfree Unfortunately, no. I can successfully build the knowledge graph using a small amount of document content. Additionally, I can also build the knowledge graph successfully with other models(Breeze-7B-Instruct-v1_0
). The dataset for this model includes many documents with the same font as mine.
Thank you for your tutorial. I understand that building a knowledge graph can take a lot of time, but I noticed that there seems to be a bottleneck during the process. The computer's CPU and GPU usage is almost idle. Is this normal? Are there any ways to improve efficiency?
Thanks in advance!
The execution time in the image is after I interrupted and re-executed it.
settings.yaml