mayooear / gpt4-pdf-chatbot-langchain

GPT4 & LangChain Chatbot for large PDF docs
https://www.youtube.com/watch?v=ih9PBGVVOO4
14.95k stars 3.02k forks source link

PDF file ingestion always results in garbled characters. #248

Closed victor-defi closed 1 year ago

victor-defi commented 1 year ago
CleanShot 2023-05-05 at 01 53 29@2x

I have experimented many times, but there are always garbled errors. I have already adjusted to UTF-8, and both Chinese and English will encounter such errors.

dosubot[bot] commented 1 year ago

Hi, @victor-defi! I'm Dosu, and I'm here to help the gpt4-pdf-chatbot-langchain team manage their backlog. I wanted to let you know that we are marking this issue as stale.

From what I understand, you reported an issue where the characters in the ingested PDF file are garbled, even after adjusting to UTF-8 encoding. However, there hasn't been any activity or comments on the issue since you reported it.

Before we close this issue, we wanted to check with you if it is still relevant to the latest version of the gpt4-pdf-chatbot-langchain repository. If it is, please let us know by commenting on the issue. Otherwise, feel free to close the issue yourself, or it will be automatically closed in 7 days.

Thank you for your understanding and contribution to the project. If you have any further questions or concerns, please don't hesitate to reach out.