binary-husky / gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
https://github.com/binary-husky/gpt_academic/wiki/online
GNU General Public License v3.0
60.9k stars 7.61k forks source link

[Bug]: 使用gemini API载入历史存档后显示 `Resource has been exhausted (e.g. check quota).` #1832

Open RonghaiHe opened 1 month ago

RonghaiHe commented 1 month ago

Installation Method | 安装方法与平台

Pip Install (I used latest requirements.txt)

Version | 版本

Latest | 最新版

OS | 操作系统

Linux

Describe the bug | 简述

如题,用的是gemini的API,不载入历史存档能够顺利对话,载入历史存档后界面显示 Resource has been exhausted (e.g. check quota).。终端的显示见下方(路径信息中包含个人信息部分使用 xxx 代替)。 查询gemini API的配额(quota),并没有超出当日限制(只用了一两次,测试不载入历史存档能否顺利对话,验证了不是网络的问题)。

Screen Shot | 有帮助的截图

-

Terminal Traceback & Material to Help Reproduce Bugs | 终端traceback(如有) + 帮助我们复现的测试材料样本(如有)

Traceback (most recent call last): File "/xxx/miniconda3/envs/gptac/lib/python3.11/site-packages/gradio/routes.py", line 422, in run_predict output = await app.get_blocks().process_api( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/xxx/miniconda3/envs/gptac/lib/python3.11/site-packages/gradio/blocks.py", line 1323, in process_api result = await self.call_function( ^^^^^^^^^^^^^^^^^^^^^^^^^ File "/xxx/miniconda3/envs/gptac/lib/python3.11/site-packages/gradio/blocks.py", line 1067, in call_function prediction = await utils.async_iteration(iterator) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/xxx/miniconda3/envs/gptac/lib/python3.11/site-packages/gradio/utils.py", line 336, in async_iteration return await iterator.anext() ^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/xxx/miniconda3/envs/gptac/lib/python3.11/site-packages/gradio/utils.py", line 329, in anext return await anyio.to_thread.run_sync( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/xxx/miniconda3/envs/gptac/lib/python3.11/site-packages/anyio/to_thread.py", line 56, in run_sync return await get_async_backend().run_sync_in_worker_thread( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/xxx/miniconda3/envs/gptac/lib/python3.11/site-packages/anyio/_backends/_asyncio.py", line 2144, in run_sync_in_worker_thread return await future ^^^^^^^^^^^^ File "/xxx/miniconda3/envs/gptac/lib/python3.11/site-packages/anyio/_backends/_asyncio.py", line 851, in run result = context.run(func, args) ^^^^^^^^^^^^^^^^^^^^^^^^ File "/xxx/miniconda3/envs/gptac/lib/python3.11/site-packages/gradio/utils.py", line 312, in run_sync_iterator_async return next(iterator) ^^^^^^^^^^^^^^ File "/xxx/gpt_academic/toolbox.py", line 128, in decorated yield from f(txt_passon, llm_kwargs, plugin_kwargs, chatbot_with_cookie, history, system_prompt, args) File "/xxx/gpt_academic/request_llms/bridge_all.py", line 1053, in predict yield from method(inputs, llm_kwargs, plugin_kwargs, chatbot, history, system_prompt, stream, additional_fn) File "/xxx/gpt_academic/request_llms/bridge_google_gemini.py", line 108, in predict raise RuntimeError('对话错误') RuntimeError: 对话错误

RonghaiHe commented 1 month ago

btw,存档的对话历史是使用 3.74 版本的对话,现在载入用的是最新的 3.76 版本

RonghaiHe commented 1 month ago

感觉是一次性发送到llm,导致token数过大?因为调用了其他API,反映token数超过限制