基于UIE-BASE模型定制训练后，使用定制训练的模型进行预测任务，每一次预测完后的显存不释放，导致随着预测次数增加，显存直接爆掉

PaddlePaddle / PaddleNLP

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.

https://paddlenlp.readthedocs.io

Apache License 2.0

12.17k stars 2.95k forks source link

基于UIE-BASE模型定制训练后，使用定制训练的模型进行预测任务，每一次预测完后的显存不释放，导致随着预测次数增加，显存直接爆掉 #6306

Open Eavinn opened 1 year ago

Eavinn commented 1 year ago

[Question]: 按照 https://github.com/PaddlePaddle/PaddleNLP/tree/develop/model_zoo/uie 教程基于uie-base模型做训练，训练完成之后模型保存在MODEL_PATH，使用 Taskflow('information_extraction', schema=self.schema, task_path=MODEL_PATH)去做预测任务，每预测一次GPU已用显存都会增大500M左右(模型大约500M)，直到最后显存不足。

后尝试提前使用paddle.set_flags()加入FLAGS_eager_delete_tensor_gb、FLAGS_memory_fraction_of_eager_deletion、FLAGS_fast_eager_deletion_mode、FLAGS_fraction_of_gpu_memory_to_use、FLAGS_use_cuda_managed_memory全局变量设置，没有任何效果。

使用版本： paddlepaddle-gpu 2.4.2.post117 paddlenlp 2.5.2

附截图：