PaddlePaddle / PaddleNLP

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
https://paddlenlp.readthedocs.io
Apache License 2.0
12.13k stars 2.94k forks

[Bug]: Multimodal feature extraction on Windows: text feature extraction raises an error, but image feature extraction works #6428

Open 0x376h opened 1 year ago

0x376h commented 1 year ago

Software environment

- paddlepaddle:2.5.0
- paddlepaddle-gpu: 
- paddlenlp:2.5.2
- OS:win10
- python:3.7

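Duplicate issues

Error description

With multimodal feature extraction on Windows, extracting text features raises an error, while extracting image features works.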

Steps to reproduce & code

    from io import BytesIO
    import requests
    from PIL import Image
    from paddlenlp import Taskflow

    vision_language = Taskflow("feature_extraction", model='PaddlePaddle/ernie_vil-2.0-base-zh')
    url = "https://inews.gtimg.com/news_ls/OKRw5DwlX_6iHSr3f6YggdiI2L027KGb9-FoRMVGiHzroAA_640330/0"
    response = requests.get(url)
    x = BytesIO(response.content)
    f_embeds = vision_language(Image.open(x))  # image features are extracted normally
    text_embeds = vision_language("猫的照片")  # "a photo of a cat"; this call raises the error below

    InvalidArgumentError: The type of data we are trying to retrieve (int64) does not match the type of data (int32) currently contained in the container.

There is no problem on Linux; the error occurs only on Windows.

ChengLiKe commented 11 months ago

This still has not been fixed.