[Bug]: windows下多模态特征提取，文本提取报错，但图片特征可以

软件环境

- paddlepaddle:2.5.0
- paddlepaddle-gpu: 
- paddlenlp:2.5.2
- OS：win10
- python:3.7

重复问题

[X] I have searched the existing issues

错误描述

windows下多模态特征提取，文本提取报错，但图片特征可以

稳定复现步骤 & 代码

vision_language=Taskflow("feature_extraction", model='PaddlePaddle/ernie_vil-2.0-base-zh') url="https://inews.gtimg.com/news_ls/OKRw5DwlX_6iHSr3f6YggdiI2L027KGb9-FoRMVGiHzroAA_640330/0" response = requests.get(url) x=BytesIO(response.content) f_embeds = vision_language(Image.open(x)) 可以正常提取特征 text_embeds = vision_language("猫的照片") 报错 InvalidArgumentError: The type of data we are trying to retrieve (int64) does not match the type of data (int32) currently contained in the container. linux没有问题，就在windows上出问题，

PaddlePaddle / PaddleNLP