CatIIIIIIII / RNAErnie

Official implement of paper "Multi-purpose RNA Language Modeling with Motif-aware Pre-training and Type-guided Fine-tuning"
MIT License
18 stars 5 forks source link

关于import paddle失败的 #3

Closed zhendejuzi closed 19 hours ago

zhendejuzi commented 3 days ago

你好,我利用conda env create -f environment.yml进行安装。 但是在测试的时候,报错:

Python 3.8.0 (default, Nov 6 2019, 21:49:08) [GCC 7.3.0] :: Anaconda, Inc. on linux Type "help", "copyright", "credits" or "license" for more information.

import paddle Error: Can not import avx core while this file exists: /data/wsj/software/anaconda3/envs/RNAErnie/lib/python3.8/site-packages/paddle/fluid/core_avx.so Traceback (most recent call last): File "", line 1, in File "/data/wsj/software/anaconda3/envs/RNAErnie/lib/python3.8/site-packages/paddle/init.py", line 25, in from .framework import monkey_patch_variable File "/data/wsj/software/anaconda3/envs/RNAErnie/lib/python3.8/site-packages/paddle/framework/init.py", line 17, in from . import random # noqa: F401 File "/data/wsj/software/anaconda3/envs/RNAErnie/lib/python3.8/site-packages/paddle/framework/random.py", line 16, in import paddle.fluid as fluid File "/data/wsj/software/anaconda3/envs/RNAErnie/lib/python3.8/site-packages/paddle/fluid/init.py", line 36, in from . import framework File "/data/wsj/software/anaconda3/envs/RNAErnie/lib/python3.8/site-packages/paddle/fluid/framework.py", line 37, in from . import core File "/data/wsj/software/anaconda3/envs/RNAErnie/lib/python3.8/site-packages/paddle/fluid/core.py", line 298, in raise e File "/data/wsj/software/anaconda3/envs/RNAErnie/lib/python3.8/site-packages/paddle/fluid/core.py", line 256, in from . import core_avx ImportError: /data/wsj/software/anaconda3/envs/RNAErnie/lib/python3.8/site-packages/paddle/fluid/core_avx.so: undefined symbol: _dl_sym, version GLIBC_PRIVATE

我的服务器版本: PRETTY_NAME="Ubuntu 22.04.3 LTS" NAME="Ubuntu" VERSION_ID="22.04" VERSION="22.04.3 LTS (Jammy Jellyfish)"

该怎么解决?

CatIIIIIIII commented 2 days ago

你好!

这看起来是gcc的版本问题,可以尝试安装paddlepaddle==2.4

https://github.com/PaddlePaddle/Paddle/issues/44571

CatIIIIIIII commented 2 days ago

Considering most of the researchers will prefer to use transformers and pytorch as backend. So, I transfer my work to transformers and train a pytorch model from scratch. The new model is trained with more powerful settings: The max model length is up to 2048 now and the pretraining dataset is the newest version of rnacentral, which contains about 31 million RNA sequences after length filtering (<2048). This pytorch version model has been uploaded to huggingface at https://huggingface.co/WANGNingroci/RNAErnie and the training framework/tokenization is located at https://github.com/CatIIIIIIII/RNAErnie2. (NOTE: the tokenization is a little different from the original paddle implementation). Moreover, Multimolecule are implementing current most powerful RNA language model with transformers and pytorch. Our model also could be accessed at https://huggingface.co/multimolecule/rnaernie.