seervideodiffusion / SeerVideoLDM

[ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models
16 stars 2 forks source link

environment setup error and runtime error -- solution #7

Open Cuberick-Orion opened 2 weeks ago

Cuberick-Orion commented 2 weeks ago

Hi,

Thank you (the authors) for this very impressive work.

Below are a few notes that I've made while setting up the environment, in case if anyone ran into similar issues with the setup.

I am using the official pytorch docker pytorch/pytorch:1.12.1-cuda11.3-cudnn8-devel, note it comes with python 3.7, some packages (e.g., matplotlib) will complain with version mismatch, just remove the version specified in the requirement file, as the author states that they do not necessarily have to be bound to the specific versions).

A complete Dockerfile and requirement file is attached at the bottom. Below are some changes that I've made.


Reproduce with:

Dockerfile

FROM pytorch/pytorch:1.12.1-cuda11.3-cudnn8-devel
# FROM nvidia/cuda:11.3.1-devel-ubuntu20.04

RUN apt-get update && apt-get install -y ffmpeg libsm6 libxext6 libglib2.0-0 git ca-certificates && apt-get clean
RUN nvcc --version

# install mmcv
RUN pip install openmim && mim install mmcv-full==1.7.0

COPY requirements.txt /tmp/
RUN pip install -r /tmp/requirements.txt && pip install --upgrade google-cloud-storage

modified requirement.txt

pydantic==1.9.0
accelerate==0.15.0
boto3==1.26.93
botocore==1.29.94
chainer==7.8.1
clip==0.2.0
cupy-cuda111==8.1.0
deepspeed==0.7.7
diffusers==0.10.2
einops==0.6.0
ftfy==6.1.1
gdown==4.6.0
gifmaker==1.5
huggingface-hub==0.11.1
imageio==2.25.0
matplotlib
mmdet==2.25.3
multiprocess==0.70.14
numpy
omegaconf==2.3.0
open-clip-torch==2.16.0
opencv-python==4.7.0.68
pandas
Pillow
pyparsing==3.0.9
rotary-embedding-torch==0.1.5
scikit-learn
scipy
six
taming-transformers==0.0.1
tensorboard==2.11.0
timm==0.6.12
tokenizers==0.13.2
tqdm
transformers==4.25.1
triton==2.1.0
xformers=0.0.12