Note: This is the codebase to inference and train UGround-v1. It is modified from haotian-liu/LLaVA, with changes mainly on image processing. We may use other codebases or architectures for later versions.
conda create -n llava_uground python=3.11 -y
conda activate llava_uground
pip install --upgrade pip # enable PEP 660 support
There are several ways to install the package:
# Install from Github
pip install git+https://github.com/boyugou/llava_uground.git
# Install locally
git clone https://github.com/boyugou/llava_uground.git
pip install -e .
# Install from pypi (Not uploaded yet)
pip install to_be_uploaded
python single_infer.py # single request
python batch_infer.py # for inference of a jsonl