Azure / gen-cv

Vision AI Solution Accelerator
MIT License
408 stars 238 forks source link
azure-computer-vision cognitive-search-vector-store dalle-3 embeddings florence foundation-models generative-computer-vision image-search stable-diffusion

Vision AI Solution Accelerator

drawing

This repository serves as a rich resource offering numerous examples of synthetic image generation, manipulation, and reasoning. Utilizing Azure Machine Learning, Computer Vision, OpenAI, and widely acclaimed open-source frameworks like Stable Diffusion, it equips users with practical insights into the application of these powerful tools in the realm of image processing.

Content

Getting Started

The code within this repository has been tested on both Github Codespaces compute and an Azure Machine Learning Compute Instance. Although the use of a GPU is not a requirement, it is highly recommended if you aim to generate a large number of sample images using Stable Diffusion.

Follow these steps to get started:

  1. Clone this repository on your preferred compute using the following command:

    git clone https://github.com/Azure/gen-cv.git
  2. Create your Python environment and install the necessary dependencies. For our development, we utilized Conda. You can do the same with these commands:

conda create -n gen-cv python=3.10
conda activate gen-cv
pip install -r requirements.txt
  1. From the list provided above, select a sample notebook. After making your selection, configure the Jupyter notebook to use the kernel associated with the environment you set up in Step 2.
  2. Copy the .env.template file to .env to store your parameters:
    cp .env.template .env
  3. Add the required parameters and keys for your services to the .env file.

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.