gcui-art / album-ai

AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery.
http://album.gcui.ai
Apache License 2.0
801 stars 79 forks source link
ai album gallery gpt-4o-mini haiku image llm rag

Album AI

AI-First Album, Chat with your gallery/album using plain language!

👉 We update frequently, feel free to star us.

English | 简体中文 | Demo | Discord

Album AI - AI-First Album - Chat with your gallery using plain language! | Product Hunt

https://github.com/user-attachments/assets/5cc72436-2749-479f-a1bf-06e0d06ce1e3

Introduction

Album AI is an experimental project that uses the recently released gpt-4o-mini and Haiku as a visual model to automatically identify metadata from image files in the album. It then leverages RAG technology to enable conversations with the album.

It can be used as a traditional photo album or as an image knowledge base to assist LLM in content generation.

Story

As a photography enthusiast facing terabytes of photos, I often felt overwhelmed. All existing photo management software required extra effort to maintain. Haiku and the newly released gpt-4o-mini gave me hope. So I decided to implement it immediately. My partner and I created the first version in less than 24 hours.

We hope you'll like it too. We welcome any praise or criticism. Don't forget to give us a ⭐️ or share to let more people know about it.

Live Demo

album.gcui.ai

Features

How to start using?

Recommended to run locally, if you want to run on a server, please deploy yourself, and we will improve this part of the guide.

1. Clone the project

git clone git@github.com:gcui-art/album-ai.git
cd album-ai

2. Modify the .env

cp .env.prod.example .env.prod

Open .env.prod with your favorite editor, modify the configuration:

HOST_NAME= # Your local IP address, usually 192.168.x.x:8080
PROXY_URL= # (Optional) Your local proxy IP address, usually 192.168.x.x:7890, required when accessing OpenAI API directly is not available

OPENAI_API_KEY= # Your openai api key
ANTHROPIC_API_KEY= # Your Anthropic api key 

3. Build and run the project

chmod a+x ./build.sh
./build.sh

4. Enjoy!

Open the browser and visit http://localhost:8080 to see the demo.

5. Add new photos

Open the images directory in the project, add new photos to the images directory, and the background will automatically recognize and vectorize metadata. After that, you can use it in the demo through search and chat.

API Reference

Album AI currently implements the following APIs:

Contribution

There are four ways to support this project:

  1. Fork the project and submit a PR: We welcome any PR to make Album AI better.
  2. Submit an Issue: We welcome any reasonable suggestions or bug reports.
  3. Recommend: Recommend the project to others; click Star; place a link to the project after using it.

License

Apache 2.0 License

Do You have a question/suggestion/issue/Bug?

We use Github's Issue to manage these feedbacks, you can submit one. We will often deal with them.

Related links

Disclaimer

If you want to use it for commercial purposes, please contact us.

Star History

Star History Chart