Open arun-gupta opened 3 months ago
Can we rename this issue to "ChatQnA - GCP single node via Terraform"? @mkbhanda, per email. We will then create new issues for other examples, such as CodeGen.
FYI, I started developing this, but I'm blocked by a Compose issue documented here: https://github.com/opea-project/GenAIExamples/issues/704
@lucasmelogithub Where are we at with this issue?
It's built: both ChatQnA and CodeGen run on 5th Gen Xeon in GCP. I will create a PR to update the Xeon README.md files with links. We can discuss on the PR whether that is the right strategy.
Terraform Modules:
https://github.com/intel/terraform-intel-gcp-vm/tree/main/examples/gen-ai-xeon-opea-chatqna
https://github.com/intel/terraform-intel-gcp-vm/tree/main/examples/gen-ai-xeon-opea-codegen
Ansible Playbooks:
https://github.com/intel/optimized-cloud-recipes/tree/main/recipes/ai-opea-chatqna-xeon
https://github.com/intel/optimized-cloud-recipes/tree/main/recipes/ai-opea-codegen-xeon
Sounds good
Create a Terraform stack that deploys GenAI examples on Google Cloud using single-node Docker Compose. The template should take an argument specifying which sample to deploy, set up all the necessary Google Cloud resources, and then output a publicly accessible URL where the example can be tested.
The stack should be modular such that other samples can be added later on with minimal effort.
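One way the "sample argument" and the modularity requirement could fit together is sketched below. This is only an illustration, not the design of the linked Intel modules: the variable `sample`, the locals `samples` and `selected`, and the output `example_dir` are hypothetical names, and the VM, firewall, and startup-script resources are deliberately left out. The only values taken from the GenAIExamples repository are the top-level directory names `ChatQnA` and `CodeGen`.

```hcl
# Hypothetical sketch: variable, local, and output names are illustrative only.

variable "sample" {
  description = "Which GenAI example to deploy."
  type        = string
  default     = "ChatQnA"

  validation {
    # Literal list: on older Terraform versions a validation condition may only
    # reference the variable itself, so this mirrors the keys of local.samples.
    condition     = contains(["ChatQnA", "CodeGen"], var.sample)
    error_message = "The sample must be one of: ChatQnA, CodeGen."
  }
}

locals {
  # One entry per supported sample. Adding a sample means adding an entry here
  # and extending the validation list above.
  samples = {
    ChatQnA = { example_dir = "ChatQnA" }
    CodeGen = { example_dir = "CodeGen" }
  }

  selected = local.samples[var.sample]
}

output "example_dir" {
  description = "Directory in GenAIExamples that holds the selected sample."
  value       = local.selected.example_dir
}
```

With a layout like this, a different sample would be selected at apply time with `terraform apply -var="sample=CodeGen"`, and per-sample settings (ports, compose file location, machine size) could be added as further attributes on each map entry.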
Detailed instructions for each sample are available at https://github.com/opea-project/GenAIExamples/tree/main?tab=readme-ov-file
This is the Google Cloud-specific issue under the mega issue at https://github.com/opea-project/GenAIExamples/issues/427.