awslabs / llm-hosting-container

Large Language Model Hosting Container
Apache License 2.0
75 stars 32 forks source link

Llama2 #19

Closed philschmid closed 1 year ago

philschmid commented 1 year ago

Issue #, if available:

Description of changes:

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

nth-attempt commented 1 year ago

Thanks @philschmid for this PR. @xyang16 will this image be available in ECR as well? thanks!

xyang16 commented 1 year ago

@nth-attempt Yes it will be available in ECR once it's released.

monuminu commented 1 year ago

waiting for it soon :)

ashivadi commented 1 year ago

Need to update the dockerfile to have proper path to the files in the text-generation-inference repo. Eg.

COPY --chmod=775 ./text-generation-inference/sagemaker-entrypoint.sh entrypoint.sh

instead of this currently here

COPY --chmod=775 sagemaker-entrypoint.sh entrypoint.sh

assuming dir structure looks like this:

$tree 
.
├── llm-hosting-container
└── text-generation-inference
xyang16 commented 1 year ago

@ashivadi

The directory structure is like this:

$tree 
.
├── text-generation-inference
└── ── llm-hosting-container

Working directory is text-generation-inference

ashivadi commented 1 year ago

@ashivadi

The directory structure is like this:

$tree 
.
├── text-generation-inference
└── ── llm-hosting-container

Working directory is text-generation-inference

Thanks, updated here: https://github.com/ashivadi/llama2-with-tgi-sagemaker

nth-attempt commented 1 year ago

Is the image available in ECR yet?

xyang16 commented 1 year ago

@nth-attempt The target release date is by Monday next week.

nth-attempt commented 1 year ago

Thank you!! https://github.com/aws/deep-learning-containers/blob/master/available_images.md#huggingface-text-generation-inference-containers