Closed grzegorz-roboflow closed 2 months ago
Ok, this is finally consistent
@PawelPeczek-Roboflow, sorry I didn't get to integration tests in time, and I'm out for a week starting tomorrow.
@hansent, this is looking good to go locally.
We can probably remove the paligemma Dockerfile too
Description
Update based on progress made by @PawelPeczek-Roboflow
cu118 - and got rid of all attempts to install / uninstall / re-install cudnn, which turned out not to be a problem after the switch
requirements/requirements.pali.local.txt and put all requirements into requirements/requirements.pali.server.txt, as this model only runs on the server
v0.13.0 and current: transformers>=4.42.0 breaks CogVLM!
flash_attn needed to be in a separate requirements file, as it requires other dependencies to be already installed, rather than installed alongside this package in a single pip install
runtime image - but then for flash_attn we need the nvcc compiler, which comes with the CUDA toolkit - once installed, the dependency exploded to nearly 30GB - bigger than starting from devel
devel image as starting point, then switching to runtime - but I could only save 5GB from 19.4GB and, despite things building, it was complaining about a tensorrt so library - which was not present in devel when recursively searched for - so it was probably built during the flash_attn build - not sure where it sits, and I could not copy it and place it properly into the target image
inference and inference-gpu on each commit to PR branch to main
main
generate() - not sure if that's due to a specific transformers version, but the code is compliant in behaviour with this: https://huggingface.co/microsoft/Florence-2-large
This is not done, but @probicheaux please update docs for using paligemma and florence - when we have a release, the release notes would have the relevant info
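To illustrate the flash_attn point above (it has to be installed after the other dependencies, not alongside them in a single pip install), here is a minimal sketch of the install ordering. It is not the build logic from this PR: the helper names are made up for illustration, and `requirements/requirements.flash_attn.txt` is a hypothetical file name, since the actual name of the separate requirements file is not stated here.

```python
import subprocess
import sys
from typing import List, Sequence


def build_install_commands(
    base_requirements: Sequence[str],
    late_requirements: Sequence[str],
) -> List[List[str]]:
    """Return pip invocations in the order they should run.

    All base files go into one `pip install` call; every late file gets
    its own call afterwards, so its build can see the packages that are
    already installed.
    """
    commands: List[List[str]] = []
    if base_requirements:
        cmd = [sys.executable, "-m", "pip", "install"]
        for path in base_requirements:
            cmd += ["-r", path]
        commands.append(cmd)
    for path in late_requirements:
        commands.append([sys.executable, "-m", "pip", "install", "-r", path])
    return commands


def install(commands: Sequence[Sequence[str]], dry_run: bool = True) -> None:
    """Print the commands (dry run) or execute them one after another."""
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))
        else:
            # check=True stops at the first failing step.
            subprocess.run(list(cmd), check=True)


COMMANDS = build_install_commands(
    ["requirements/requirements.pali.server.txt"],
    ["requirements/requirements.flash_attn.txt"],  # hypothetical file name
)

if __name__ == "__main__":
    install(COMMANDS, dry_run=True)
```

In a Dockerfile the same ordering would just be two separate `pip install -r ...` steps; the point is only that the second step starts after the first has finished.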
List any dependencies that are required for this change.
Type of change
Please delete options that are not relevant.
How has this change been tested, please provide a testcase or example of how you tested the change?
Any specific deployment considerations
For example, documentation changes, usability, usage/costs, secrets, etc.
Docs