with DedicatedLLM(client=cortecs, model_name='Qwen/Qwen2-7B-Instruct', instance_type='NVIDIA_L40S_1') as llm:
    essay = llm.invoke('Write an essay about dynamic provisioning')
    print(essay.content)
When using
the instance_type is ignored and the recommended instance type is used.