Closed oplatek closed 1 week ago
@oplatek Thanks, looks useful!
Given that we allow to annotate any span categories, not just errors, can we call it category
or category_index
instead of error_type
?
And will you be able to write the wiki page for VLLM?
Given that we allow to annotate any span categories, not just errors, can we call it category or category_index instead of error_type?
Good point! Will fix it
And will you be able to write the wiki page for VLLM?
Yes, I will reference what I did for using VLLM on UFAL GPUs. I hope it will work for most people (I have very limited experience with vLLM but their documentation looks great)
What?
Introduces generation with structured outputs using OpenAI client for OpenAI API and VLLM
Defines JSON schema for ErrorSpanAnnotations as
Why?
Limitations
Notice that the Annotation has the attribute
error_type
instead oftype
as previously because for JSON schema thetype
is a reserved word. Just after parsing this PR creates the dictionary with keytype
instead oferror_type
soerror_type
is used for LLM calls and it's parsing.Limited testing: I tested all three LLM-eval configs (OpenAI, Ollama, VLLM)
The wiki page does not cover vLLM installation and vLLM requires GPU.