Open michael-newsrx opened 1 month ago
Hey there! I think the param you want to populate is max_new_tokens
, not max_tokens
The warning seems like you're also passing a different input text than what's declared in your repro code.
@Jofthomas thoughts on the error handling here? Is the integration catching an error it shouldn't be?
Checked other resources
Example Code
Error Message and Stack Trace (if applicable)
The HuggingFaceEndpoint silently fails.
The API endpoint shows in the logs:
The error message also implies that
max_new_tokens
from the bind on the LLM is being ignored for actual API request calls.Description
I'm trying to use the langchain library to interface with Hugging Face Dedicated Endpoints.
System Info
System Information
Package Information
Other Dependencies