Open jankulma opened 1 year ago
it just contains the last attempt - this is a good catch. Will leave this open to track, but tbh not a super priority for me right now. The actual logic to add this would actually be in the underlying llm-api
package if you want to give it a shot.
Any workaround to get the full token consumption for now?
Does the
response.usage
contain sum of usage for all attempts, or just the last, successful attempt?