Closed by vaahtio 1 month ago
Attention: Patch coverage is 0% with 51 lines in your changes missing coverage. Please review.
:white_check_mark: All tests successful. No failed tests found.
Files with missing lines | Patch % | Lines |
---|---|---|
flow_judge/integrations/langchain.py | 0.00% | 51 Missing :warning: |
Files with missing lines | Coverage Δ | |
---|---|---|
flow_judge/integrations/langchain.py | 0.00% <0.00%> (ø) |
LangChain Integration for Flow Judge
Summary
This PR introduces an integration between Flow Judge and LangChain, allowing users to leverage Flow Judge's custom metrics within LangChain workflows.
Key Changes
Created the FlowJudgeLangChainEvaluator class in the integrations folder (flow_judge/integrations/langchain.py), built on LangChain's StringEvaluator.
Added example notebook:
Testing
The integration has been manually tested:
FlowJudgeLangChainEvaluator with the models Flow-Judge-v0.1-AWQ, Flow-Judge-v0.1, Flow-Judge-v0.1_HF and Flow-Judge-v0.1-AWQ-Async. Note: async functionality is not demonstrated in the notebook, but it is part of the standard StringEvaluator class, so I added the option as well.
FlowJudgeLangChainEvaluator with a custom metric (in the notebook) and with built-in metrics (metric=RESPONSE_CORRECTNESS_BINARY, model=model).
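Since the coverage report flags the new module as untested, here is a rough sketch of how the adapter pattern could be exercised without loading a real model. It is not the code in this PR: StringEvaluatorSketch is a simplified stand-in for LangChain's StringEvaluator (public evaluate_strings delegating to _evaluate_strings), and StubJudge, JudgeOutput and SketchEvaluator are hypothetical names invented for illustration. The returned keys "score" and "reasoning" are an assumption about the result shape.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


class StringEvaluatorSketch:
    """Simplified stand-in for LangChain's StringEvaluator (assumption, not the real class)."""

    def evaluate_strings(
        self,
        *,
        prediction: str,
        reference: Optional[str] = None,
        input: Optional[str] = None,
        **kwargs: Any,
    ) -> Dict[str, Any]:
        # The public entry point delegates to the subclass hook.
        return self._evaluate_strings(
            prediction=prediction, reference=reference, input=input, **kwargs
        )

    def _evaluate_strings(self, *, prediction, reference=None, input=None, **kwargs):
        raise NotImplementedError


@dataclass
class JudgeOutput:
    """Hypothetical judge result: free-text feedback plus a numeric score."""

    feedback: str
    score: int


class StubJudge:
    """Hypothetical judge used in place of a real model; binary exact-match scoring."""

    def run(self, query: Optional[str], response: str, expected: Optional[str]) -> JudgeOutput:
        matches = (
            expected is not None
            and response.strip().lower() == expected.strip().lower()
        )
        feedback = "matches reference" if matches else "does not match reference"
        return JudgeOutput(feedback=feedback, score=int(matches))


class SketchEvaluator(StringEvaluatorSketch):
    """Hypothetical adapter: maps evaluator arguments onto the judge and back."""

    def __init__(self, judge: StubJudge) -> None:
        self.judge = judge

    def _evaluate_strings(self, *, prediction, reference=None, input=None, **kwargs):
        output = self.judge.run(query=input, response=prediction, expected=reference)
        return {"score": output.score, "reasoning": output.feedback}


evaluator = SketchEvaluator(StubJudge())
result = evaluator.evaluate_strings(
    prediction="Paris",
    reference="paris",
    input="What is the capital of France?",
)
print(result)
```

A test along these lines, with the stub swapped for whatever lightweight fake fits the real model interface, would let the argument mapping be covered in CI without GPU access.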