issues
search
yujonglee
/
eval
Evaluate your LLM apps, RAG pipeline, any generated text, and more!
MIT License
0
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Bump litellm from 0.1.583 to 0.1.590
#105
dependabot[bot]
closed
1 year ago
1
Bump datasets from 2.14.4 to 2.14.5
#104
dependabot[bot]
closed
1 year ago
1
Meta-eval should also consider None prediction
#103
yujonglee
opened
1 year ago
0
Make sure our built-in evaluator works properly
#102
yujonglee
closed
1 year ago
7
Example is missing for len(pipeline) == 2
#101
yujonglee
closed
1 year ago
1
Use chat-bison for our CI
#100
yujonglee
opened
1 year ago
0
Better text formatting on thought generation
#99
yujonglee
opened
1 year ago
0
litellm.gpt_cache should be disabled for num>1 in run call
#98
yujonglee
closed
1 year ago
3
Move litellm.api_base
#97
yujonglee
closed
1 year ago
3
Add EvalResult type
#96
yujonglee
opened
1 year ago
0
Better None handling in Kappa calculation
#95
yujonglee
opened
1 year ago
1
Support num > 2 in runner
#94
yujonglee
closed
1 year ago
0
Support num=2 to check consistency in runner
#93
yujonglee
closed
1 year ago
2
How is the tokenizer used?
#92
krrishdholakia
closed
1 year ago
3
Experimental eval result reporting
#91
yujonglee
closed
1 year ago
1
First public docs in place
#90
yujonglee
closed
1 year ago
8
Fix max_token=2 for togetherAI
#89
yujonglee
opened
1 year ago
1
Add Auto-fixing parser
#88
yujonglee
opened
1 year ago
0
Add more tests for our eval
#87
yujonglee
closed
1 year ago
4
Better position debias strategy
#86
yujonglee
closed
1 year ago
1
Better number handling in grading
#85
yujonglee
closed
1 year ago
3
Bump litellm from 0.1.516 to 0.1.525
#84
dependabot[bot]
closed
1 year ago
0
Bump pre-commit from 3.3.3 to 3.4.0
#83
dependabot[bot]
closed
1 year ago
0
Bump ipykernel from 6.25.1 to 6.25.2
#82
dependabot[bot]
closed
1 year ago
0
Bump pytest from 7.4.0 to 7.4.1
#81
dependabot[bot]
closed
1 year ago
0
Let's eval our eval
#80
yujonglee
closed
1 year ago
1
Fix preview doc deploy
#79
yujonglee
closed
1 year ago
1
Add COTHead
#78
yujonglee
closed
1 year ago
4
Implement custom warning and error handling
#77
yujonglee
closed
1 year ago
3
Add pre-commit hook
#76
yujonglee
closed
1 year ago
1
Add warning mapping in docs
#75
yujonglee
closed
1 year ago
0
API Proxy strategy
#74
yujonglee
closed
1 year ago
2
Fix the problem where the api_base for the proxy is not set
#73
yujonglee
closed
1 year ago
2
Add proxy setting for LLM specific test runner
#72
yujonglee
closed
1 year ago
0
Configure debias strategy for each LLM
#71
yujonglee
opened
1 year ago
0
Add auto fallback to longer context model
#70
yujonglee
closed
1 year ago
0
Update docutils requirement from <0.17 to <0.21
#69
dependabot[bot]
closed
1 year ago
3
Bump textual from 0.32.0 to 0.35.1
#68
dependabot[bot]
closed
1 year ago
1
Bump nbconvert from 7.7.4 to 7.8.0
#67
dependabot[bot]
closed
1 year ago
1
Bump transformers from 4.32.0 to 4.32.1
#66
dependabot[bot]
closed
1 year ago
1
Bump litellm from 0.1.504 to 0.1.509
#65
dependabot[bot]
closed
1 year ago
1
Bump openai from 0.27.9 to 0.27.10
#64
dependabot[bot]
closed
1 year ago
1
Abstraction: EvaluationHead
#63
yujonglee
closed
1 year ago
0
Implement browser based human-eval
#62
yujonglee
opened
1 year ago
0
Use model alias
#61
yujonglee
closed
1 year ago
2
Add debugging using DEBUG context variable
#60
yujonglee
closed
1 year ago
0
Update LiteLLM to 0.1.504
#59
yujonglee
closed
1 year ago
0
Use ContextWindowExceededError
#58
yujonglee
closed
1 year ago
1
Initial implementation of position consensus
#57
yujonglee
closed
1 year ago
0
Make classification fallback policy configurable
#56
yujonglee
closed
1 year ago
1
Previous
Next