confident-ai deepeval issues

confident-ai / deepeval

The LLM Evaluation Framework

https://docs.confident-ai.com/

Apache License 2.0

3.74k stars 297 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Number of golden records in output is not the same as requested in input during synthesis

#1179 nalin-programmer opened 6 hours ago
0
ISSUE 1173: dataset loading and saving

#1178 kritinv opened 8 hours ago
1
fix issue #1174

#1177 kritinv opened 9 hours ago
1
Prompt versioning from confident

#1176 penguine-ip closed 13 hours ago
1
tutorial

#1175 kritinv closed 17 hours ago
1
Is there an initialisation bug in synthesizer/chunking/context_generator.py for class ContextGenerator method generate_contexts() ?

#1174 CAW-nz opened 1 day ago
1
Issue regarding consistent loading of source_files data (from goldens original values)

#1173 CAW-nz opened 1 day ago
1
Corrections for some of the DeepEval documentation/help pages

#1172 CAW-nz opened 1 day ago
0
Bug: dataset.py - All 4 Open statements in this file are lacking encoding="utf-8" argument

#1171 CAW-nz opened 1 day ago
0
new release

#1170 penguine-ip closed 2 days ago
1
Use Poetry build instead of setup.py

#1169 FrancoisMasson1990 opened 2 days ago
0
update guardrails

#1168 kritinv closed 4 days ago
1
Revert "Replace context with retrieval_context in LLMTestCaseParams for HallucinationMetric"

#1167 penguine-ip closed 5 days ago
1
fix tracing

#1166 kritinv closed 5 days ago
1
fix batch processing for large documents

#1165 kritinv closed 5 days ago
1
fix guardrails

#1164 kritinv closed 5 days ago
1
How to integrate llama3 to deepeval?

#1163 drbhushan opened 1 week ago
1
tutorial

#1162 kritinv closed 17 hours ago
1
Replace context with retrieval_context in LLMTestCaseParams for HallucinationMetric

#1161 louisbrulenaudet closed 5 days ago
2
Redteam Jailbreak Linear / Tree Fix

#1160 nabeel-chhatri closed 1 week ago
1
added max concurrent

#1159 kritinv closed 5 days ago
1
new release

#1158 penguine-ip closed 1 week ago
1
new release

#1157 penguine-ip closed 1 week ago
1
change track to monitor

#1156 kritinv closed 1 week ago
1
New metric

#1155 penguine-ip opened 1 week ago
1
new release

#1154 penguine-ip closed 1 week ago
1
fix import require

#1153 penguine-ip closed 1 week ago
1
pass test name to test result

#1152 AugmentMo closed 1 week ago
2
a few tracing fixes

#1151 kritinv closed 1 week ago
1
GEval not focusing on expected_output & Relying on OpenAI instead

#1149 pavan-growexxer opened 1 week ago
2
red-teaming guide

#1148 kritinv closed 1 week ago
1
Asynchronous test runs are sometimes not completed correctly

#1147 jmaczan opened 1 week ago
12
Hallucination metric score is set to 0.0 in one run and 1.0 in another run, despite having the same input values

#1146 jmaczan opened 1 week ago
1
Hallucination metric assigns only either 0.0 or 1.0 score

#1145 jmaczan opened 1 week ago
1
guardrails

#1144 kritinv closed 1 week ago
1
g.

#1143 penguine-ip closed 2 weeks ago
1
Avoid asking for OPENAI_API_KEY when creating an empty EvaluationDataset

#1142 michieletto closed 2 weeks ago
1
Fix link in getting-started.mdx

#1141 NimJay closed 2 weeks ago
2
EvaluationDataset in deepeval suddenly requires an OpenAi key since update

#1140 kbarendrecht closed 2 weeks ago
4
Error while calculating Knowledge retention ; Evaluation LLM outputted an invalid JSON. Please use a better evaluation model.

#1139 jaysudhakaran opened 2 weeks ago
4
new release

#1138 penguine-ip closed 2 weeks ago
1
Improving Answer Relevancy Template

#1137 dipanjanS opened 2 weeks ago
4
observability guide

#1136 kritinv closed 3 weeks ago
1
synthesizer docs breakdown

#1135 kritinv closed 3 weeks ago
1
remove extraction limit print

#1134 penguine-ip closed 3 weeks ago
1
reformat

#1133 penguine-ip closed 3 weeks ago
1
OpenAI Rate Limit Exceeds

#1132 Miriam2040 closed 3 weeks ago
1
Support utf8

#1131 kinga-marszalkowska closed 3 weeks ago
1
faithfulness print extraction_limit for each test case

#1130 rjiangnju closed 3 weeks ago
1
GEval docs: `strict` -> `strict_mode`

#1129 chkimes closed 3 weeks ago
2