issues
search
confident-ai
/
deepeval
The LLM Evaluation Framework
https://docs.confident-ai.com/
Apache License 2.0
3.74k
stars
297
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Number of golden records in output is not the same as requested in input during synthesis
#1179
nalin-programmer
opened
6 hours ago
0
ISSUE 1173: dataset loading and saving
#1178
kritinv
opened
8 hours ago
1
fix issue #1174
#1177
kritinv
opened
9 hours ago
1
Prompt versioning from confident
#1176
penguine-ip
closed
13 hours ago
1
tutorial
#1175
kritinv
closed
17 hours ago
1
Is there an initialisation bug in synthesizer/chunking/context_generator.py for class ContextGenerator method generate_contexts() ?
#1174
CAW-nz
opened
1 day ago
1
Issue regarding consistent loading of source_files data (from goldens original values)
#1173
CAW-nz
opened
1 day ago
1
Corrections for some of the DeepEval documentation/help pages
#1172
CAW-nz
opened
1 day ago
0
Bug: dataset.py - All 4 Open statements in this file are lacking encoding="utf-8" argument
#1171
CAW-nz
opened
1 day ago
0
new release
#1170
penguine-ip
closed
2 days ago
1
Use Poetry build instead of setup.py
#1169
FrancoisMasson1990
opened
2 days ago
0
update guardrails
#1168
kritinv
closed
4 days ago
1
Revert "Replace context with retrieval_context in LLMTestCaseParams for HallucinationMetric"
#1167
penguine-ip
closed
5 days ago
1
fix tracing
#1166
kritinv
closed
5 days ago
1
fix batch processing for large documents
#1165
kritinv
closed
5 days ago
1
fix guardrails
#1164
kritinv
closed
5 days ago
1
How to integrate llama3 to deepeval?
#1163
drbhushan
opened
1 week ago
1
tutorial
#1162
kritinv
closed
17 hours ago
1
Replace context with retrieval_context in LLMTestCaseParams for HallucinationMetric
#1161
louisbrulenaudet
closed
5 days ago
2
Redteam Jailbreak Linear / Tree Fix
#1160
nabeel-chhatri
closed
1 week ago
1
added max concurrent
#1159
kritinv
closed
5 days ago
1
new release
#1158
penguine-ip
closed
1 week ago
1
new release
#1157
penguine-ip
closed
1 week ago
1
change track to monitor
#1156
kritinv
closed
1 week ago
1
New metric
#1155
penguine-ip
opened
1 week ago
1
new release
#1154
penguine-ip
closed
1 week ago
1
fix import require
#1153
penguine-ip
closed
1 week ago
1
pass test name to test result
#1152
AugmentMo
closed
1 week ago
2
a few tracing fixes
#1151
kritinv
closed
1 week ago
1
GEval not focusing on expected_output & Relying on OpenAI instead
#1149
pavan-growexxer
opened
1 week ago
2
red-teaming guide
#1148
kritinv
closed
1 week ago
1
Asynchronous test runs are sometimes not completed correctly
#1147
jmaczan
opened
1 week ago
12
Hallucination metric score is set to 0.0 in one run and 1.0 in another run, despite having the same input values
#1146
jmaczan
opened
1 week ago
1
Hallucination metric assigns only either 0.0 or 1.0 score
#1145
jmaczan
opened
1 week ago
1
guardrails
#1144
kritinv
closed
1 week ago
1
g.
#1143
penguine-ip
closed
2 weeks ago
1
Avoid asking for OPENAI_API_KEY when creating an empty EvaluationDataset
#1142
michieletto
closed
2 weeks ago
1
Fix link in getting-started.mdx
#1141
NimJay
closed
2 weeks ago
2
EvaluationDataset in deepeval suddenly requires an OpenAi key since update
#1140
kbarendrecht
closed
2 weeks ago
4
Error while calculating Knowledge retention ; Evaluation LLM outputted an invalid JSON. Please use a better evaluation model.
#1139
jaysudhakaran
opened
2 weeks ago
4
new release
#1138
penguine-ip
closed
2 weeks ago
1
Improving Answer Relevancy Template
#1137
dipanjanS
opened
2 weeks ago
4
observability guide
#1136
kritinv
closed
3 weeks ago
1
synthesizer docs breakdown
#1135
kritinv
closed
3 weeks ago
1
remove extraction limit print
#1134
penguine-ip
closed
3 weeks ago
1
reformat
#1133
penguine-ip
closed
3 weeks ago
1
OpenAI Rate Limit Exceeds
#1132
Miriam2040
closed
3 weeks ago
1
Support utf8
#1131
kinga-marszalkowska
closed
3 weeks ago
1
faithfulness print extraction_limit for each test case
#1130
rjiangnju
closed
3 weeks ago
1
GEval docs: `strict` -> `strict_mode`
#1129
chkimes
closed
3 weeks ago
2
Next