issues
search
AI-secure
/
DecodingTrust
A Comprehensive Assessment of Trustworthiness in GPT Models
https://decodingtrust.github.io/
Creative Commons Attribution Share Alike 4.0 International
253
stars
54
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Fix bugs and move templates to module level in privacy evaluation
#57
fjxmlzn
closed
2 weeks ago
0
There is loading issue in the leaderboard..
#56
zhimin-z
opened
3 months ago
0
Field name in the data loader of perspective_api_annotate.py
#55
AntiQuality
opened
3 months ago
0
DecodingTrust/src/dt/perspectives/fairness/fairness_evaluation.py
#54
dongjiancheng77
opened
3 months ago
0
Release
#53
garyxcj
closed
5 months ago
0
Release
#52
garyxcj
closed
5 months ago
0
CI/CD for Docker
#51
danielz02
closed
5 months ago
0
Analysis request for blog finding that "GPT-4 is more vulnerable than GPT-3.5"
#50
crizCraig
opened
5 months ago
1
When should we set `example_prefix` to be True? And what is the difference between put ICL examples into system prompt versus multi-turn user-assistant chat?
#49
peter-peng-w
opened
6 months ago
1
fix: evaluation dir; large num generations
#48
AlphaPav
closed
6 months ago
0
fix removesuffix
#47
AlphaPav
closed
6 months ago
0
update score calculation
#46
AlphaPav
closed
6 months ago
0
fix privacy jb; num of generations
#45
AlphaPav
opened
6 months ago
1
fix privacy jb; num of generations
#44
AlphaPav
closed
6 months ago
0
Hf chat template
#43
garyxcj
closed
7 months ago
0
MissingMandatoryValue
#42
richhh520
opened
7 months ago
5
Merge pull request #38 from AI-secure/scoring
#41
garyxcj
closed
7 months ago
0
update with harmfulness score reading
#40
YiZeng623
closed
7 months ago
0
Detailed harmfulness
#39
YiZeng623
closed
7 months ago
0
Add Full Score Breakdown
#38
danielz02
closed
7 months ago
0
Added harmfulness
#37
YiZeng623
closed
7 months ago
0
With harmfulness 11
#36
YiZeng623
closed
7 months ago
0
add jailbreak prompt and named entity recog
#35
AlphaPav
closed
7 months ago
0
Migrate hf dataset
#34
garyxcj
closed
7 months ago
0
Templates for Advglue
#33
dt-ahmed-touila
opened
7 months ago
0
Bug Fix & Better Score Aggregation Pipeline
#32
danielz02
closed
7 months ago
0
Update score computation for stereotype, fairness, ethics, and ood.
#31
jyhong836
closed
6 months ago
0
num_tokens_from_messages() is not implemented
#30
jinz2014
closed
8 months ago
0
conversation template for GPT-neo
#29
richhh520
opened
9 months ago
0
privacy_evaluation
#28
richhh520
closed
7 months ago
3
Assumption of Privacy Assessment on Llama
#27
richhh520
closed
9 months ago
2
Broken links in GitHub Pages
#26
ashwhall
closed
9 months ago
1
Update README.md
#25
AntiQuality
closed
9 months ago
1
Creating aggregate stereotype scores with missing files
#24
ritik99
closed
9 months ago
0
Division of zero
#23
jyhong836
opened
10 months ago
0
adding json files for privacy
#22
AlphaPav
closed
11 months ago
0
Aggregate results for ethics
#21
jyhong836
closed
8 months ago
3
Add refusal rate in calculate_score.py
#20
chenweixin107
closed
11 months ago
0
How to evaluate toxicity task on local hf-llama2-7B?
#19
AboveParadise
opened
11 months ago
6
Fairnsss Scoring Keywords & Max Tokens
#18
danielz02
closed
9 months ago
3
Remove Duplicate Code in Fairness
#17
danielz02
opened
11 months ago
1
Update calculate_score.py
#16
chenweixin107
closed
11 months ago
0
Modifying assert in stereotype score calculation
#15
ritik99
closed
11 months ago
0
Updating config accesses in stereotype code
#14
ritik99
closed
11 months ago
0
OpenAI API key should not be required
#13
danielz02
closed
11 months ago
1
Hydra override error when running evaluations after non-editable installation
#12
ziyic7
opened
11 months ago
14
Update machine ethics config files
#11
chenweixin107
closed
11 months ago
1
Update calculate_score.py
#10
chenweixin107
closed
11 months ago
1
Update test_machine_ethics.py
#9
chenweixin107
closed
11 months ago
0
Refactor stereotype for code quality improvement
#8
ritik99
closed
11 months ago
0
Next