JohnSnowLabs/langtest
Deliver safe & effective language models
http://langtest.org/
Apache License 2.0
468
stars
35
forks
issues
Fix/alignment issues in bias tests for ner task
#1060
chakravarthik27
closed
11 hours ago
0
Fix alignment issues in bias tests for NER Task
#1059
chakravarthik27
opened
2 days ago
0
Feature/implement the generic to brand drug name swapping tests and vice versa
#1058
chakravarthik27
opened
1 week ago
0
Implement the Generic to Brand Drug Name Swapping Tests and Vice-Versa
#1057
chakravarthik27
opened
1 week ago
0
Add Tests/Metrics for Multimodal models
#1056
dcecchini
opened
1 week ago
0
Feature/integrate prometheus model for enhanced evaluation
#1055
chakravarthik27
opened
2 weeks ago
0
Integrate Prometheus Model for Enhanced Evaluation
#1054
chakravarthik27
opened
2 weeks ago
0
Add support to the LLM eval class in Accuracy Category.
#1053
chakravarthik27
closed
2 weeks ago
0
Enhance `LLMEval` Class to Accept Model Parameters
#1052
chakravarthik27
opened
2 weeks ago
0
adding workflow for github pages
#1051
chakravarthik27
closed
2 weeks ago
0
chore: Update GitHub Pages workflow for Jekyll site deployment
#1050
chakravarthik27
closed
2 weeks ago
0
Migrate/GitHub actions from GitHub pages
#1049
chakravarthik27
closed
2 weeks ago
0
Migrate to GitHub Actions from Pages Legacy Worker
#1048
chakravarthik27
closed
2 weeks ago
0
chore: update dependencies
#1047
chakravarthik27
opened
2 weeks ago
0
Bump mlflow from 2.11.0 to 2.13.2
#1046
dependabot[bot]
closed
2 weeks ago
1
Fix Security and Dependency Issues
#1045
chakravarthik27
opened
2 weeks ago
0
Refactor/improve the transform module
#1044
chakravarthik27
closed
3 weeks ago
0
Refactor the transform module
#1043
chakravarthik27
closed
3 weeks ago
0
expand-entity-type-support-in-label-representation-tests
#1042
chakravarthik27
closed
3 weeks ago
0
Expand Entity Type Support in Label-Representation Tests
#1041
chakravarthik27
closed
3 weeks ago
0
feat: Add SafetyTestFactory and Misuse class for safety testing
#1040
chakravarthik27
closed
1 week ago
0
Feature/add support for multi model with multi dataset
#1039
chakravarthik27
closed
2 weeks ago
0
Enhancements/improve the logging and its functionalities
#1038
chakravarthik27
closed
1 month ago
0
Blogpost on Leaderboard for Benchmarking Models Locally
#1037
chakravarthik27
opened
1 month ago
0
Blogpost on LLM evaluation in healthcare
#1036
chakravarthik27
opened
1 month ago
0
Blogpost for LLM Evaluation with Prometheus 2
#1035
chakravarthik27
opened
1 month ago
0
Templates for Chat Models for HuggingFace
#1034
chakravarthik27
opened
1 month ago
0
Add Support for Multi-Model with Multi-Dataset
#1033
chakravarthik27
opened
1 month ago
0
updates web pages
#1032
chakravarthik27
closed
1 month ago
0
chore: update langtest version to 2.2.0
#1031
chakravarthik27
closed
1 month ago
0
Release/2.2.0
#1029
chakravarthik27
closed
1 month ago
0
updated: langtest version in pip
#1028
chakravarthik27
closed
1 month ago
0
Improved: `rank_by` argument add to `harness.get_leaderboard()`
#1027
chakravarthik27
closed
1 month ago
0
Include 'Better Safe Than Sorry' Dataset in Healthcare Test
#1026
chakravarthik27
opened
1 month ago
0
Refactor: Improve Code Organization and Readability
#1025
chakravarthik27
closed
1 month ago
0
Fix: Summary class to update summary dataframe and handle file path
#1024
chakravarthik27
closed
1 month ago
0
website updates
#1023
chakravarthik27
closed
1 month ago
0
Refactor: Improved the `import_edited_testcases()` functionality in Harness.
#1022
chakravarthik27
closed
1 month ago
0
Improving the Importing of Edited Testcases into Harness
#1021
chakravarthik27
closed
1 month ago
0
`random_age` Class not returning test cases
#1020
chakravarthik27
closed
2 months ago
0
Bug in `Randomize_age` test
#1019
chakravarthik27
closed
1 month ago
0
Implementation of prompt techniques
#1018
chakravarthik27
closed
1 month ago
0
Improve the logging and its functionalities
#1017
chakravarthik27
closed
1 month ago
0
Feature/data augmentation allow access without harness testing
#1016
chakravarthik27
closed
2 months ago
0
Bug fix/performance tests
#1015
chakravarthik27
closed
2 months ago
0
Data Augmentation: Allow Access Without Harness Testing
#1014
chakravarthik27
closed
1 month ago
0
Output format for CausalLM evaluation for NER.
#1013
chakravarthik27
closed
1 month ago
0
Improvements/load and save benchmark report
#1012
chakravarthik27
closed
1 month ago
0
Implement the prompt techniques like 5-shot, 10-shot, few-shot, and CoT.
#1011
chakravarthik27
closed
1 month ago
0
User prompt handling for multi-dataset testing
#1010
chakravarthik27
closed
2 months ago
0