-
This issue is now to track the implementation of various evaluation methods and workflows for LLMs.
Evaluations:
- [x] G-Eval
- [ ] PingPong
- [ ] InfiniteBench
- [ ] Ruler
- [ ] MMLU
- [ ] M…
-
### Issues Policy acknowledgement
- [X] I have read and agree to submit bug reports in accordance with the [issues policy](https://www.github.com/mlflow/mlflow/blob/master/ISSUE_POLICY.md)
### Where…
-
- [ ] I have updated Purchases SDK to the latest version
- [x] I have read the [Contribution Guidelines](https://github.com/RevenueCat/purchases-flutter/blob/main/CONTRIBUTING.md)
- [x] I have searc…
-
# Add `hint` to the artifact specification
### Uploaders
The ocm library has a concept of *uploaders* (also called *blobhandlers*) within the ocm library. These *uploaders*
essentially provide …
-
I've noticed an intriguing phenomenon where the training accuracy is lower than the evaluation accuracy. This seems to deviate from the common trend where training accuracy usually surpasses evaluati…
-
In a meeting between accessibility experts from several Scandinavian organisations, the following questions for 2.4.11 Focus Not Obscured were brought up.
The question is not so much what the sugge…
-
**Is your feature request related to a problem? Please describe.**
I feel that it is too easy for users to rotate their single-sig AIDs accidentally. Once published to witnesses, key rotation is perm…
-
# DevEx/OpEx
While adding OML support to ReportStream, we found a place where supported message types are listed. However, ORM is not included. It turns out warnings are being logged when ORMs are …
-
### Confirm this is a feature request for the .NET library and not the underlying OpenAI API
- [X] This is a feature request for the .NET library
### Describe the feature or improvement you are requ…
-
#### Description:
We aim to develop a framework that applies the Software Development Life Cycle (SDLC) principles to various team-based sports. This framework will help in understanding how project …