-
In the paper in appendix B.2, you briefly describe how you generate the malicious instructions dataset. Could you share the prompt and seed instructions you used to generate this dataset? And how did …
-
### Issues Policy acknowledgement
- [X] I have read and agree to submit bug reports in accordance with the [issues policy](https://www.github.com/mlflow/mlflow/blob/master/ISSUE_POLICY.md)
### Willi…
-
```
There are new models available on their website that we might want to integrate:
model.20120919 (2MB) -- the Twitter POS model with our coarse 25-tag tagset.
This is included with the tagger rel…
-
# Motivation
I want to use MMLU results by task to better understand the characteristics of LLMs. I am curious to see the differences between architectures and how performance in the tasks change as…
-
Hello,
I am using this tool to evaluate and check opening_hours tags from:
https://openingh.openstreetmap.de/evaluation_tool/
And I think it should throw a warning if I put:
Tu-Su,PH inste…
-
Everything stated here is open to discussion and revision for anyone who wants to give their opinion, except for the name, that's non-negotiable.
# Introduction
Following the spirit of https://git…
pvdrz updated
4 years ago
-
I think this would be extremely straightforward to implement and make for a tidier, less disruptive use of the tag helper.
```html
MenuItem
MenuItem
```
Required c…
-
Hey folks, so this looks absolutely promising for helping me automate a massive collection. I plan to use it just for optional tags from my database only. However, I'm a complete novice with only a li…
-
We could add a data quality level either to BaseDataset or to Evaluation. The idea would be something like:
Level 0 (All data): Load everything that is defined in the dataset.
Level 1 (Sane data):…
-
### What happened?
Results from RFP in v11: 20 (this is expected)
Results rom RFP in v12: 60 (this would be the max if we ignored the modifiers)
It looks like in v12 maximize is run synchronously, …