-
Hi, I used `dair-ai/emotion` to fine-tune the `bert-base-cased` model for a text-classification task.
The dataset loads correctly in this script, so there is no problem up to here:…
-
Papers that don't fit somewhere else right now but may be relevant in the future:
https://huggingface.co/papers/2409.18943
https://arxiv.org/pdf/2409.16493
-
- [Supported](https://github.com/EleutherAI/lm-evaluation-harness/blob/main/lm_eval/models/openai_completions.py#L368) in the original lm-evaluation-harness.
- Allows testing unlimited…
-
# Description
We created a model using mostly test data. We should document the results of this, including the analysis of the results. For:
* Intents
* Entities
* Responses
we will attempt t…
-
-
## CLI Change Report
This report covers changes from October 23, 2024 to October 29, 2024 for the `wrangler` CLI monitor source. This is a result of a scan for version `wrangler --version`: `3.82.0` …
-
You have a kingmiddle and KingEnd... Also, the Evaluation section chooses when to call the Endgame. What I don't understand is when the AI ever 'evaluates' using the 'kingEnd' numbers. Is this part cu…
-
Hello,
I would like to ask how to create an evaluation dataset.
When I directly run `python evaluate_generation_model.py --model_path ../../LLM_Models/poison-7b-SUDO- --token SUDO --report_path ./…
-
### Description
Add C# language support for the questions
### Acceptance criteria
- User should be able to select C#
- Code in C#
- Then see answers in C#
- We can probably add saving the …
-
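# Recommendation System Evaluation Metrics | 이직러의 AI 걸작선
In machine learning, evaluating performance is as important as building the model. Recommender systems also fall under unsupervised learning within machine learning, so performance evaluation matters for them too. That is why recommender systems have Recall/Precision@k, MAP@k, NDCG@k, and various other performance…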
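
To make the metrics named above concrete, here is a minimal Python sketch for a single user with binary relevance (an item is either relevant or not). The function names and the sample data are illustrative assumptions, not taken from the post. Several normalizations of AP@k are in use; this sketch divides by `min(len(relevant), k)`, and other sources may divide by the number of hits instead.

```python
import math
from typing import Sequence, Set


def precision_at_k(recommended: Sequence[str], relevant: Set[str], k: int) -> float:
    """Fraction of the top-k recommendations that are relevant."""
    hits = sum(1 for item in recommended[:k] if item in relevant)
    return hits / k


def recall_at_k(recommended: Sequence[str], relevant: Set[str], k: int) -> float:
    """Fraction of all relevant items that appear in the top-k."""
    if not relevant:
        return 0.0
    hits = sum(1 for item in recommended[:k] if item in relevant)
    return hits / len(relevant)


def average_precision_at_k(recommended: Sequence[str], relevant: Set[str], k: int) -> float:
    """AP@k: precision measured at each rank that holds a relevant item.

    Assumption: the sum is normalized by min(len(relevant), k).
    """
    hits = 0
    score = 0.0
    for rank, item in enumerate(recommended[:k], start=1):
        if item in relevant:
            hits += 1
            score += hits / rank
    denom = min(len(relevant), k)
    return score / denom if denom else 0.0


def ndcg_at_k(recommended: Sequence[str], relevant: Set[str], k: int) -> float:
    """NDCG@k with binary gains: DCG divided by the best achievable DCG."""
    dcg = sum(
        1.0 / math.log2(rank + 1)
        for rank, item in enumerate(recommended[:k], start=1)
        if item in relevant
    )
    ideal_hits = min(len(relevant), k)
    idcg = sum(1.0 / math.log2(rank + 1) for rank in range(1, ideal_hits + 1))
    return dcg / idcg if idcg else 0.0


# Hypothetical example: five ranked recommendations, three truly relevant items.
recommended = ["a", "b", "c", "d", "e"]
relevant = {"a", "c", "f"}

print(precision_at_k(recommended, relevant, 3))
print(recall_at_k(recommended, relevant, 3))
print(average_precision_at_k(recommended, relevant, 3))
print(ndcg_at_k(recommended, relevant, 3))
```

MAP@k is then the mean of `average_precision_at_k` over all users. Precision and recall ignore the order inside the top-k, while AP@k and NDCG@k reward placing relevant items earlier in the list.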