Closed Tffffboys closed 8 months ago
Hi @Tffffboys , the inconsistency arises from the differences in pooling implementation between the newer and older versions. To resolve this, you can consider downgrading the angle_emb to 0.1.1
. Below are the results using angle_emb==0.1.1
+-------+-------+-------+-------+-------+--------------+-----------------+-------+
| STS12 | STS13 | STS14 | STS15 | STS16 | STSBenchmark | SICKRelatedness | Avg. |
+-------+-------+-------+-------+-------+--------------+-----------------+-------+
| 75.26 | 85.61 | 80.64 | 86.36 | 82.51 | 85.64 | 80.99 | 82.43 |
+-------+-------+-------+-------+-------+--------------+-----------------+-------+
+-------+-------+-------+-------+-------+--------------+-----------------+-------+
But its performance (82.43) is higher than the original report (82.37) ...
It is a long time ago, maybe I mistakenly uploaded a newer model to HF. I forgot which specific model achieved 82.37 performance😂
Hi @Tffffboys , the inconsistency arises from the differences in pooling implementation between the newer and older versions. To resolve this, you can consider downgrading the angle_emb to
0.1.1
. Below are the results using angle_emb==0.1.1**+-------+-------+-------+-------+-------+--------------+-----------------+-------+** | STS12 | STS13 | STS14 | STS15 | STS16 | STSBenchmark | SICKRelatedness | Avg. | +-------+-------+-------+-------+-------+--------------+-----------------+-------+ | 75.26 | 85.61 | 80.64 | 86.36 | 82.51 | 85.64 | 80.99 | 82.43 | +-------+-------+-------+-------+-------+--------------+-----------------+-------+
Thank you for your reply
But its performance (82.43) is higher than the original report (82.37) ...
It is a long time ago, maybe I mistakenly uploaded a newer model to HF. I forgot which specific model achieved 82.37 performance😂
😂
When I use angle-bert-base-uncased-nli-en-v1 to evaluate STS performance, I find that it is inconsistent with the original report.
The command line I use:
Enviroment:
So is this result acceptable within the error range or is there something wrong with my command?