KennethEnevoldsen / scandinavian-embedding-benchmark

A Scandinavian Benchmark for sentence embeddings
https://kennethenevoldsen.github.io/scandinavian-embedding-benchmark/
MIT License
27 stars 3 forks source link

fix: Updating brokens links #174

Closed KennethEnevoldsen closed 7 months ago

KennethEnevoldsen commented 7 months ago

Fixing broken links from ScandEval. These a temporary fixes @x-tabdeveloping as I imagine, we will update Scandinavian emb. benchmark to the latest version of MTEB soon (once it is settled).

Fixes: #173

Summary by CodeRabbit

coderabbitai[bot] commented 7 months ago

Walkthrough

This update encompasses a variety of improvements and dataset updates across different tasks, focusing on sentiment analysis, language identification, and retrieval tasks. Key changes include the introduction of bootstrapping for average rank computation, updates to task versions and timestamps, and enhancements in evaluation metrics. Task descriptions and metadata have also been refined for better clarity and precision.

Changes

Files Summary
docs/update_benchmark_tables.py Added bootstrapping for average rank computation; updated compute_avg_rank.
src/seb/cache/.../all-MiniLM-L6-v2/*.json,
src/seb/cache/.../sentence-transformers__all-MiniLM-L6-v2/*.json
Added new datasets, updated evaluation scores, task versions, and timestamps.
src/seb/interfaces/model.py Simplified exception handling in encode_queries and encode_corpus.
src/seb/mteb_tasks/retrieval/norquad.py,
src/seb/registered_tasks/.../*.py
Updated task attributes, added custom subclasses for improved descriptions.
makefile, pyproject.toml Adjusted linting commands and updated dependencies in pyproject.toml.

šŸ‡āœØ
In the code's garden, under the moon's gleam,
Changes sprout like flowers, in a coder's dream.
With each line refined, and each dataset new,
Our AI's mind grows, learning more for you.
So hop along, friends, through this digital spree,
Where code meets poetry, in harmony. šŸŒŸšŸ¾

Assessment against linked issues

Objective Addressed Explanation
Extend the dataset to include other Scandinavian languages āŒ The focus was on dataset updates and enhancements, not on adding new languages.
Check the availability of translations for specific resources ā“ Translation availability wasn't explicitly mentioned in the changes.
Integrate resources for Greenlandic, Icelandic, and Faroese āŒ The update primarily focused on existing tasks and datasets.
Potentially include Finnish datasets āŒ The changes did not involve adding Finnish datasets.

Possibly related issues

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

Share - [X](https://twitter.com/intent/tweet?text=I%20just%20used%20%40coderabbitai%20for%20my%20code%20review%2C%20and%20it%27s%20fantastic%21%20It%27s%20free%20for%20OSS%20and%20offers%20a%20free%20trial%20for%20the%20proprietary%20code.%20Check%20it%20out%3A&url=https%3A//coderabbit.ai) - [Mastodon](https://mastodon.social/share?text=I%20just%20used%20%40coderabbitai%20for%20my%20code%20review%2C%20and%20it%27s%20fantastic%21%20It%27s%20free%20for%20OSS%20and%20offers%20a%20free%20trial%20for%20the%20proprietary%20code.%20Check%20it%20out%3A%20https%3A%2F%2Fcoderabbit.ai) - [Reddit](https://www.reddit.com/submit?title=Great%20tool%20for%20code%20review%20-%20CodeRabbit&text=I%20just%20used%20CodeRabbit%20for%20my%20code%20review%2C%20and%20it%27s%20fantastic%21%20It%27s%20free%20for%20OSS%20and%20offers%20a%20free%20trial%20for%20proprietary%20code.%20Check%20it%20out%3A%20https%3A//coderabbit.ai) - [LinkedIn](https://www.linkedin.com/sharing/share-offsite/?url=https%3A%2F%2Fcoderabbit.ai&mini=true&title=Great%20tool%20for%20code%20review%20-%20CodeRabbit&summary=I%20just%20used%20CodeRabbit%20for%20my%20code%20review%2C%20and%20it%27s%20fantastic%21%20It%27s%20free%20for%20OSS%20and%20offers%20a%20free%20trial%20for%20proprietary%20code)

Tips ### Chat There are 3 ways to chat with [CodeRabbit](https://coderabbit.ai): - Review comments: Directly reply to a review comment made by CodeRabbit. Example: - `I pushed a fix in commit .` - `Generate unit testing code for this file.` - `Open a follow-up GitHub issue for this discussion.` - Files and specific lines of code (under the "Files changed" tab): Tag `@coderabbitai` in a new review comment at the desired location with your query. Examples: - `@coderabbitai generate unit testing code for this file.` - `@coderabbitai modularize this function.` - PR comments: Tag `@coderabbitai` in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples: - `@coderabbitai generate interesting stats about this repository and render them as a table.` - `@coderabbitai show all the console.log statements in this repository.` - `@coderabbitai read src/utils.ts and generate unit testing code.` - `@coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.` Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments. ### CodeRabbit Commands (invoked as PR comments) - `@coderabbitai pause` to pause the reviews on a PR. - `@coderabbitai resume` to resume the paused reviews. - `@coderabbitai review` to trigger a review. This is useful when automatic reviews are disabled for the repository. - `@coderabbitai resolve` resolve all the CodeRabbit review comments. - `@coderabbitai help` to get help. Additionally, you can add `@coderabbitai ignore` anywhere in the PR description to prevent this PR from being reviewed. ### CodeRabbit Configration File (`.coderabbit.yaml`) - You can programmatically configure CodeRabbit by adding a `.coderabbit.yaml` file to the root of your repository. - Please see the [configuration documentation](https://docs.coderabbit.ai/guides/configure-coderabbit) for more information. - If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: `# yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json` ### Documentation and Community - Visit our [Documentation](https://coderabbit.ai/docs) for detailed information on how to use CodeRabbit. - Join our [Discord Community](https://discord.com/invite/GsXnASn26c) to get help, request features, and share feedback. - Follow us on [X/Twitter](https://twitter.com/coderabbitai) for updates and announcements.