Closed Mefisto04 closed 1 month ago
hey @thesofakillers , please check this once.
Hi, the CLI works well enough and is certainly not the bottleneck for MLE-bench, so I am closing this issue.
As for parallelized downloads, I believe this risks running into rate limits and overwhelming kaggle, so i think its fine to leave as is.
The current implementation in
mlebench/cli.py
has areas that can be improved for better efficiency and error handling:Error Handling:
IOError
orFileNotFoundError
.new_registry.get_competition()
returns a valid competition object to avoid potentialAttributeError
.args.command
does not match any expected values.Efficiency:
registry.list_competition_ids()
by storing the result in a variable when used multiple times.prepare
anddownload-leaderboard
commands to speed up processing, especially when working with multiple competitions.Implementing these improvements will enhance the performance of the code. Please assign me this issue so that i can contribute in it.