Farama-Foundation / Minari

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities
https://minari.farama.org

unable to download dataset #220

Closed: GongXudong closed this issue 2 months ago

GongXudong commented 2 months ago

Describe the bug

When I try to download a dataset with the CLI,

minari download antmaze-large-play-v1

it downloads nothing, but the terminal responds with

Downloading antmaze-large-play-v1 from Farama servers...

Dataset antmaze-large-play-v1 downloaded to /home/XXX/.minari/datasets/antmaze-large-play-v1

It seems there's something wrong with the server.

When I try to download the dataset from a Python script,

minari.load_dataset('antmaze-large-play-v1', download=True)

it prints the following and then raises a ValueError:

Downloading antmaze-large-play-v1 from Farama servers...

Dataset antmaze-large-play-v1 downloaded to /home/ucav/.minari/datasets/antmaze-large-play-v1
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
Cell In [5], line 1
----> 1 minari_dataset = minari.load_dataset("antmaze-large-play-v1", download=True)
      2 env = minari_dataset.recover_environment(render_mode="human")

File ~/anaconda3/envs/minari/lib/python3.8/site-packages/minari/storage/local.py:39, in load_dataset(dataset_id, download)
     33         raise FileNotFoundError(
     34             f"Dataset {dataset_id} not found locally at {file_path}. Use download=True to download the dataset."
     35         )
     37     hosting.download_dataset(dataset_id)
---> 39 return MinariDataset(data_path)

File ~/anaconda3/envs/minari/lib/python3.8/site-packages/minari/dataset/minari_dataset.py:91, in MinariDataset.__init__(self, data, episode_indices)
     89     self._data = data
     90 elif isinstance(data, (str, os.PathLike)):
---> 91     self._data = MinariStorage(data)
     92 else:
     93     raise ValueError(f"Unrecognized type {type(data)} for data")

File ~/anaconda3/envs/minari/lib/python3.8/site-packages/minari/dataset/minari_storage.py:35, in MinariStorage.__init__(self, data_path)
     23 """Initialize a MinariStorage with an existing data path.
     24 
     25 To create a new dataset, use the class method `new`.
   (...)
...
---> 35     raise ValueError(f"The data path {data_path} doesn't exist")
     36 file_path = os.path.join(str(data_path), "main_data.hdf5")
     37 if not os.path.exists(file_path):

ValueError: The data path /home/ucav/.minari/datasets/antmaze-large-play-v1/data doesn't exist
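
To confirm that the download really left nothing on disk, a check along the following lines may help. It is only a sketch: it assumes the layout visible in the traceback above (a `data` directory under the dataset folder that contains `main_data.hdf5`) and the default `~/.minari/datasets` location. The helper name `dataset_files_present` and its `datasets_root` parameter are hypothetical and not part of Minari.

```python
from pathlib import Path


def dataset_files_present(dataset_id, datasets_root=None):
    """Return True if <root>/<dataset_id>/data/main_data.hdf5 exists.

    Hypothetical helper, not part of Minari. The directory layout is
    inferred from the traceback; the default root is an assumption.
    """
    if datasets_root is None:
        # Assumed default location, as reported in the download message.
        datasets_root = Path.home() / ".minari" / "datasets"
    main_file = Path(datasets_root) / dataset_id / "data" / "main_data.hdf5"
    return main_file.is_file()


if __name__ == "__main__":
    # Reports whether the expected file is on disk for this dataset.
    print(dataset_files_present("antmaze-large-play-v1"))
```

If this reports `False` right after a "downloaded to ..." message, the download created no usable files, which matches the error above.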

System Info

Ubuntu 18.04

younik commented 2 months ago

Thanks for spotting the issue, @GongXudong. We are changing the dataset naming convention for the new release (see https://github.com/Farama-Foundation/Minari/pull/217 and https://github.com/Farama-Foundation/Minari/pull/219), and we accidentally broke the remote for previous versions. It should be fixed now; let me know if you run into any other problems.