Open cleong110 opened 4 months ago
SignBank and Sign2Mint are having loading issues in https://github.com/sign-language-processing/datasets/blob/master/examples/load.ipynb, perhaps this is why.
Running PyTest as noted in #53, I find that some datasets have PermissionErrors like the one below. https://stackoverflow.com/questions/36434764/permissionerror-errno-13-permission-denied suggests this is caused suggests that this might be caused by attempting to use with open on a folder instead of a file.
with open
Example:
self = <datasets.sign_language_datasets.datasets.signbank.signbank.SignBank object at 0x000001A8C5C903D0> dl_manager = <tensorflow_datasets.core.download.download_manager.DownloadManager object at 0x000001A8C5C93790> def _split_generators(self, dl_manager: tfds.download.DownloadManager): """Returns SplitGenerators.""" dataset_warning(self) index = dl_manager.download("http://signbank.org/signpuddle2.0/data/spml/") regex = r"\"sgn[\d]+.spml\"" > with open(index, "r", encoding="utf-8") as f: E PermissionError: [Errno 13] Permission denied: 'C:\\Users\\Colin\\projects\\sign-language\\colin_pull_requesting\\datasets\\sign_language_datasets\\datasets\\signbank\\dummy_data' sign_language_datasets\datasets\signbank\signbank.py:218: PermissionError
Datasets that may be affected due to this:
I tried these in The Colab Loading Script, and SignBank and Sign2Mint actually are crashing.
For SignBank the issue might be at https://github.com/sign-language-processing/datasets/blob/3aa515c0da9f3c5f43db5a8cc407a7abbe083db0/sign_language_datasets/datasets/signbank/signbank.py#L216
For Sign2Mint the issue might be at https://github.com/sign-language-processing/datasets/blob/3aa515c0da9f3c5f43db5a8cc407a7abbe083db0/sign_language_datasets/datasets/sign2mint/sign2mint.py#L84
Below is the FULL PYTEST OUTPUT, very long!
Just dropping notes here, seems like:
tf.io.gfile
DL_EXTRACT_RESULT
dummy_data/some file
SignBank and Sign2Mint are having loading issues in https://github.com/sign-language-processing/datasets/blob/master/examples/load.ipynb, perhaps this is why.
Running PyTest as noted in #53, I find that some datasets have PermissionErrors like the one below. https://stackoverflow.com/questions/36434764/permissionerror-errno-13-permission-denied suggests this is caused suggests that this might be caused by attempting to use
with open
on a folder instead of a file.Example:
Affected Datasets
Datasets that may be affected due to this:
Sign2MINT
```python self =SignBank
```python self =ASL Lex
``` self =DGS Corpus
``` filepath_in = 'C:\\Users\\Colin\\AppData\\Local\\Temp\\tmp6sc7ayxv', filepath_out = 'C:\\Users\\Colin\\AppData\\Local\\Temp\\tmp6sc7ayxv.gz' def _gzip_file(filepath_in: str, filepath_out: str) -> None: > with open(filepath_in, "rb") as filehandle_in: E PermissionError: [Errno 13] Permission denied: 'C:\\Users\\Colin\\AppData\\Local\\Temp\\tmp6sc7ayxv' sign_language_datasets\datasets\dgs_corpus\dgs_corpus_test.py:75: PermissionError ```Dicta-Sign
``` self =I tried these in The Colab Loading Script, and SignBank and Sign2Mint actually are crashing.
For SignBank the issue might be at https://github.com/sign-language-processing/datasets/blob/3aa515c0da9f3c5f43db5a8cc407a7abbe083db0/sign_language_datasets/datasets/signbank/signbank.py#L216
For Sign2Mint the issue might be at https://github.com/sign-language-processing/datasets/blob/3aa515c0da9f3c5f43db5a8cc407a7abbe083db0/sign_language_datasets/datasets/sign2mint/sign2mint.py#L84