facebookresearch / SentEval

A python tool for evaluating the quality of sentence embeddings.
Other
2.09k stars 309 forks source link

cabextract: command not found #2

Closed loretoparisi closed 7 years ago

loretoparisi commented 7 years ago

I did this on Ubuntu16.04LTS

git clone https://github.com/facebookresearch/SentEval
cd SentEval/
cd data/
./get_transfer_data.bash
...
./get_transfer_data.bash: line 194: cabextract: command not found
cat: senteval_data/MRPC/_2DEC3DBE877E4DB192D17C0256E90F1D: No such file or directory
cat: senteval_data/MRPC/_D7B391F9EAFF4B1B8BCE8F21B20B1B61: No such file or directory
rm: cannot remove 'senteval_data/MRPC/_*': No such file or directory
loretoparisi commented 7 years ago

[UPDATE]

I have solved the first issue doing on Ubuntu16.04 LTS:

sudo apt-get install cabextract

then the cabinet extraction worked:

Extracting cabinet: senteval_data/MRPC/MSRParaphraseCorpus.msi
  extracting senteval_data/MRPC/_2D65ED66D69C42A28B021C3E24C1D8C0
  extracting senteval_data/MRPC/_2DEC3DBE877E4DB192D17C0256E90F1D
  extracting senteval_data/MRPC/_63DE49D9E7214609BE7E38DD145D8081
  extracting senteval_data/MRPC/_B3CFEFE1C368459BA1D1B8A2FA07A16D
  extracting senteval_data/MRPC/_C5BC91AAB1554DF3AF5E4105DE57C85A
  extracting senteval_data/MRPC/_D18B15DC041F43D7925309EFFCFE0236
  extracting senteval_data/MRPC/_D7B391F9EAFF4B1B8BCE8F21B20B1B61

All done, no errors.
loretoparisi commented 7 years ago

[UPDATE] By the way, I have created a gist that install all dependecies, run the download scripts, and the tasks.

https://gist.github.com/loretoparisi/9311328876526f37f4893de3c293bd46

aconneau commented 7 years ago

Hey, thanks for using SentEval! Great that you solved the problem. Yes indeed cabextract is needed to extract the MSI files from MRPC. Note that we mentionned the installation of cabextract in the "Download" section of the README, you may have missed it :). The problem (and the reason why I didn't include the installation of cabextract in the .bash file) is that "sudo apt-get install cabextract" will not work for CentOS (where you need "yum"). Plus, some users may need to "sudo", which I didn't want to insert in the bash script.

aconneau commented 7 years ago

solved

loretoparisi commented 7 years ago

@aconneau right! another problem is that to install conda you need to accept T&C manually, so that if you launch the gist I have pasted you need some interaction. I do not know a way to install conda silently i.e. with no prompt at this time.

loicbarrault commented 7 years ago

@loretoparisi conda is the working environment, I don't consider it as a dependency. Then it should be installed correctly before. Plus, you might want to run InferSent in a specific environment as well.

loretoparisi commented 7 years ago

well that is true.