ContentMine / quickscrape

A scraping command line tool for the modern web
MIT License

Download multiple supplemental files (e.g. tables) #75

Open petermr opened 8 years ago

petermr commented 8 years ago

Some papers link to tables that can be downloaded with a selector whose output is written to table.csv. However, the files need distinct names (e.g. table1.csv, table2.csv, ...); otherwise each download overwrites the previous one and only a single file (the last) remains.
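To illustrate the naming scheme being asked for, here is a minimal Python sketch. It is not quickscrape code; the function name `numbered_names` and its `stem` and `default_ext` parameters are hypothetical, chosen only to show how each captured link could get its own numbered filename.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse


def numbered_names(urls, stem="table", default_ext=".csv"):
    """Return one distinct filename per URL: table1.csv, table2.csv, ..."""
    names = []
    for index, url in enumerate(urls, start=1):
        # Keep the extension from the URL path when there is one,
        # otherwise fall back to the assumed default (.csv).
        ext = PurePosixPath(urlparse(url).path).suffix or default_ext
        names.append(f"{stem}{index}{ext}")
    return names
```

Because the number comes from the position of the link in the result list, two links never map to the same name, so nothing is overwritten.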

petermr commented 8 years ago

Cannot create subdirectories. For example, this scraper element:

    "csv1": {
      "selector": "//a[@id='CSVdownloadButton'][1]",
      "attribute": "href",
      "download": {
        "rename": "tables/table1.csv"
      }
    },

hangs...
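The cause of the hang is not stated here. One plausible requirement, shown as a Python sketch rather than as quickscrape's actual behaviour, is that the parent directory of the rename path must exist before the file is written. The function name `write_download` and its parameters are hypothetical.

```python
from pathlib import Path


def write_download(output_dir, rename, data):
    """Write downloaded bytes to output_dir/rename, creating subdirectories."""
    target = Path(output_dir) / rename
    # Create "tables/" (and any deeper levels) if missing; no error if present.
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_bytes(data)
    return target
```

With this approach a rename value such as "tables/table1.csv" would end up under a newly created tables directory inside the output directory.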