openzim / sotoki

StackExchange websites to ZIM scraper
https://library.kiwix.org/?category=stack_exchange
GNU General Public License v3.0
223 stars 26 forks source link

Rewrites list_all.sh based on XML file #60

Closed kelson42 closed 5 years ago

kelson42 commented 7 years ago

Availabel here https://ia800500.us.archive.org/22/items/stackexchange/stackexchange_files.xml

kelson42 commented 5 years ago

@dattaz Seems to not work well (I see a <files> at the first line):

 $ ./list_all.sh > toto
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100  102k  100  102k    0     0  82678      0  0:00:01  0:00:01 --:--:-- 82678
$ more toto 
<files>
3dprinting.meta.stackexchange.com
3dprinting.stackexchange.com
Sites.xml" source="original">
academia.meta.stackexchange.com
academia.stackexchange.com
ai.meta.stackexchange.com
ai.stackexchange.com
android.meta.stackexchange.com
android.stackexchange.com
anime.meta.stackexchange.com
kelson42 commented 5 years ago

@dattaz thx