AriZoneVibes / ServLibScrapper

Download manuals from ServLib into local and convert them to PDF
8 stars 1 forks source link

The archive of the original PDF files from ServLib #3

Open arzam16 opened 1 year ago

arzam16 commented 1 year ago

Not really an issue, just wanted to tell I made a "surprise decentralized backup" of all PDF files in the original quality. The size is approx. 175 GB, and the files were split by manufacturer and the type of a product. I found this project while I was searching for a way to get the original files (not scrapped or recompressed) and I thought it'd be cool to get back there and share the results.

magnet:?xt=urn:btih:2936a05ae45d25a48ccc85f4aa3f4735f7ed998c&dn=ServLib%20dump%20(2023-05-03)

Update 2024.01.21 The full copy is now available in the cloud storage: https://github.com/AriZoneVibes/ServLibScrapper/issues/3#issuecomment-1902591834

Also pinging @robertschulze and @powerbroker from https://github.com/AriZoneVibes/ServLibScrapper/issues/1 because the original files might be interesting for them.

marek26340 commented 9 months ago

Hi, any chance of you still having the files around? I can't seem to get the magnet link to work. Thank you! (Looking for a Sharp MX-2630N service manual link)

powerbroker commented 9 months ago

for me it works. downloading now... btw, @arzam16, how did you managed with it?

marek26340 commented 9 months ago

I may be dumb and be doing something wrong, but just pasting the magnet link into μTorrent (also tried qBittorrent) yields 0 on the download speed, zero peers, no files shown. Do I need to manually give it a tracker or something?

powerbroker commented 9 months ago

Do I need to manually give it a tracker or something?

looks like you did everything right. and the only thing we both need to do is to kindly ask @arzam16(or another peers) to be online and continue sharing(my download stopped about a hour ago) ;) anyway, he made a backup. i wonder how - we could do it too maybe...

marek26340 commented 9 months ago

I see. I haven't had a chance to even peek into the file structure, and all I'm really interested about is the Sharp MX-2630N service manual (linked above). Did your download get to that file yet or not? Please, I need that file. Thank you!

powerbroker commented 9 months ago

I see. I haven't had a chance to even peek into the file structure, and all I'm really interested about is the Sharp MX-2630N service manual (linked above). Did your download get to that file yet or not? Please, I need that file. Thank you!

this is how file structure looks like: ServLib dump afaik, MX-2630N most likely is in sharp/printer.7z... so i could set it's priority higher and if they continue to share download it first

marek26340 commented 9 months ago

Yes, it's most likely in there. It's alright, I'll just set up headless qBittorrent on my server and let it wait until it's available again. Thanks.

powerbroker commented 9 months ago

Yes, it's most likely in there. It's alright, I'll just set up headless qBittorrent on my server and let it wait until it's available again. Thanks.

please enjoy MX-2630N manuals at https://drive.google.com/file/d/1ksfaYkUATFmpa93gMfWu-_1xv3Kzjafu/view?usp=sharing

marek26340 commented 9 months ago

Downloaded, thank you very much!!

arzam16 commented 9 months ago

@powerbroker

btw, @arzam16, how did you managed with it?

One of ServLib domains had a misconfigured AWStats instance without authentication. I checked the file access log in there and derived the original file paths. Those weren't protected by auth as well so I just fed the URL list to wget and that's it. I don't know about nowadays, perhaps it could've been fixed? I remember I used ffuf with Bo0oM's list and some fancy filter-by-size option to find the AWStats endpoint.

and the only thing we both need to do is to kindly ask @arzam16(or another peers) to be online and continue sharing(my download stopped about a hour ago) ;)

Seeding from my location is complicated because the incoming connections are restricted. Even when they aren't the speed is crap and there's nothing I could personally do in this situation. However, I've still got ~1.7 TiB uploaded on this torrent. I don't know why people don't keep seeding after downloading.

powerbroker commented 9 months ago

One of ServLib domains had a misconfigured AWStats instance without authentication. I checked the file access log in there and derived the original file paths. Those weren't protected by auth as well so I just fed the URL list to wget and that's it. I don't know about nowadays, perhaps it could've been fixed? I remember I used ffuf with Bo0oM's list and some fancy filter-by-size option to find the AWStats endpoint.

nice, but a little comlicated to run it myself...

Seeding from my location is complicated because the incoming connections are restricted. Even when they aren't the speed is crap and there's nothing I could personally do in this situation. However, I've still got ~1.7 TiB uploaded on this torrent. I don't know why people don't keep seeding after downloading.

maybe, since the torrent isn't registered on Rutracker and resources like that? ;) btw, i could help with sharing ~500G for some period of time...

A000h commented 8 months ago

Unfortunately ServLib appears to have been completely down for a while now, and the torrent linked above also doesn't work for me (cannot even retrieve the file list). Anyone with a copy of the files, can you check if you can find any service manuals for Sony ICD-PX440 microphone recorder? Supposedly ServLib had these.

Thank you 🙃

arzam16 commented 8 months ago

@A000h There are two ICD-PX440 revisions, I attached the archives for both revisions. The first one contains a small 2-page brochure while the other one is a decent service manual. icd-px440.zip icd-px440-sm2.zip

A000h commented 8 months ago

sm2 is exactly what I'm after, thanks a lot for your help!

arzam16 commented 7 months ago

Great news! I spoke with the staff of RosSkhema, the local russian community of device repairmen, and they kindly agreed to host the full copy of Servlib files in their public cloud storage. This means Servlib files from my torrent should be available at any time at nice speed.

Access instructions:

  1. Visit* https://disk.yandex.ru/d/sTIEMacXigVl3w
  2. Scroll down to my folder called arz
  3. The Servlib archives will be there

The file index is available in human/machine-readable form in servlib.json. Yes it will suck to download multiple gigabytes of archives to obtain just a single PDF but I had no other choice back when I was making the dump. I will ask the staff of RosSkhema if it's possible to host the extracted files.

* some european countries block connection to this site (as part of sanctions lol). If you are from FI/EE/LV/LT/UA you might need a proxy. Still better than spending a month on downloading from a semi-dead peer (me).

Cheers!

robertschulze commented 7 months ago

Seeding for anyone interested