fake-name / xA-Scraper

69 stars 8 forks source link

Redundant extension placement #47

Closed God-damnit-all closed 5 years ago

God-damnit-all commented 5 years ago

Out of curiosity, if you try to browse to the relevant URL, does it work?

If the page works properly, but fetching it fails, I'd need to see the actual page to figure out what's going on.

You can e-mail me about it if you don't want to post URLs publicly. anything \<at> fake-url \<dot> com will get to me.

Originally posted by @herp-a-derp in https://github.com/herp-a-derp/xA-Scraper/issues/46#issuecomment-529147940

The actual URL is not important. See how many times 'swf' occurs. It's placing the extension redundantly. It's exactly in this pattern:

http://d.facdn.net swf /art/herpaderp/1234567890/1234567890. swf herpaderp_imodifiedthisurl_video. swf

herp-a-derp commented 5 years ago

Oh, yeah. that's intentional.

Basically, you get <filename it's served as>-<filename derived from webpage>.

There should be a dash or something between the two segments.

Basically, a lot of times the filename the file has as-served is basically useless. I generally still want it for cross-referencing purposes, but having something that you can actually read is handy, so I just use both. Linux is happy with 4K long filenames (well, windows is too if you change some settings), so I just make things longer.