coursera-dl / edx-dl

A simple tool to download video lectures from edx.org (and other openedx sites)
GNU Lesser General Public License v3.0
1.93k stars 640 forks source link

Resource extraction issues with case + url prefixes #353

Open iemejia opened 8 years ago

iemejia commented 8 years ago

Subject of the issue

As reported by @hxhxd there are courses where the resources have different case, e.g. .PDF edx-dl ignore such resources if they are not passed explicitly (--file-formats PDF).

Your environment

This URL containes one case

https://courses.edx.org/courses/course-v1:BUx+Math226.1x+1T2016/courseware/e2ffa6878339447d86234a4d764ccac6/6dd05cbf2e794f13af10a17c0982df35/

Additionally there are courses whose resources url start with different patterns e.g. // that are also skipped by the script.

Expected behaviour

It should download the resources independently if they are in upper or lower case

Actual behaviour

it is strict, it only downloads the resources in lowercase if we don't explicitly tell it to download them in a different case.

iemejia commented 8 years ago

Issue where discussion started was #335