Open artemb123 opened 8 years ago
@artemb123 add your site to the OPENEDX_SITES
collection in edx_dl.py and try using the -x
switch while downloading.
@sudhirmishra It does not work this way. It looks like it's based on edx platform, but is slightly different (at least in the part where you build initial data structures with courses, units and videos). The courseware content and links, though, look pretty much the same as in other edx-based platforms. My guess is that we need custom-tailored page extractor.
I also did not get any luck. What can we do now?
@artemb123 I've looked into the structure of this platform a little bit more. So far I only managed to implement login procedure. Here is my crude, work-in-progress, patch, if you're interested: http://sprunge.us/JRaI Notice that this approach is bad because it breaks existing code and currently supported sites, it needs to be reworked to avoid that.
If you want to contribute, you need to implement page extractor. In fact, the structure is very similar to CurrentEdxPageExtractor but is slightly different here and there. For example, here: https://github.com/coursera-dl/edx-dl/blob/master/edx_dl/parsing.py#L295 class names, tags in extract_sections_from_html
should be adjusted. extract_unit
is likely to be more or less the same, except that it looks like this platform does not provide youtube links (and I'm not sure about subtitles).
Generally, it's not a rocket science, you just need to carefully inspect page source and put the structure into code.
@balta2ar thank you for keeping me in touch. I will try to find some time to work on the above task. Thx!
@balta2ar Unfortunately, in the next 3-4 weeks I can't manage this task. Sorry.
Still no support for openedu.ru?
Dear authors, please, add support for openedu.ru. I am more than happy to contribute. Thank you.