coursera-dl / edx-dl

A simple tool to download video lectures from edx.org (and other openedx sites)
GNU Lesser General Public License v3.0
1.93k stars 639 forks source link

Using edx-dl 0.1.12 , Still downloading empty folder structure, Issue #587 not yet resolved.. #589

Closed pintu4india closed 4 years ago

pintu4india commented 4 years ago

I am using windows 7 .

Command:-

C:\Users........\edx-dl-master>edx-dl -u **** -p **** https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/course/

Output:-

edx_dl version 0.1.12 Building initial headers for future requests. Getting initial CSRF token. Found CSRF token. Logging into Open edX site: https://courses.edx.org/login_ajax Extracting course information from dashboard. Downloading Computer Hardware and Operating Systems [course-v1:NYUx+FCS.OS.1+1T2020/co] Downloading 6 section(s) Section 1: Welcome and Syllabus Welcome and Syllabus Discussion Section 2: Lecture Lecture Quiz Discussion Section 3: Lecture Lecture Quiz Discussion Section 4: Lecture Lecture Quiz Discussion Section 5: Lecture Lecture Quiz Discussion Section 6: Lecture Lecture Quiz Discussion Extracting all units information in parallel. Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@bb0889877ed34854a6138cec657f27b6' Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@400332f1c40448e1adcb5b24c366bbc9' Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@fb6c5add7a3a49a69103337db7dcc2cd' Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@eeef51e44b8642a7a8992ba929827ec2' Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@e002a65c849a409fb4598d5016ced009' Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@c143e765cfa74c46932d69ab69dd0eba' Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@865d4fff89094cca8fd385648916f996' Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@645f8b1ac0c84b09802a496040a52373' Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@22abfc3931da4949b5d54e8355a22a6d' Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@0fb4a83ea1f643e59f6fb02f9b4112da' Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@9bae9b4b538f4adea9219a2367697853' Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@09bf9162d9e84b1486897ff14d89203a' Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@fa46689358c64dc69af6748b4235ba6f' Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@1bf9678a638a472cbba7065753d8856d' Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@764a089c1d9f429ebee4d0db8116026d' Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@9fd5159041a0459aac2155aea19bfc92' Processing 'https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/jump_to/block-v1:NYUx+FCS.OS.1+1T2020+type@sequential+block@fece20747d7b4797ab557b299f41b3fb' Removed 0 duplicated urls from 0 in total

Same problem of empty folders as #587 still persists

youtube-dl also upgraded to latest version, still same problem

@Crystyx @harrisony @iemejia @balta2ar @esantoro

abeckman commented 4 years ago

I'm on Win10. Just upgraded edx-dl and youtube-dl.

One course gets all empty folders. In the one below it finds one video previously downloaded in week 0 and downloads one from week 6. I had previously downloaded through week 2, so it is missing weeks 3-5 videos. In the other course I tested with, I had again downloaded week 1, but it indicated no videos at all for week 1 or subsequent weeks.

edx-dl -u **** -p ** https://courses.edx.org/courses/course-v1:DelftX+ROS1x+1T2020/course/ --ignore-error edx_dl version 0.1.12 Building initial headers for future requests. Getting initial CSRF token. Found CSRF token. Logging into Open edX site: https://courses.edx.org/login_ajax Extracting course information from dashboard. Downloading Hello (Real) World with ROS – Robot Operating System [course-v1:DelftX+ROS1x+1T2020/co] Downloading 2 section(s) Section 1: Welcome Welcome Pre-survey Course conventions Course Setup Section 2: Course wrap-up Course wrap-up Post-survey Acknowledgements Extracting all units information in parallel. Processing 'https://courses.edx.org/courses/course-v1:DelftX+ROS1x+1T2020/jump_to/block-v1:DelftX+ROS1x+1T2020+type@sequential+block@659887fafb8847359a9e9287825cbd7a' Processing 'https://courses.edx.org/courses/course-v1:DelftX+ROS1x+1T2020/jump_to/block-v1:DelftX+ROS1x+1T2020+type@sequential+block@49488c09c7274ee2a3c2ec3cb3eac1c7' Processing 'https://courses.edx.org/courses/course-v1:DelftX+ROS1x+1T2020/jump_to/block-v1:DelftX+ROS1x+1T2020+type@sequential+block@9fb4681af77d4b229a2470bed01c7f6c' Processing 'https://courses.edx.org/courses/course-v1:DelftX+ROS1x+1T2020/jump_to/block-v1:DelftX+ROS1x+1T2020+type@sequential+block@1e69cfa733964152b8d34232dedd7d2d' Processing 'https://courses.edx.org/courses/course-v1:DelftX+ROS1x+1T2020/jump_to/block-v1:DelftX+ROS1x+1T2020+type@sequential+block@304850235f024aa6a383239688cd190f' Processing 'https://courses.edx.org/courses/course-v1:DelftX+ROS1x+1T2020/jump_to/block-v1:DelftX+ROS1x+1T2020+type@sequential+block@3591fd3227c1473b9ed6f721b26e9340' Processing 'https://courses.edx.org/courses/course-v1:DelftX+ROS1x+1T2020/jump_to/block-v1:DelftX+ROS1x+1T2020+type@sequential+block@fba3a9c54c59409f8a38cac9c7c4f9ba' Removed 0 duplicated urls from 8 in total Output directory: Downloaded [download] https://youtube.com/watch?v=RoIRFnDLj3c => Downloaded/Hello_Real_World_with_ROSRobot_Operating_System/01-Welcome/01-%(title)s-%(id)s.%(ext)s Downloading video with URL https://youtube.com/watch?v=RoIRFnDLj3c from YouTube. [youtube] RoIRFnDLj3c: Downloading webpage [youtube] RoIRFnDLj3c: Downloading video info webpage [youtube] RoIRFnDLj3c: Downloading MPD manifest [download] Downloaded/Hello_Real_World_with_ROSRobot_Operating_System/01-Welcome/01-ROS1x_2020_Week_0_Overview_course-video-RoIRFnDLj3c.mp4 has already been downloaded [download] 100% of 18.82MiB [download] https://youtube.com/watch?v=8tG0ZEIMgvc => Downloaded/Hello_Real_World_with_ROS__Robot_Operating_System/02-Course_wrap-up/01-%(title)s-%(id)s.%(ext)s Downloading video with URL https://youtube.com/watch?v=8tG0ZEIMgvc from YouTube. [youtube] 8tG0ZEIMgvc: Downloading webpage [youtube] 8tG0ZEIMgvc: Downloading video info webpage [youtube] 8tG0ZEIMgvc: Downloading MPD manifest [download] Downloaded/Hello_Real_World_with_ROS__Robot_Operating_System/02-Course_wrap-up/01-ROS1x_2018_Week_6_Acknowledgements-video-8tG0ZEIMgvc.mp4 has already been downloaded [download] 100% of 22.93MiB

Markpajr commented 4 years ago

Also having this exact issue. I was on version 0.1.11, it pulled in all empty folders. after upgrading to v0.1.12, one video and PDF from my course was downloaded but the rest was empty folders.

Oshibuki commented 4 years ago

you could replace parsing.py with this: parsing.py

pintu4india commented 4 years ago

@tanjiarui15 Does nt solve problem....Still same prob of empty folders...edx-dl and youtube-dl both running on latest versions as per last issue #587 ....Also tried your parsing.py, still same result..

Markpajr commented 4 years ago

I also tried replacing parsing.py with that file, it did not work to solve the issue for me either. This is the course I've been trying to download, though I've tested on past courses that have worked before and it no longer works. https://courses.edx.org/courses/course-v1:GTx+CS1301xIV+3T2019/cour1:GTx+CS1301xIV+3T2019/course/

pintu4india commented 4 years ago

@tanjiarui15 Also tried from Ubuntu 18.04, faced same problem. Plz help..

Oshibuki commented 4 years ago

我还尝试用该文件替换parsing.py,它也无法为我解决问题。这是我一直在尝试下载的课程,尽管我已经对以前有效的过去的课程进行了测试,但现在不再有效。 https://courses.edx.org/courses/course-v1:GTx+CS1301xIV+3T2019/cour1:GTx+CS1301xIV+3T2019/course/

when I open this url,it tell me can not find the course. Are you sure the course url is ok? Maybe you should updated your course to 2020 edition like this : https://courses.edx.org/courses/course-v1:GTx+CS1301xIV+3T2019/course/

Oshibuki commented 4 years ago

@ tanjiarui15也从Ubuntu 18.04尝试过,遇到同样的问题。请帮助

Please download souce code ,and change current folder to source code folder,run this: python3 edx-dl.py -u your-account course-url

pintu4india commented 4 years ago

@ tanjiarui15也从Ubuntu 18.04尝试过,遇到同样的问题。请帮助

Please download souce code ,and change current folder to source code folder,run this: python3 edx-dl.py -u your-account course-url

Since, I have been trying to download particular course, could nt see this.... I am able to download other courses....only courses in this series are giving empty folder downloads...

https://courses.edx.org/courses/course-v1:NYUx+FCS.NET.1+1T2020/course/ https://courses.edx.org/courses/course-v1:NYUx+FCS.OS.1+1T2020/course/ https://courses.edx.org/courses/course-v1:NYUx+FCS.PRG.1+1T2020/course/

plz help

Oshibuki commented 4 years ago

https://courses.edx.org/courses/course-v1:NYUx+FCS.NET.1+1T2020/course/

I opened the url,and I found that in this course there are none content could be download. image In another course,there are some content could be downloaded like this: image

pintu4india commented 4 years ago

https://courses.edx.org/courses/course-v1:NYUx+FCS.NET.1+1T2020/course/

I opened the url,and I found that in this course there are none content could be download. image In another course,there are some content could be downloaded like this: image

Thanks....Closing...this is issue related to edx site, rather than edx-dl..