dgorissen / coursera-dl

A script for downloading course material (videos, PDFs, quizzes, etc.) from coursera.org
http://dirkgorissen.com/2012/09/07/coursera-dl-a-coursera-download-script/
GNU General Public License v3.0

can not download videos, only two files: index.html and lectures.html #13

Closed thinkall closed 11 years ago

thinkall commented 11 years ago

[Screenshot: coursera-dl-error]

I ran the command shown in the picture, but got only two files: 'index.html' and 'lectures.html'.

The course "Think Again" is in week 7.

Other finished courses, such as "Model Thinking", have the same problem.

I can't watch YouTube videos directly and usually go through a proxy, so that may be the cause. But I don't know how to set a proxy for coursera-dl, so I can't confirm it.
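
Nothing in this thread says whether coursera-dl has a proxy option. As a general illustration only (not coursera-dl's own code), this is how a Python program can route requests through a proxy with the standard library. The proxy address is a placeholder and `build_proxy_opener` is a hypothetical helper name.

```python
import urllib.request


def build_proxy_opener(proxy_url):
    """Return an opener that sends HTTP and HTTPS requests via proxy_url."""
    # ProxyHandler takes a mapping of URL scheme -> proxy address.
    handler = urllib.request.ProxyHandler(
        {"http": proxy_url, "https": proxy_url}
    )
    opener = urllib.request.build_opener(handler)
    return opener, handler


# Placeholder address: replace with the proxy you actually use.
opener, handler = build_proxy_opener("http://127.0.0.1:8080")
print(handler.proxies)
# opener.open("http://example.com/") would now go through the proxy.
```

Building the opener makes no network request, so the configuration can be inspected before it is used.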

caspik commented 11 years ago

have the same problem

dgorissen commented 11 years ago

coursera just redesigned their site which broke the script. I did a quick fix, see if it works now.

Anrock commented 11 years ago

Same problem right now. Redesigned again?

dgorissen commented 11 years ago

yup, I am actively fixing this now

Update: I have a fix but ran into a problem with one of the libraries I'm using (spynner); the fix is blocked until that's solved.

dgorissen commented 11 years ago

Ok, I think I have fixed it, at the cost of a new dependency on pyqt.. Not ideal but it did allow me to keep most of the other code unchanged, minimizing the chance of breaking something else. Will try to refactor this further in the future.

Closing issue now, please test and reopen if you still have problems.

Edit: while it worked it required a dependency on pyqt, this has now been removed again as a better solution was found.

rodch-us commented 11 years ago

Is it working for anyone else? I am getting login errors and timeouts. Any idea what I am doing wrong here?

here is dump from my Win7 machine, showing errors:

c:>python.exe courseradownloader.py -u XXX@yahoo.com -p XXXXX precalculus-001
HTML parser set to lxml

cdoremus commented 11 years ago

I get the same results (only html files download) for dataanalysis-001 and maththink-002. I am using Python 2.7.3 on Windows XP.

danzean commented 11 years ago

I am unable to download as well. It quits after downloading two pages and exits to the terminal prompt. Here's what I am getting:

Warning: lxml not available, falling back to built-in 'html.parser' (see -q option), this may cause problems on Python < 2.7.3
HTML parser set to html.parser
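
The warning above describes a parser fallback: use lxml when it is installed, otherwise fall back to Python's built-in 'html.parser'. A minimal sketch of that pattern follows. It is not the script's actual code, and `pick_html_parser` is a hypothetical name chosen for illustration.

```python
import importlib.util


def pick_html_parser(preferred="lxml"):
    """Return the preferred parser name if its module is importable,
    otherwise the name of the built-in fallback parser."""
    if importlib.util.find_spec(preferred) is not None:
        return preferred
    print("Warning: %s not available, falling back to built-in "
          "'html.parser'" % preferred)
    return "html.parser"


parser = pick_html_parser()
print("HTML parser set to %s" % parser)
```

Installing lxml (for example with `pip install lxml`) would make the first branch succeed and remove the warning.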

dgorissen commented 11 years ago

Please use the latest version. Should all work now.


Web / Blog : http://dirkgorissen.com Twitter : https://twitter.com/elazungu

On Tue, Mar 19, 2013 at 4:05 AM, danzean notifications@github.com wrote:

I am unable to download as well. It quits after downloading two pages and exits to the terminal prompt. Here's what I am getting:

Warning: lxml not available, falling back to built-in 'html.parser' (see -q option), this may cause problems on Python < 2.7.3
HTML parser set to html.parser

  • Authenticating as xxxxx@yahoo.com...
  • Already logged in
  • Collecting downloadable content from http://class.coursera.org/sciwrite-2012-001/lecture/index
  • Got all downloadable content for sciwrite-2012-001
  • sciwrite-2012-001 will be downloaded to /Users/xxxx/test/sciwrite-2012-001
    • Downloading lecture/syllabus pages
    • "index.html"
    • "lectures.html"
MacBook-Pro:$


togaurav commented 11 years ago

I ran the latest script but still getting the same issue... I am using Python 2.7.3 version.

stack trace::

D:\gaurav-git-repo\coursera-dl\courseradownloader>python.exe courseradownloader.py -u ****@*** -p **** -d F:\tech\coursera introACpartI-001
Warning: lxml not available, falling back to built-in 'html.parser' (see -q option), this may cause problems on Python < 2.7.3
HTML parser set to html.parser

dgorissen commented 11 years ago

works perfectly here, did you accept the honour code?



On Wed, Mar 20, 2013 at 4:00 PM, Gaurav notifications@github.com wrote:

I ran the latest script but still getting the same issue... I am using Python 2.7.3 version.

stack trace::

D:\gaurav-git-repo\coursera-dl\courseradownloader>python.exe courseradownloader.py -u ****@*** -p **** -d F:\tech\coursera introACpartI-001

Warning: lxml not available, falling back to built-in 'html.parser' (see -q option), this may cause problems on Python < 2.7.3
HTML parser set to html.parser

  • Authenticating as ****@*******...
  • Already logged in
  • Collecting downloadable content from http://class.coursera.org/introACpartI-001/lecture/index
  • Got all downloadable content for introACpartI-001
  • introACpartI-001 will be downloaded to F:\tech\coursera\introACpartI-001
    • Downloading lecture/syllabus pages
    • "index.html" already exists, skipping
    • "lectures.html" already exists, skipping


togaurav commented 11 years ago

Thanks boss. I deleted the old script and downloaded the new one, and then it started working. Thank you guys.

dgorissen commented 11 years ago

Ok, just to be sure here, I made some minor changes to the log output. Can you upgrade, run again, and paste the complete output?



On Sat, Mar 23, 2013 at 7:36 AM, Gaurav notifications@github.com wrote:

Yes, I accepted the honor code. It was working last week, but for the past 3-4 days I have been facing the same issue.
