dgorissen / coursera-dl

A script for downloading course material (videos, PDFs, quizzes, etc.) from coursera.org
http://dirkgorissen.com/2012/09/07/coursera-dl-a-coursera-download-script/
GNU General Public License v3.0
1.74k stars 300 forks

issue(s) downloading some classes #50

Closed dpastoor closed 11 years ago

dpastoor commented 11 years ago

I get the following error message, but only when downloading some courses.

Courses where I noticed the problem:

econ1scientists-2012-001, introstats-001

Courses tried in the same session that downloaded successfully:

GTG-2013-001, macroeconomics-2012-001

Error message:

Course 1 of 1
*collecting downloadable content from https://class.coursera.org/introstats-001/lecture/index
Traceback (most recent call last):
  File "c:\Python27\scripts\coursera-dl-script.py", line 9, in <module>
    load_entry_point('coursera-dl==1.4.8', 'console_scripts', 'coursera-dl')()
  File "c:\Python27\lib\site-packages\courseradownloader\courseradownloader.py", line 598, in main
    d.download_course(cn,dest_dir=args.dest_dir,reverse_sections=args.reverse)
  File "c:\Python27\lib\site-packages\courseradownloader\courseradownloader.py", line 301, in download_course
    (weeklyTopics, allClasses) = self.get_downloadable_content(course_url)
  File "c:\Python27\lib\site-packages\courseradownloader\courseradownloader.py", line 186, in get_downloadable_content
    hrefs = classResources.findAll('a')
AttributeError: 'NoneType' object has no attribute 'findAll'

altimerk commented 11 years ago

Hi, this error appears when lxml is the default parser. Try using html.parser instead.
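For anyone landing here later, a rough sketch of what the traceback points at: a lookup for a resources container returned None, and findAll was then called on it. The snippet below is not the coursera-dl code. The function name resource_links, the "resources" class name and the sample HTML are made up for illustration, and it assumes the beautifulsoup4 package is installed. It only shows passing the parser name explicitly and skipping items where the lookup finds nothing.

```python
try:
    from bs4 import BeautifulSoup  # third-party package: beautifulsoup4
except ImportError:  # lets the sketch be imported even without bs4
    BeautifulSoup = None

# Hypothetical page fragment: one item with resources, one without.
SAMPLE_HTML = (
    '<ul>'
    '<li><div class="resources">'
    '<a href="a.pdf">pdf</a><a href="b.mp4">video</a>'
    '</div></li>'
    '<li>no resources here</li>'
    '</ul>'
)


def resource_links(html, parser="html.parser"):
    """Collect hrefs from every <li> that has a resources container.

    `parser` is handed straight to BeautifulSoup; "html.parser" ships
    with Python, while "lxml" needs the separate lxml package.
    """
    if BeautifulSoup is None:
        raise RuntimeError("beautifulsoup4 is not installed")
    soup = BeautifulSoup(html, parser)
    links = []
    for item in soup.find_all("li"):
        container = item.find("div", class_="resources")
        if container is None:
            # find() returns None when nothing matches; calling
            # find_all on it would raise the AttributeError above.
            continue
        links.extend(a["href"] for a in container.find_all("a", href=True))
    return links
```

Calling resource_links(SAMPLE_HTML) uses html.parser by default; passing parser="lxml" would switch backends if lxml is installed. Different parsers can build different trees from the same messy HTML, which would explain why only some courses fail.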

dpastoor commented 11 years ago

Sorry for my naivety. What would be the best way of forcing the HTML parser by default? Can this be done in the script?

Thanks!

On Wednesday, May 29, 2013, altimerk wrote:

Hi, this error appears when lxml is the default parser. Try using html.parser instead.


altimerk commented 11 years ago

You can use the script's -q option:

coursera-dl -u username -p password -d directory -q html.parser
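On the "can this be done in the script" part: one way a script could expose the parser choice is an option whose default is the parser you want. This is only a sketch with argparse, not the actual coursera-dl source. The function name build_arg_parser, the dest names and the default value are assumptions for illustration; only the flag letters are taken from the command above.

```python
import argparse


def build_arg_parser():
    """Hypothetical option wiring; not the real coursera-dl parser."""
    p = argparse.ArgumentParser(prog="coursera-dl-sketch")
    p.add_argument("-u", dest="username", help="account user name")
    p.add_argument("-p", dest="password", help="account password")
    p.add_argument("-d", dest="dest_dir", default=".",
                   help="directory to download into")
    # Assumed default: html.parser, so no extra flag is needed.
    p.add_argument("-q", dest="parser", default="html.parser",
                   help="name of the HTML parser to hand to BeautifulSoup")
    return p


# Same arguments as the command line shown above.
args = build_arg_parser().parse_args(
    ["-u", "username", "-p", "password", "-d", "directory",
     "-q", "html.parser"]
)
```

The value in args.parser would then be passed along wherever the page is parsed, so changing the default in one place changes the parser for every run.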


dgorissen commented 11 years ago

Assuming this is fixed like #49. Reopen if not.