coursera-dl / edx-dl

A simple tool to download video lectures from edx.org (and other openedx sites)
GNU Lesser General Public License v3.0

edx-dl Skip downloading videos and also convert to PDF #479

Open adimitrova opened 6 years ago

adimitrova commented 6 years ago

FYI Your troubleshooting section doesn't work

🚨 Troubleshooting section

Subject of the issue

  1. I would like to download subtitles only. I tried several commands in the terminal, following the --help instructions, but I can't skip downloading the videos. At this stage I don't want the videos, only the subtitles. Is this related to youtube-dl? I tried to extract only "pdf", "srt" and "vtt" files with -f, --export-format and --file-formats (even though I saw that -s is for subtitles), but it still extracts everything.

  2. Is there a way to convert the course information (including pictures, text, etc.) to PDF, so that I get a single PDF per course (without the videos)?

environment

Steps to reproduce

edx-dl COURSE_URL -u EMAIL -p PASSWORD --export-format "pdf" -s -o /location/to/download/

Expected behaviour

I thought it would extract only pdf, srt, or whatever format I specify.

Actual behaviour

Extracts everything

floviolleau commented 4 years ago

Hi,

I see that this is an old issue, but I will reply anyway.

For the first point, I think this is because the course contains only YouTube videos, as you suspected.

Maybe try combining it with the --prefer-cdn-videos parameter.

For the second point, there is currently no way to merge everything into one PDF, because resources are downloaded as-is from the website (the HTML is parsed and every href found is downloaded). I suggest using another tool, such as PDFsam Basic, to combine all the PDFs.
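
To illustrate the "parse the HTML and look for every href" approach mentioned above, here is a minimal Python sketch using only the standard library. It is not edx-dl's actual code: the names HrefCollector and find_resources, the example URLs, and the idea of filtering by file extension are assumptions made for illustration.

```python
import os
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class HrefCollector(HTMLParser):
    """Collect the href value of every <a> tag in a page (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def find_resources(html, base_url, extensions):
    """Return absolute URLs of links whose path ends in one of `extensions`.

    `extensions` is a set of lowercase extensions without the dot,
    for example {"pdf", "srt"}.
    """
    parser = HrefCollector()
    parser.feed(html)
    wanted = []
    for href in parser.hrefs:
        url = urljoin(base_url, href)  # resolve relative links
        path = urlparse(url).path      # ignore query string and fragment
        ext = os.path.splitext(path)[1].lstrip(".").lower()
        if ext in extensions:
            wanted.append(url)
    return wanted


if __name__ == "__main__":
    page = (
        '<a href="/static/notes.pdf">Notes</a>'
        '<a href="lecture.srt?x=1">Subtitles</a>'
        '<a href="https://youtu.be/abc">Video</a>'
    )
    for link in find_resources(page, "https://courses.example.org/unit/", {"pdf", "srt"}):
        print(link)
```

Each link is kept as a separate file, which is why a single merged PDF does not come out of this kind of download and a separate merging tool is needed afterwards.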

Hope that helps a bit. Kind regards