mitodl / ocw-to-hugo

A command line utility for taking master.json output from ocw-data-parser and producing markdown for use with hugo-course-publisher
3 stars 0 forks source link

some PDFs are missing resource pages #436

Closed pdpinch closed 2 years ago

pdpinch commented 2 years ago

On the legacy site, PDF files were sometimes stored on a separate file server, identified by a root path of /ans7870/.

Theses files should be identified by ocw-data-parser and the PDF files should be collected on S3. However, we don't seem to be creating resources to go with these files.

On the nextgen site, these files shouldn't be treated any differently from other resources (unless they are an HTML file).

Steps to Reproduce

legacy: https://ocw.mit.edu/resources/res-6-001-continuum-electromechanics-spring-2009/textbook-contents/ nextgen: https://ocwnext-rc.odl.mit.edu/courses/res-6-001-continuum-electromechanics-spring-2009/pages/textbook-contents/

  1. Visit https://ocwnext-rc.odl.mit.edu/courses/res-6-001-continuum-electromechanics-spring-2009/pages/textbook-contents/
  2. Click the link to "Front matter (PDF)"

Expected Behavior

Actual Behavior

Related issues

pdpinch commented 2 years ago

@mbertrand does your work to backfill missing resources touch on this?

mbertrand commented 2 years ago

Sorry, missed your last comment. No, the parser does not recognize ans7870 external links as resources, nor does it upload them.

mbertrand commented 2 years ago

Actually never mind, there is a resource for this pdf, going to close this.