openzim / openedx

Open edX (to zim) scraper
GNU General Public License v3.0

Extra bulletpoints in https://mooc.phzh.ch/ ZIM file #74

Closed (Popolechien closed 4 years ago)

Popolechien commented 4 years ago

In "Topic 1 This is me" > "Unit 1 About my person", an extra set of bullet points appears next to the regular bullet points (screenshot: Capture d’écran 2020-07-20 à 10 34 50).

satyamtg commented 4 years ago

This is because the xblock HTML that we scrape contains both. The first one is the actual bullet point and the second one is an image. Here's a screenshot of the actual page we scrape from (screenshot: Screenshot from 2020-07-22 15-17-10).

Popolechien commented 4 years ago

Not what I see with Chrome on Mac. Could it be a Linux issue that carries over into the ZIM? (screenshot: Capture d’écran 2020-07-22 à 12 03 21)

satyamtg commented 4 years ago

> Not what I see with Chrome on Mac. Could it be a Linux issue that carries into the zim?

Did you view the course in the LMS or at the xblock URL? The xblock URL is where we scrape from: https://mooc.phzh.ch/xblock/block-v1:PHZH+W-IB+2019_E+type@html+block@3b7395bfc98247e581e5e5d3812c8eea

rgaudin commented 4 years ago

@Popolechien, there's one important piece of information, I think: in Open edX, all blocks are available both within the learning flow (Learning Management System) and individually. The two views should be mostly identical, but there might be differences. The scraper fetches content from the individual view.
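
For readers who want to reproduce the check, here is a minimal Python sketch of the idea discussed above, not the scraper's actual code. It builds the individual-view URL of a block from its usage ID (following the `/xblock/<usage id>` pattern of the URL quoted in this thread) and counts how many `<img>` elements sit inside list items in a piece of HTML. The names `xblock_url`, `ListImageCounter` and `count_list_images` are hypothetical, and the idea that the extra bullet is an `<img>` nested in an `<li>` is an assumption based on the comments here.

```python
from html.parser import HTMLParser
from typing import Tuple
from urllib.parse import quote


def xblock_url(instance_url: str, usage_id: str) -> str:
    """Build the standalone (individual view) URL of a block.

    Assumes the /xblock/<usage id> pattern seen in this thread.
    """
    return f"{instance_url.rstrip('/')}/xblock/{quote(usage_id, safe=':+@')}"


class ListImageCounter(HTMLParser):
    """Count <li> elements and the <img> elements nested inside them."""

    def __init__(self) -> None:
        super().__init__()
        self.li_depth = 0   # how many <li> elements are currently open
        self.li_count = 0   # total number of <li> elements seen
        self.img_in_li = 0  # <img> elements found inside an open <li>

    def handle_starttag(self, tag, attrs):
        if tag == "li":
            self.li_depth += 1
            self.li_count += 1
        elif tag == "img" and self.li_depth > 0:
            self.img_in_li += 1

    def handle_endtag(self, tag):
        if tag == "li" and self.li_depth > 0:
            self.li_depth -= 1


def count_list_images(html: str) -> Tuple[int, int]:
    """Return (number of list items, number of images inside list items)."""
    parser = ListImageCounter()
    parser.feed(html)
    parser.close()
    return parser.li_count, parser.img_in_li
```

Fetching the page at the URL returned by `xblock_url` and passing its HTML to `count_list_images` would show whether the individual view itself already carries image bullets, which is what the scraper would then copy into the ZIM.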

Popolechien commented 4 years ago

@satyamtg Ah yes, in that case I see both bullet points.