Benjamin-Loison / DocSolus-extractor

Reverse engineering: with credentials on https://www.doc-solus.fr, this code makes it easy to export all corrections as pictures.

Is it possible to do something without credentials? #1

Open Benjamin-Loison opened 7 months ago

Benjamin-Loison commented 7 months ago

If I remember correctly, the images used to be shown randomly. This is no longer the case, see below.

On 13/01/20 I paid for:

  • Item: All PC corrections for all competitive exams, all subjects, all years (2000-2018), access open until 1 August 2020
  • Price: 19.99 euros

I paid the same for PSI, and I possibly switched PC to MP.

Anyway, these purchases do not matter, as they were only available until 1 August 2020.

As shown on https://www.doc-solus.fr/prepa/sci/adc/bin/view.corrige.html?q=PC_CHIMIE_CCP_1_2014, only the first answer is free; the others are locked.

The first answer image resolves to https://www.doc-solus.fr/prepa/sci/adc/img/questions-1/2014/PC_CHIMIE_CCP_1_2014/1918aab1712dd334e6df1f99c1dcbb19.jpg


The second one resolves to the thumbnail https://www.doc-solus.fr/prepa/sci/adc/img/miniatures/2014/PC_CHIMIE_CCP_1_2014/PC_CHIMIE_CCP_1_2014__q-002.w100px.jpg

Removing .w100px does not help and directory listing is disabled.
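A minimal sketch of how the thumbnail URL of a locked answer seems to be built, inferred only from the single example above. The function name `thumbnail_url`, the three-digit zero padding and the rule that the year is the last field of the identifier are assumptions, not something the site documents.

```python
BASE = "https://www.doc-solus.fr/prepa/sci/adc/img/miniatures"


def thumbnail_url(doc_id: str, question: int) -> str:
    """Build the 100 px thumbnail URL of an answer (hypothetical helper)."""
    # Assumption: the year directory is the last underscore-separated field.
    year = doc_id.rsplit("_", 1)[1]
    # Assumption: question numbers are zero-padded to three digits (q-002).
    return f"{BASE}/{year}/{doc_id}/{doc_id}__q-{question:03d}.w100px.jpg"


print(thumbnail_url("PC_CHIMIE_CCP_1_2014", 2))
```

This only reproduces the low-resolution thumbnail address; it does not give access to the full-size locked images.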


Super-resolution could be tried, but the thumbnail resolution seems too low.

So if I do not want to store a copy, making a temporary proxy seems to be the best solution. Otherwise, just share a copy.

sudo find -iname '*doc*solus*' 2> /dev/null

does not return interesting results on my Linux Mint Framework and Pegasus.
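For reference, a rough Python equivalent of the `find -iname` search above, as a sketch only. `find_iname` is a hypothetical helper name; unlike `find`, it does not test the root directory's own name.

```python
import fnmatch
import os


def find_iname(root: str, pattern: str) -> list[str]:
    """Return paths under root whose base name matches pattern, ignoring case."""
    pattern = pattern.lower()
    matches = []
    # os.walk ignores unreadable directories by default (onerror=None),
    # which plays the role of `2> /dev/null` in the shell command.
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            if fnmatch.fnmatchcase(name.lower(), pattern):
                matches.append(os.path.join(dirpath, name))
    return matches
```

Calling `find_iname(".", "*doc*solus*")` would search the current directory, as the command does when no starting path is given.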

Benjamin-Loison commented 7 months ago

On my 6 TB HDD I have:

Pegasus/home/benjamin/Desktop/BensFolder/School/CPGE/Fenelon/MPX/Contests/DocSolus/

I also have a sibling folder, DocSolus - Copy, which is not identical: it seems to contain fewer files, but also 3 additional contest subjects without corrections, cf. diff.txt.

tree result.

The folder is about 5 GB, but how much of it is subjects and how much is corrections, to possibly avoid hosting the subjects?
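One way to answer this kind of question is to sum file sizes per subfolder. The sketch below assumes that subjects and corrections live in separate immediate subfolders, which the actual DocSolus folder layout may not follow; `dir_size` and `size_by_child` are hypothetical helper names.

```python
import os


def dir_size(path: str) -> int:
    """Total size in bytes of all regular files under path."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(path):
        for name in filenames:
            full = os.path.join(dirpath, name)
            if not os.path.islink(full):  # do not count symlink targets
                total += os.path.getsize(full)
    return total


def size_by_child(root: str) -> dict[str, int]:
    """Map each immediate subfolder of root to its total size in bytes."""
    return {
        entry.name: dir_size(entry.path)
        for entry in os.scandir(root)
        if entry.is_dir(follow_symlinks=False)
    }
```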

1.5 GB of subjects and 3.6 GB of corrections. What compression do the corrections use? At least, exporting one with GIMP at maximal quality gives a larger file:

-rwxr-xr-x 1 benjamin benjamin 104254 Apr 24  2022 1.jpg
-rw-rw-r-- 1 benjamin benjamin 234073 Mar  3 22:09 1_gimp.jpg
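As a quick check of the two sizes listed above, the ratio can be computed directly. The variable names are only illustrative.

```python
original_size = 104254  # bytes, 1.jpg
gimp_size = 234073      # bytes, 1_gimp.jpg

ratio = gimp_size / original_size
print(f"GIMP export is {ratio:.2f}x the size of the original")
```

So the maximal quality export is more than twice as large as the stored file.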

So let us host this folder.

4.3 GB as a zip (generated with zip default parameters).
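A sketch of producing a comparable archive from Python rather than with the `zip` command. `zip_folder` is a hypothetical helper; it uses Deflate at the library's default level, which should be close to, but is not guaranteed to match, what `zip` produces with default parameters.

```python
import os
import zipfile


def zip_folder(folder: str, archive: str) -> None:
    """Write every file under folder into archive, with paths relative to folder."""
    # The archive should be created outside of folder, so it is not zipped into itself.
    with zipfile.ZipFile(archive, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for dirpath, _dirnames, filenames in os.walk(folder):
            for name in filenames:
                full = os.path.join(dirpath, name)
                zf.write(full, arcname=os.path.relpath(full, folder))
```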

Allowing each file/folder to be downloaded individually would be nice.

Encryption, so that the file host does not know what is hosted, would be nice too.

Due to Proton Drive restrictions I have not uploaded subjects.

To avoid spending my Proton Drive quota, I could ask the interested person to give me access to theirs, if a restricted API exists. I like to be independent of other people, but if an API eases the process I do not really mind. Note that this also assumes that they do not change their mind, as I hope to keep it hosted for as long as possible.

Related to Improve_websites_thanks_to_open_source/issues/{419,420}.

Benjamin-Loison commented 7 months ago

https://codeberg.org/Benjamin_Loison/Improve_websites_thanks_to_open_source/issues?q=[drive.proton.me]

Finally uploaded to Proton Drive as a single .zip, subjects included. Both upload and download were reasonably fast in my opinion, contrary to the folder upload.