dipu-bd / lightnovel-crawler

Generate and download e-books from online sources.
https://pypi.org/project/lightnovel-crawler/
GNU General Public License v3.0
1.48k stars, 291 forks

https://bato.to/ #1601

Closed SirGryphin closed 2 years ago

SirGryphin commented 2 years ago

I would really like support for this site. I tried writing a crawler myself, but something in the chapter body is blocking the scrape and I don't have the free time to figure it out.

Note: This site also has many other domains. If you do get it working, you might want to add these to the base URL list.

https://battwo.com/
https://mto.to/
https://mangatoto.net/
https://dto.to/
https://batocc.com/
https://batotoo.com/
https://wto.to/
https://mangatoto.com/
https://comiko.net/
https://batotwo.com/
https://mangatoto.org/
https://hto.to/

idMysteries commented 2 years ago

[image]

Easy for a Russian hacker :smile_cat: I will try to do it, but no guarantees.

SirGryphin commented 2 years ago

@idMysteries thanks for making this so quickly. Just testing it now. It saves me from using another manga downloader and then converting the images to EPUB.

Update: only problem now is the epub is 750mb lol.

idMysteries commented 2 years ago

It may also be worth implementing a search function.

idMysteries commented 2 years ago

Update: only problem now is the epub is 750mb lol.

Maybe we need to convert the images to another format.

idMysteries commented 2 years ago

I downloaded 10 chapters and it's 170 MB.

idMysteries commented 2 years ago

But no, it already converts to JPEG. [image]

idMysteries commented 2 years ago

It does not seem possible to reduce the file size. T_T 10 chapters = 270 images (x 200 KB each)
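
As a rough check on these figures, here is a back-of-the-envelope sketch. It assumes decimal units (1 MB = 1000 KB) and ignores any EPUB container overhead; both function names are made up for illustration.

```python
def estimate_epub_mb(image_count: int, avg_image_kb: float) -> float:
    """Approximate size in MB of an image-only EPUB, ignoring overhead."""
    return image_count * avg_image_kb / 1000


def implied_avg_kb(total_mb: float, image_count: int) -> float:
    """Average image size in KB implied by a total file size in MB."""
    return total_mb * 1000 / image_count


print(estimate_epub_mb(270, 200))       # 54.0
print(round(implied_avg_kb(170, 270)))  # 630
```

Under these assumptions, 270 images at 200 KB each come to about 54 MB, so the 170 MB file reported for 10 chapters would correspond to roughly 630 KB per image.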

idMysteries commented 2 years ago

@SirGryphin search is ready

idMysteries commented 2 years ago

thanks for making this so quickly.

I was just curious about how the encryption was done here. :smiley_cat:

As a result, I wrote this terrible code:

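def decode_pass(code):
    """Decode an obfuscated JavaScript number expression into a dotted digit string."""
    # Reduce the JavaScript literals to 1 and 0.
    code = code.replace('!+[]', '1').replace('!![]', '1').replace('[]', '0')
    # Strip the leading plus, the plus after each opening parenthesis, and any spaces.
    code = code.lstrip('+').replace('(+', '(').replace(' ', '')
    # This fixed sub-expression stands for the '.' separator.
    code = code.replace('+((1+[+1]+(1+0)[1+1+1]+[1+1]+[+0])+0)[+1]+', '.')
    # Each bracketed group is one digit; separate the groups with spaces.
    code = code.replace(']+[', ' ').replace('[', '').replace(']', '')

    res = ''
    for num_part in code.split('.'):
        for num in num_part.split():
            # The value of a digit is the number of 1s in its group.
            res += str(num.count('1'))
        res += '.'

    return res.strip('.')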
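
To see what the function above does, here is a small round-trip sketch. It assumes the input is a JavaScript expression in the JSFuck style, where each digit is a bracketed sum of `!+[]` terms and one long sub-expression evaluates to `.`. The `DOT` constant, the `encode_digit` helper, and the sample value 12.30 are made up for illustration; `decode_pass` is repeated so the snippet runs on its own.

```python
def decode_pass(code):
    code = code.replace('!+[]', '1').replace('!![]', '1').replace('[]', '0')
    code = code.lstrip('+').replace('(+', '(').replace(' ', '')
    code = code.replace('+((1+[+1]+(1+0)[1+1+1]+[1+1]+[+0])+0)[+1]+', '.')
    code = code.replace(']+[', ' ').replace('[', '').replace(']', '')

    res = ''
    for num_part in code.split('.'):
        for num in num_part.split():
            res += str(num.count('1'))
        res += '.'

    return res.strip('.')


# Hypothetical: the un-simplified form of the '.' sub-expression that
# decode_pass looks for after its first two replacement steps.
DOT = '(+(+!+[]+[+!+[]]+(!![]+[])[!+[]+!+[]+!+[]]+[!+[]+!+[]]+[+[]])+[])[+!+[]]'


def encode_digit(digit):
    """Hypothetical helper: write one digit in the obfuscated notation."""
    if digit == 0:
        return '[+[]]'
    # A digit n > 0 becomes n copies of '!+[]' joined by '+'.
    return '[' + '+'.join(['!+[]'] * digit) + ']'


# Build the obfuscated form of "12.30" and decode it again.
sample = '+'.join([encode_digit(1), encode_digit(2), DOT,
                   encode_digit(3), encode_digit(0)])
print(decode_pass(sample))  # 12.30
```

The decoder never evaluates the JavaScript. It only counts the `1` tokens in each bracketed group, which is why it depends on the exact shape of the `.` sub-expression.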

SirGryphin commented 2 years ago

As long as it works, terrible code is okay lol. The manga I downloaded is around 200 chapters and 2000+ images.

idMysteries commented 2 years ago

I have a lot of work right now, so I will have to pay less attention to this project. If there is no work today, I will make another crawler for the manga website from your previous issue.

SirGryphin commented 2 years ago

@idMysteries no worries, thanks for all the work anyway.

dipu-bd commented 2 years ago

@idMysteries Thanks for the contributions