rharish101 / dilbert-viewer

A simple comic viewer for Dilbert by Scott Adams
https://dilbert-viewer.herokuapp.com
GNU Affero General Public License v3.0
26 stars 0 forks

I think there should be an indexed search for the comics, like lasagna.cz. #14

Open Biggrafis opened 9 months ago

Biggrafis commented 9 months ago

I feel like it would be a very good idea.

rharish101 commented 9 months ago

For that, I would need transcripts for all comics, along with the characters in those comics. Do you know if something like that is available publicly? I can't find it on the archived web pages.

Biggrafis commented 9 months ago

Nope. Maybe you could try contacting the guy behind lasagna.cz and ask him for help?

rharish101 commented 9 months ago

Found this on lasagna.cz's page:

This website doesn't use http://john.ccac.rwth-aachen.de:8000/ftp/dilbert/garfield.txt as its source like other websites. The transcription database lasagna.cz uses was created from scratch.

Creating one from scratch for Dilbert would be a monumental effort, I'm sorry.

However, I did find this: http://john.ccac.rwth-aachen.de:8000/ftp/dilbert/dilbert.txt. Unfortunately, it's missing quite a few dates, and it doesn't say which character said what; it only has the text within the comics. So the best one can do is a plain text search, without the characters.
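
To illustrate what a text-only search could look like, here is a minimal sketch in Python. It assumes the transcript has already been parsed into a mapping from date strings to comic text; the actual layout of dilbert.txt is not assumed here, and the names `tokenize`, `build_index`, `search`, and the `SAMPLE` data are all hypothetical. Dates missing from the transcript simply never show up in the results.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def build_index(transcripts):
    """Map each word to the set of comic dates whose text contains it."""
    index = defaultdict(set)
    for date, text in transcripts.items():
        for word in tokenize(text):
            index[word].add(date)
    return index


def search(index, query):
    """Return the sorted dates of comics containing every word in the query."""
    words = tokenize(query)
    if not words:
        return []
    matches = set(index.get(words[0], set()))
    for word in words[1:]:
        matches &= index.get(word, set())
    return sorted(matches)


# Hypothetical sample data, not taken from the real transcript file.
SAMPLE = {
    "2000-01-01": "I need the budget report by Monday.",
    "2000-01-02": "The budget was cut again.",
    "2000-01-03": "Monday meetings should be illegal.",
}

if __name__ == "__main__":
    index = build_index(SAMPLE)
    print(search(index, "budget"))
    print(search(index, "budget monday"))
```

This is an inverted index with AND semantics: a comic matches only if it contains all the query words. Phrase matching, stemming, and ranking are left out to keep the sketch short.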

I'll mark this as a feature request, but as I don't have much time these days, it might not be completed soon, if ever. I'm open to PRs that implement it!

stephanerosa commented 3 months ago

I don't think a full transcript with characters is needed; a transcript of just the speech bubbles would already allow searching for keywords. This looks like it could help as a quick start:

https://github.com/largecats/comics-ocr