quambene / bogrep

Grep your bookmarks
Apache License 2.0
99 stars 2 forks source link

Spacing errors in cached files #64

Open mattmartini opened 9 months ago

mattmartini commented 9 months ago

If there is a link in the text or formatting like italics spaces are omitted between the linked text and adjacent words.

In the example below, popularCookie Clicker, "Cookie Clicker" is a link, but cached there is no space between it and the preceding word "popular".

In apaperclip maximizer, "paperclip maximizer" is italicized and cached there is no space between it and the preceding word "a".

In Thissoundslike the italicized "sounds" is run together with the words before and after it.

Also, the paragraphs are run together instead of having a newline between them.

The example URL is active so you can see the original document.

$ bogrep paperclips Match in bookmark: https://www.vice.com/en/article/xwgnxq/this-game-about-paperclips-will-make-you-ponder-the-apocalypse Ever since the wildly popularCookie Clicker,idle clicker games have been about hockey stick curves, about exponential growth unleashed by multiplicative advances in productivity. InCookie Clicker,that was employed in service of an absurdist joke about cookies.Universal Paperclips,a new free game from designer Frank Lantz, instead takes this to its darkly literalistic conclusion.It's a clicker game where you play as apaperclip maximizer,an AI that, once tasked with making paperclips, proceeds to turn the entire universe into paperclips.Thissoundslike a premise arrived at specifically to spoof clicker games, but it harkens back to a thought experiment proposed by Nick Bostrom, an Oxford philosophy professor, in a2003 paper:The risks in developing superintelligence include the risk of failure to give it the supergoal of philanthropy. […] Another way for it to happen is that a well-meaning team of programmers make a big mistake in designing its goal system. This could resul

quambene commented 9 months ago

Partly related: https://github.com/quambene/bogrep/issues/16. But the present issue is much more complete.