dteviot / WebToEpub

A simple Chrome (and Firefox) Extension that converts Web Novels (and other web pages) into an EPUB.

Paragraph/word deletion when the symbols '<>' are used #278

Closed DicerX closed 5 years ago

DicerX commented 5 years ago

I came across some VRMMO novels that take liberties with "< system >" and use "<>" brackets whenever the author wants to advance the plot. The problem is that, when compiling, WebToEpub simply drops the word or paragraph enclosed between the brackets.

Reference: Novel (Martial gamer; Chapter 2)

Exact quote:

The four of them were in a gaming studio which was dedicated to farming gold in various games and were currently operating in a game called << REBIRTH>>. <> was the latest game that Mu Zi Xian's company had released just 3 days ago that was exploding in popularity.

Paragraph with WebtoEpub:

The four of them were in a gaming studio which was dedicated to farming gold in various games and were currently operating in a game called <>. <> was the latest game that Mu Zi Xian's company had released just 3 days ago that was exploding in popularity.

dteviot commented 5 years ago

@DicerX I need URL of site (or chapter) please.

dteviot commented 5 years ago

@DicerX I've tried the following URLs

Note: examining the raw HTML shows that the problem is that the angle brackets in the second case are not properly escaped.
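To illustrate the escaping point, here is a minimal sketch using only Python's standard library. It is not WebToEpub's code; the `TextCollector` class and `visible_text` helper are hypothetical names introduced for this example. It shows that an HTML parser treats an unescaped `<REBIRTH>` as a tag, so the word disappears from the visible text, while the escaped form survives.

```python
from html import escape
from html.parser import HTMLParser


class TextCollector(HTMLParser):
    """Hypothetical helper: collects only the text a reader would see."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.chunks = []

    def handle_data(self, data):
        # Called for text nodes only; anything parsed as a tag never arrives here.
        self.chunks.append(data)


def visible_text(markup):
    """Return the text content of an HTML fragment, ignoring all tags."""
    parser = TextCollector()
    parser.feed(markup)
    parser.close()  # flush any text the parser is still buffering
    return "".join(parser.chunks)


raw = "a game called <<REBIRTH>>."

# Unescaped: "<REBIRTH>" is parsed as an (unknown) start tag, so the word is lost.
unescaped_text = visible_text(raw)

# Escaped: "&lt;&lt;REBIRTH&gt;&gt;" is plain text and decodes back to the original.
escaped_text = visible_text(escape(raw))

print(repr(unescaped_text))
print(repr(escaped_text))
```

The design point is that the loss happens at parse time: once a parser has decided that `<REBIRTH>` is an element, no later step can recover the word, so the escaping has to be present in the source markup.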

To make any more progress, I need the URL for the source.

DicerX commented 5 years ago

Sorry for the delay. I just came back to recompiling this EPUB, and it turns out that this is not a problem on your side at all.

I use Sigil for heavy-duty editing of my EPUBs, and it is the cause of the paragraph/sentence deletion: it immediately tries to "correct" these paragraphs by adding

</REBIRTH> to <<REBIRTH>>

and so on. This destroyed half of the novel. Sorry for bothering you with this, mate.
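Since an editor that repairs markup will treat an unescaped `<REBIRTH>` as an opening tag, one possible workaround is to escape such pseudo-tags in the chapter text before opening the file for editing. The following is a rough sketch under stated assumptions, not a feature of Sigil or WebToEpub: `escape_pseudo_tags` and `KNOWN_TAGS` are hypothetical names, the allowlist is deliberately tiny, and the regular expression does not handle attribute values that contain `>`.

```python
import re
from html import escape

# Assumption: a small allowlist of real tags expected in the chapter body.
KNOWN_TAGS = {"p", "br", "i", "b", "em", "strong", "span", "div", "h1", "h2"}

# Matches either something shaped like a tag, or a lone angle bracket.
TOKEN = re.compile(r"(</?([A-Za-z][A-Za-z0-9]*)\b[^<>]*>)|([<>])")


def escape_pseudo_tags(markup, known_tags=KNOWN_TAGS):
    """Escape angle brackets that do not belong to an allowlisted tag."""

    def fix(match):
        tag_name = match.group(2)
        if tag_name is not None and tag_name.lower() in known_tags:
            return match.group(0)  # genuine markup: keep unchanged
        # Unknown "tag" or stray bracket: turn it into literal text.
        return escape(match.group(0), quote=False)

    return TOKEN.sub(fix, markup)


fixed = escape_pseudo_tags("<p>a game called <<REBIRTH>>.</p>")
print(fixed)
```

Ampersands are left untouched so that entities already present in the markup are not double-escaped; text with bare `&` characters would need separate handling.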