Open enb34 opened 6 years ago
I tried scrolling through the Japanese text to see examples of furigana--since if they were wrapped in parenthesis or something similar you could maybe use that and regex to remove them--but I couldn't find an example. Is there any pseudo-markup like () for the furigana?
This brought up another question for me. Are there any instances/are you taking into considering any times where katakana are used stylistically?
The site's looking awesome! What kind of things are you going to put into the text once it's on the site? Will you have links back-and-forth between specific chapters of the Alice translations, hover-over sections that discuss the context of the translation (or lack thereof), etc?
@brucknerp Unfortunately there isn't really any markup that I've found around the furigana--my only clue to go on has been that it immediately follows the kanji it goes over (ex. 鍵かぎ). My strategy thus far is reading carefully and comparing the original text with my XML copy and hoping I don't skip anything. (If you're interested, the HTML used to denote this in the original was
@Idi0teque Thank you! Dana and I are still determining exactly how the text is going to look up on the site, but one of our main goals is to be able to view both texts side-by-side in a way that makes it easy to see the English and Japanese at the same time. The Japanese is fairly consistent with the English in terms of structural markup, so we're hoping that this will make it easier for us. We aren't certain about how the marked-up content will come into play yet, but we'll be figuring that out in the next week or two and let you know.
Our project goals remain very similar to what they were before break, but our group is continuing to make progress. We've finished marking up the English text (excluding a repetition tag that was added this evening) and I am currently in the process of marking up the Japanese text. One problem I ran into is the use of furigana within the text, which I didn't catch when initially marking up structural elements (it's used rarely in this particular text). If anyone has advice on how to remove this without manually finding and deleting all of it, I would appreciate your input. Dana is continuing to update our HTML wireframe and web pages, and we have placeholders up for all our pages without content. She's currently creating a new format for the menu bar.