Closed nicholasjhorton closed 3 months ago
@nicholasjhorton The final steps for LDA analysis are just reviewing the comments and making some final adjustments, which I hope to have done over the weekend.
That sounds great.
In the interim, I look forward to having a chance to see some of this in class today.
I had a chance to review your draft report. I really enjoyed it and think that it is shaping up well.
I suspect that your report would be useful for two audiences: those seeking to learn more about Frankenstein and those looking for an annotated example of text analytics.
For the first audience, you'll want to include more of the text, some sample paragraphs, and examples to complement Figures 1 and 2.
For the latter, you'll want to describe how things look at each stage of your wrangling, e.g., the tokenized text (for a sample paragraph), some beta's and gamma's for sample paragraphs, etc. Note that this will also be valuable for the first type of reader.
Is there a better way to order the 12 topics? Can you think of a chronological progression?
Here are some more specific comments:
as.numeric()
always worries me, given how brittle it is. Would readr::parse_number()
be appropriate to replace here?@nicholasjhorton I worked on this a bit more and covered most of the comments you left. Does my figure 2 visual look better? I tried to make it more easy to view where each topic is located in the novel.
Still to complete: more examples for beta & gamma, talk about shiny app, chronologically order topics
Commit: 1d8bb1a53a89ba0d1a7b753194edc1bfdb89e933 pdf: https://github.com/STAT325-S24/Frankenstein/blob/main/vignette/LDA.pdf
@jpapagelis24 are you all set with your proposed analyses? I see that the only open issue related to your book project is #9.
It would be helpful today to start to flesh out what tasks you are planning to complete over the next week as you finish up your work. As always, please don't hesitate to reach out with questions.