Closed nicholasjhorton closed 7 months ago
This feels like an important next step. Can it happen in parallel with the footnote processing? Let's talk.
We need to figure out what we want to keep when addressing the page breaks. We could perhaps keep the page numbers because they are referenced in other places. Perhaps keep the page number along with the first word or line. Some questions to think of:
We created a table that extracts the page number, the page header (the text next to the page number), and the first line of each page. Our next step is to remove the extra spaces that make up the page break.
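For readers following along, here is a minimal sketch of what such a page table and the depagination step might look like. It is written in Python purely for illustration and is not the code from the commits in this thread. The page-break layout (blank lines, then a line with the page number and header), the names `PAGE_BREAK`, `build_page_table`, and `depaginate`, and the sample text are all assumptions.

```python
import re

# Hypothetical page-break layout (an assumption, not taken from the repository):
# three or more newlines, then a line holding the page number and the running
# header, then one or more newlines before the page's text resumes.
PAGE_BREAK = re.compile(r"\n{3,}(\d+)[ \t]+([^\n]+)\n+")


def build_page_table(text):
    """Return one row per page break: page number, header, first line of the page."""
    rows = []
    for m in PAGE_BREAK.finditer(text):
        # The first line of the new page starts right after the matched break.
        first_line = text[m.end():].split("\n", 1)[0].strip()
        rows.append(
            {
                "page": int(m.group(1)),
                "header": m.group(2).strip(),
                "first_line": first_line,
            }
        )
    return rows


def depaginate(text):
    """Collapse each page break (blank lines plus header line) into one newline."""
    return PAGE_BREAK.sub("\n", text)


# Made-up sample text, used only to exercise the functions above.
sample = (
    "end of page eleven.\n\n\n"
    "12 HISTORY OF AMHERST COLLEGE\n\n"
    "The first line of page twelve\ncontinues here.\n"
)

page_table = build_page_table(sample)
clean_text = depaginate(sample)
```

Building the table before depaginating keeps the page numbers available for cross-references even after the breaks are stripped from the text.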
Thanks for your work on this front. Where can we find the table?
@nicholasjhorton here is the table we were working on last class: d6ff211
Created new folders that contain the page tables (page_tables) and the depaginated texts for each chapter (data-raw-depaginate). This issue is complete unless there are any issues with the tables or text.
It's great to see how this is coming together: nice work!
If the code isn't already being organized in a Quarto file (with associated pdf), can you please move it there so that we can track the workflow? (I've already fixed some things that will need to be run through one more time.)
Same thing will be needed for the code that generates the page subtitles.
Any updates on this front? It would be great to have a pdf that lists the subtitles as a side effect of processing the workflow for this issue. (This would address open issue #11.)
And #23!
@nicholasjhorton
Completed this with commit: df01cc7e289a5a222c31b2fa2f87ea330834f993
See the pdf with the subtitles here: https://github.com/STAT325-S24/HistoryAmherstCollege/blob/main/data-raw/subtitles.pdf
Closing the issue.
Nick and Justin will work through changing the working directory of the qmd file
This is looking good, but I'm only seeing subtitles.pdf, not subtitles.qmd. Any guidance welcomed.
@nicholasjhorton I did most of the things we talked about in this commit: 79bdf042aa012d18addcd9383cbce23e754d2f83
Some notes:
Thanks for your work on this front.
Is the appendix change needed (since I renamed it to chapter29.txt)?
@jpapagelis24 @tknightly24 might you be willing to update this issue with the current status of this work? Thanks in advance, Nick
@nicholasjhorton This is complete. Closing the issue.
This issue will be closed when page breaks are addressed in the wrangling process. This is needed before #2 since some pages may break with a hyphen!
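To illustrate why the ordering matters, here is a small hedged sketch in Python. It assumes that #2 concerns rejoining words hyphenated across line ends, and that a page break looks like blank lines followed by a "page number plus header" line. The patterns, the function `join_hyphens`, and the sample text are hypothetical and are not taken from the repository.

```python
import re

# Assumed page-break layout (hypothetical): blank lines, a line with
# "<page number> <header>", then blank lines.
PAGE_BREAK = re.compile(r"\n{3,}\d+[ \t]+[^\n]+\n+")

# A word fragment ending in a hyphen at the end of a line, continued by a
# lowercase letter at the start of the very next line.
LINE_HYPHEN = re.compile(r"(\w)-\n([a-z])")


def join_hyphens(text):
    """Rejoin words split by an end-of-line hyphen (naive: also merges true compounds)."""
    return LINE_HYPHEN.sub(r"\1\2", text)


# Made-up sample: a word split by a hyphen right at a page break.
sample = "the estab-\n\n\n12 SAMPLE HEADER\n\nlishment of the college\n"

# Joining first leaves the word broken, because the page break sits between its halves.
joined_too_early = join_hyphens(sample)

# Removing the page break first lets the hyphen rule see both halves.
joined_after_depagination = join_hyphens(PAGE_BREAK.sub("\n", sample))
```

In this sketch the first call leaves the sample unchanged, while the second yields "the establishment of the college". A real pipeline would need a more careful rule for genuine hyphenated compounds.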