programminghistorian / ph-submissions

The repository and website hosting the peer review process for new Programming Historian lessons
http://programminghistorian.github.io/ph-submissions
139 stars 114 forks source link

Linear and Logistic Regression #436

Closed drjwbaker closed 2 years ago

drjwbaker commented 3 years ago

The Programming Historian has received the following tutorial on 'Linear and Logistic Regression' by @mjlavin80. This lesson is now under review and can be read at:

http://programminghistorian.github.io/ph-submissions/en/drafts/originals/linear-regression

http://programminghistorian.github.io/ph-submissions/en/drafts/originals/logistic-regression

Please feel free to use the line numbers provided on the preview if that helps with anchoring your comments, although you can structure your review as you see fit.

I will act as editor for the review process. My role is to solicit two reviews from the community and to manage the discussions, which should be held here on this forum. @svmelton and the English team have already read through the lesson and provided feedback, to which the author has responded.

Members of the wider community are also invited to offer constructive feedback which should post to this message thread, but they are asked to first read our Reviewer Guidelines (http://programminghistorian.org/reviewer-guidelines) and to adhere to our anti-harassment policy (below). We ask that all reviews stop after the second formal review has been submitted so that the author can focus on any revisions. I will make an announcement on this thread when that has occurred.

I will endeavor to keep the conversation open here on Github. If anyone feels the need to discuss anything privately, you are welcome to email me.

Our dedicated Ombudsperson is (Ian Milligan - http://programminghistorian.org/en/project-team). Please feel free to contact him at any time if you have concerns that you would like addressed by an impartial observer. Contacting the ombudsperson will have no impact on the outcome of any peer review.

Anti-Harassment Policy

This is a statement of the Programming Historian's principles and sets expectations for the tone and style of all correspondence between reviewers, authors, editors, and contributors to our public forums.

The Programming Historian is dedicated to providing an open scholarly environment that offers community participants the freedom to thoroughly scrutinize ideas, to ask questions, make suggestions, or to requests for clarification, but also provides a harassment-free space for all contributors to the project, regardless of gender, gender identity and expression, sexual orientation, disability, physical appearance, body size, race, age or religion, or technical experience. We do not tolerate harassment or ad hominem attacks of community participants in any form. Participants violating these rules may be expelled from the community at the discretion of the editorial board. Thank you for helping us to create a safe space.

Permission to Publish

@mjlavin80: please could you post the following statement to the Submission ticket.

I the author|translator hereby grant a non-exclusive license to ProgHist Ltd to allow The Programming Historian English|en français|en español to publish the tutorial in this ticket (including abstract, tables, figures, data, and supplemental material) under a CC-BY license.

anisa-hawes commented 2 years ago

Thank you, @svmelton! I'll work on the copyediting of these two lessons this week. I'll aim to complete by e.o.d Friday 22nd.

You didn't miss it, @drjwbaker. The revised editorial guidelines (that integrate copyediting as one of the final phases of our workflow) remain in draft, so aren't on the live site yet. They are currently being tested by the newest members of our EN team, to help ensure the guidance is clear and covers all that is needed.

anisa-hawes commented 2 years ago

Hello @mjlavin80,

I hope you're well.

I have applied copyedits to your lesson, which you can review in the commit History:

linear-regression.md edits here logistic-regression.md edits here

I've also made a list of the edits. This list includes some queries, which are marked with a Q.

linear-regression

l.74

l.75

l.76

l.81

l.85

l.87

l89.

l.91

l.95-115, and throughout these two lessons, I have replaced bold (formatted with double underscore) with bold (formatted with double asterisk) for consistency across our journals

l.130-334

l.116

l.120

l.125

l.131

l.133

l.141

l.143

l.145

l.150

l.152

l.159

l.165

l.172

l.184

l.225 and 227

l.269

l.271

l.273

l.286

l.295

l.332

l.346

l.372

l.417

l.496

l.517

l.519

l.524

l.526

l.532

l.559

l.565

l.643

l.645

l.681

logistic-regression

l.74

l.74 -in this case scaled

l.76

l.76

l.84

l.86

l.92

l.94

l.96

l.102

l.108

l.139

l.170

l.233

l.240

l.242

l. 252

l.256

l.263

l.279

l.285

l.287

l.289

l.291

l. 293

l.315

l.319

l.338

l.394

l.413

l.425

l.447

l.474

l.480

l.500

l.538

l.576

l.617

l.625

l.629

l.631

l.637

l.639

l.693

l.697

l.703

l.705

l.715

Please let me know if you'd like to chat about any of these suggestions. Feel free to adjust my edits if there's something I've misunderstood, or if anything simply doesn't feel right to you.

I've prepped the Perma.cc links for both lessons, so they are ready to slot in as soon as these copyedits are approved.

Very best, and thank you for your patience. Anisa

--

Hello @drjwbaker,

Apologies for the delay. I hoped to complete this work on Friday, but it took me longer than I expected to work through both lessons.

I've made some minor adjustments to the YAML headers in both files:

Do you know what might be causing this? If not, Jenn may be able to help us with this (I think it is a 'table wrapping' issue).

That's all, I think. Thank you for your patience here.

A.

drjwbaker commented 2 years ago

@mjlavin80 : I see a small number of questions here. Do you have capacity to pick these up soon?

drjwbaker commented 2 years ago

I've made some minor adjustments to the YAML headers in both files:

Thanks @anisa-hawes!

I've sentence capped the lesson titles

Okay.

Added hyphen and new line following editors:

Okay.

Replaced mathjax: yes with mathjax: true (Or do these two do the same job?)

Don't know. I borrowed the sequence syntax from @acrymble's python articles. Hopefully this will work here?

I have a question about the field output: pdf_document. I’m not familiar with this one?

Me neither. I tend to just grab a recent header syntax and edit it. Feel free to add/remove any syntax you think we don't need (as you see these way more than I do!)

One other technical note, is that I notice one of the tables in logistic-regression isn't displaying correctly in the Preview – all columns from and including “Correct” are truncated. It is the second table at Step 9: Model Validation or l.338-344 of the Markdown file.

Hmm. No idea why this is happening I'm afraid.

anisa-hawes commented 2 years ago

Hello @jenniferisasi and @ZoeLeBlanc, Do you remember the table-wrapper fix for wide tables we applied to css/style.css in PR 2504? It seems like that functionality isn't quite working as it should do in the lesson logistic-regression.

This is a wide table, so our hope would be for it to be responsive/scroll-able, but in the Preview – all columns from and including the column headed “Correct” are truncated.

It is the second table at Step 9: Model Validation in the Preview, or at lines 338-344 of the Markdown file here. Might you have time to take a look? With many thanks, Anisa

mjlavin80 commented 2 years ago

@drjwbaker I think I can get to these by Monday or Tuesday. It's the last week classes here, so I'm a bit bogged down.

drjwbaker commented 2 years ago

Thanks so much @mjlavin80.

mjlavin80 commented 2 years ago

@drjwbaker @anisa-hawes I believe have completed responses to all of your questions. Should I push changes myself or just reply to this thread?

anisa-hawes commented 2 years ago

Many thanks, @mjlavin80!

Whichever is simplest for you!

Let me know if you'd like to discuss any of the edits or questions. I'd be pleased to connect and talk things through.

Very best, Anisa

mjlavin80 commented 2 years ago

Thanks @anisa-hawes I just pushed a round of changes. To summarize:

l.91

l.225 and 227

l.295

l.346

l.417

l.517

l.519

l.645

l.681


Logistic

l.74

l.76

l.84

l.86

l.108

l.291

l.447

l.697

l.703

anisa-hawes commented 2 years ago

Excellent. Huge thanks for your clarifications and replies to my queries here @mjlavin80 – I really appreciate your time.

My next step will be to replace external links in the lessons with archival perma.cc links (except in cases where the link directs to an action on the live web, e.g., install software).

I will also follow up with my colleagues to resolve an issue with the second table at Step 9: Model Validation or l.338-344 of the Markdown file, which I noticed isn't displaying correctly in the Preview.

drjwbaker commented 2 years ago

Great job all. @anisa-hawes: let me know if/when you need further input from me.

anisa-hawes commented 2 years ago

Just a note to say that I applied the perma.cc links to both lessons this afternoon.

I've also taken a look at css/style.css to double check the snippet which makes tables responsive on our site:

.table-wrapper {
  overflow-x: scroll;
}

It looks right... but I've noticed a wide table in another lesson on Submissions which is also truncated in this way, so something isn't quite as it should be. I'm following up with colleagues and will return with an update asap.

drjwbaker commented 2 years ago

@anisa-hawes Is this now just stuck with the table problem?

anisa-hawes commented 2 years ago

Hello @drjwbaker. Yes, that is the final thing to resolve. Spoke with Zoe again on Friday, and she assured me that this is on her list.

anisa-hawes commented 2 years ago

Dear @drjwbaker and @mjlavin80. Thank you for your extended patience. Good news! The table-wrapper issue is fixed!.

Checking the Preview here, (specifically the 2nd table at Step 9 Model Validation), I can confirm that the scroll action is working as it should. Thank you, @ZoeLeBlanc!

Hello @svmelton This pair of lessons are now ready for your final checks ahead of publication.

Key info via James's previous comment:

Linear Regression analysis with scikit-learn

Logistic Regression analysis with scikit-learn

  name: Matthew J. Lavin
  team: false
  orcid: 0000-0003-3867-9138
  bio:
      en: |
          Matthew J. Lavin is an Assistant Professor of Data Analytics specializing in Humanities Analytics at Denison University. His scholarship focuses on book history, cultural analytics, and turn-of-the-twentieth-century U.S. literature and culture.
drjwbaker commented 2 years ago

Woo! Apologies for the slow reply (covid wiped out last week for me). Anything else I need to do here @anisa-hawes?

anisa-hawes commented 2 years ago

Hello @drjwbaker and @mjlavin80. We are almost there! A few final things:

Sarah will take care of things from there, including requesting the DOI and slotting that in.

Something to think about for after publication is preparing a couple of posts for our Twitter Bot. We'll Tweet directly after we publish, but the Bot will help us to publicise the lessons in the future.

svmelton commented 2 years ago

Thanks @anisa-hawes! I've requested the DOI and will move forward with publication this week.

anisa-hawes commented 2 years ago

Hello @svmelton. A quick note to say that Matthew's bio has now been updated as part of PR #2611 so you don't need to do this when it comes to your final steps of the workflow.

drjwbaker commented 2 years ago

Looks like these two articles we be publishes later this week. Huge thanks to @rcmapp and @thomjur for their thoughtful, constructive, and detailed reviews, to @mjlavin80 for two (as it turned out!) superb articles, and on the PH side to both @anisa-hawes & @svmelton for supporting the publication process. I'm really looking forward to seeing these articles appear!

rcmapp commented 2 years ago

Really excited to hear this! I'm happy to have contributed and will look forward to doing so again, whenever needed.

Best, Rennie

Rennie Mapp, PhD, MSMITProject manager for

strategic DH initiativesOffice of the VP for IT University of Virginia(717)713-1204

@. @.>*

On Wed, Jul 6, 2022 at 9:59 AM James Baker @.***> wrote:

Looks like these two articles we be publishes later this week. Huge thanks to @rcmapp https://github.com/rcmapp and @thomjur https://github.com/thomjur for their thoughtful, constructive, and detailed reviews, to @mjlavin80 https://github.com/mjlavin80 for two (as it turned out!) superb articles, and on the PH side to both @anisa-hawes https://github.com/anisa-hawes & @svmelton https://github.com/svmelton for supporting the publication process. I'm really looking forward to seeing these articles appear!

— Reply to this email directly, view it on GitHub https://github.com/programminghistorian/ph-submissions/issues/436#issuecomment-1176257839, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADA4MWXSETNI35XR66VREN3VSWGMDANCNFSM5I2JEZ6Q . You are receiving this because you were mentioned.Message ID: @.***>

drjwbaker commented 2 years ago

This feels stuck. Is there anything I can do to help?

anisa-hawes commented 2 years ago

Thank you, @drjwbaker. Sarah has prepared the PR to publish these lessons, but there is a problem with Jekyll's Build Checks, which I'm unable to fix without technical advice. I will update you here/in the PR as soon as a solution is identified.

A minor detail, but I did note that the two thumbnail images chosen to represent these lessons are colour rather than greyscale. For consistency, I think you could make them greyscale. That is not causing the hold up, but it is something to do in the meantime!

Thank you for your patience.

drjwbaker commented 2 years ago

A minor detail, but I did note that the two thumbnail images chosen to represent these lessons are colour rather than greyscale. For consistency, I think you could make them greyscale. That is not causing the hold up, but it is something to do in the meantime!

Any idea how I do that?

anisa-hawes commented 2 years ago

This is done. I have uploaded greyscale versions of both images to Sarah's branch. The Previews are currently available here and here, if you want to double check you are happy/see if you want to adjust the contrast slightly? (I think they look good).

svmelton commented 2 years ago

Hi all—happy to announce that this lesson has now been published. Congrats, @mjlavin80, and thanks @drjwbaker for your work editing this piece.

drjwbaker commented 2 years ago

We got there. Amazing. Thanks all.

rcmapp commented 2 years ago

Fabulous! So glad, and really appreciative of everyone’s hard work. 🥳

On Fri, Jul 29, 2022 at 9:33 AM Sarah Melton @.***> wrote:

Hi all—happy to announce that this lesson has now been published. Congrats, @mjlavin80 https://github.com/mjlavin80, and thanks @drjwbaker https://github.com/drjwbaker for your work editing this piece.

— Reply to this email directly, view it on GitHub https://github.com/programminghistorian/ph-submissions/issues/436#issuecomment-1199291685, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADA4MWR7654JIKN5SGOLITLVWPMSPANCNFSM5I2JEZ6Q . You are receiving this because you were mentioned.Message ID: @.***>

--

Rennie Mapp, PhD, MSMITProject manager for

strategic DH initiativesOffice of the VP for IT University of Virginia(717)713-1204

@. @.>*

thomjur commented 2 years ago

I am happy to see these two lessons out there, looking forward to using them in one of my classes! Great work everyone! <3