Closed Devadeut closed 1 year ago
Hey,
here is my review of your report:
I don’t have time to structure my critique, but I will first focus on negative points and finish with the positive. The title is ‘Logistic Regression‘, which gives the impression that it is the main focus of your work, but this is just one out of three accomplishments. Therefore, the title might be a bit misleading. Your position, ‘Sophomore CSE‘, is centered but should be below your name. I think this is a problem with LaTeX formatting. I am also not sure how relevant the term ‘sophomore‘ is for this part. In the introduction, you claim that you will explain 'why I chose them,' referring to the analyses you used, but you didn’t do it in the text. There is inconsistent formatting in the 'Embeddings' part of your report. It seems that you forgot to edit the title that is displayed on non-title pages - your report just uses ‘Running Title‘ and ‘Lastname‘ for this purpose. The legend of the second plot is, for some reason, in the middle of the plot.
What I like about your work: all parts are of the same size. The report is, let's say, laconic and easy to read and navigate. Also, the first two plots are nice. Linear regression actually represents that blue and red points are different. All figures are numbered. The last one is three-dimensional. It looks cool, and different shades of blue are used.
and here is ChatGPT's:
The writing is generally clear, but it could be more detailed. For example, in the Introduction, you mention that you will describe each of the analyses you performed, but the descriptions in the following sections are quite brief.
There are a few grammatical errors and awkward phrases, such as "Here I will briefly describe each of the analyses I performed, why I chose them, and the math equation for the linear regression, logistic regression, and embedding I used." This sentence could be rephrased for clarity, perhaps something like: "In this report, I will provide a brief description of each analysis I performed, explain why I chose these particular methods, and present the mathematical equations used in the linear regression, logistic regression, and embedding techniques."
The conclusion is very brief and could be expanded to provide a more detailed summary of what you learned and the results of your analyses.
Without seeing the actual visualizations, it's hard to critique them. However, based on your descriptions, it seems like you've used appropriate methods for visualizing your data.
Make sure your visualizations are clearly labeled and include a legend where necessary. This will make it easier for readers to understand what they're looking at.
The content is generally good, but it could be more detailed. For example, in the section on embeddings, you mention that you're not sure how you did the embedding. It would be beneficial to understand this process more fully and explain it in more detail in your report.
The report would benefit from a more detailed discussion of the results of your analyses. For example, how well did the linear and logistic regression models perform? What insights were you able to gain from the embeddings?
The report could benefit from more detailed explanations of your methods and results. This would make it more informative and valuable to readers.
Including the actual visualizations in the report would also be a significant improvement. This would allow readers to see the results of your analyses for themselves.
Finally, proofreading the report carefully before submission would help catch grammatical errors and improve the overall quality of the writing.
I hope these critiques are helpful! Let me know if you have any questions or if there's anything else you'd like me to review.
Doing
[x] Clean Data Thinking Zulip chat data, located at https://github.com/onefact/datathinking.org-codespace/blob/main/data/datathinking.zulipchat.com/raw/messages-000001.json - put it in a
polars
dataframe and compute summary statistics of the dataset[x] Analyze this Zulip chat data using logistic regression, linear regression, and embeddings with the tools we have learned in the lectures (don't forget to ask ChatGPT, Claude, Lex, GPT-4 for help as much as you need, and ask for help on the Data Thinking Zulip chat :)
[x] Create a visualization of logistic regression of the Data Thinking Zulip chat data
[x] Create a visualization of linear regression applied to the Data Thinking Zulip chat data
[x] Create a visualization of embeddings using the Data Thinking Zulip Chat data
[x] Make a copy of the Overleaf template: https://www.overleaf.com/read/ghpyzqwqwxpv (need to create an account and/or sign in if this is your first time using Overleaf). To make a copy, open the project after signing in using this link, and click on
Menu
, thenCopy Project
:[x] In Overleaf, edit the template and figure out how to include a PDF figure in the report, alongside a brief description (a few sentences or paragraphs is fine!) of each of the analyses you performed, why you chose them, and the math equation for the linear regression, logistic regression, and embedding you used.
[x] Add the PDF of the report to this issue as a comment.
[x] Send a message on Zulip with a link to this comment, alongside the image representing your favorite visualization
Reviewing
Reading
json
format with chatgpt] https://genmon.github.io/braggoscope/about & https://news.ycombinator.com/item?id=35073603The Boy Whose Light Went Out
by Jack Clark http://techpolicylab.uw.edu/wp-content/uploads/2022/04/Telling_Stories_Pages_4-4-22.pdfWatching
(message Jaan if you need a VPN or these links don't work)