geanders / csu_msmb

1 stars 11 forks source link

Vocab 11 Pull Request #62

Closed camronp closed 4 years ago

camronp commented 4 years ago

Here is the vocab 11. I'm not sure if the "2020-04-13-vocabulary-for-chapter-11.Rmd" is in the proper folder or not. Let me know if you have any issues! There was also a lot of things the commited that I'm not entirely sure where they came from... but hopefully it makes sense to you!

geanders commented 4 years ago

@camronp : Really nice start on this! I just had a few suggestions on some edits, all pretty minor. If you make these changes and push them to your own GitHub repo, then they should come through on the Pull Request as long as we have it open (send me an email when you push just in case so I can make sure it came through).

camronp commented 4 years ago

Hi Brooke,

Thank you for the helpful feedback! I still had issues redefining and understanding the spatial point process and Poisson process, but I tried to rework the definitions to make them more specific. I went through with the rest of you suggestions and changed the necessary definitions. Everything was pushed to my own repository, so hopefully it all makes it into the opened pull request.

Let me know if I need to change anything else today! Thanks, Camron

On Mon, Apr 27, 2020 at 6:57 PM Brooke Anderson notifications@github.com wrote:

@camronp https://github.com/camronp : Really nice start on this! I just had a few suggestions on some edits, all pretty minor. If you make these changes and push them to your own GitHub repo, then they should come through on the Pull Request as long as we have it open (send me an email when you push just in case so I can make sure it came through).

  • For "slots", I recommend changing the term to "slot", so it will agree with the definition in terms of plural / singular, and adding to the beginning "In the context of object-oriented programming in R,"
  • For "classification", I think they might mean that as a process, rather than as the resulting set. I suggest changing to something like "the process of grouping observations in a dataset by their similarities in terms of measured characteristics"
  • I think you're on the right track with "feature extraction", in terms of capturing why we often do it. However, I think the definition is missing a bit of the heart of the concept. In particular, it would be helpful to include the idea that this is a way of creating new measurements (i.e., new columns in a dataset) from the data you're given (or have measured). I think Wikipedia has some text that would be helpful in this definition (https://en.wikipedia.org/wiki/Feature_extraction). Maybe something like (using some of the language in Wikipedia) "the process of building derived values to describe observations in a dataset from the initial set of measured data, with the aim of creating a set of characteristics that is informative and non-redudant"? In your current definition, the idea of using this to reduce the required resources (in terms of memory storage or computational power, I guess) is often a nice side benefit, but the main goal is to create a "new" set of measurements for the observations that is derived from the original ones but in some way more helpful.
  • For "Poisson process", I recommend making "process" lowercase
  • For "spatial point process" and "Poisson process", I think that the first is a specific type of the second, so it would be nice for the definitions to make that a bit clearer. Let's see if @baileyfosdick https://github.com/baileyfosdick has any suggestion on those two definitions and how we could make connections between the two clearer.
  • For "Ripley's K function", I recommend changing "and can help" to "that can help"
  • For "linear filter", I recommend adding at the beginning "A tool for"
  • For "binary images", could you either change the term to "binary image" or change the definition to start "images" instead of "an image" so that the term and definition agree on singular / plural?
  • I'm not sure that the definition for "morphological operations" has completely captured that idea. mathworks.com has some language that could be used in edits for this definition---maybe something like "image processing operations in which each pixel in the image is adjusted based on other pixels in its neighborhood"? (if you use that, be sure to add mathworks.com to the works cited section)

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/geanders/csu_msmb/pull/62#issuecomment-620313587, or unsubscribe https://github.com/notifications/unsubscribe-auth/AKTCICJEG2RO3TYS2D3S2ODROYSWBANCNFSM4MRP3SFQ .

baileyfosdick commented 4 years ago

@camronp Here are some suggestions for definitions for Poisson process and spatial point process -- Poisson process = mechanism that generates instantaneous events (in time and/or space) based on the Poisson distribution; spatial point process = mechanism that generates a random collection of coordinates or points randomly located along an underlying mathematical space. There is at most one point observed at any location.

geanders commented 4 years ago

@camronp : Great, the post is now up! Could you go back to Quizlet and update that with the latest version of the terms? You can copy the latest tsv from here: https://raw.githubusercontent.com/geanders/csu_msmb/master/content/post/vocab_lists/chapter_11.tsv

camronp commented 4 years ago

Of course! Just did it. I shouldn't need to change the embedding code right? Thanks! Camron

On Wed, Apr 29, 2020 at 1:06 PM Brooke Anderson notifications@github.com wrote:

@camronp https://github.com/camronp : Great, the post is now up! Could you go back to Quizlet and update that with the latest version of the terms? You can copy the latest tsv from here: https://raw.githubusercontent.com/geanders/csu_msmb/master/content/post/vocab_lists/chapter_11.tsv

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/geanders/csu_msmb/pull/62#issuecomment-621404708, or unsubscribe https://github.com/notifications/unsubscribe-auth/AKTCICKBLK6P47UH54CRS7DRPB3D5ANCNFSM4MRP3SFQ .

geanders commented 4 years ago

That’s right. When you embed, it’s just pointing to what’s posted somewhere else.


From: camronp notifications@github.com Sent: Wednesday, April 29, 2020 1:30 PM To: geanders/csu_msmb csu_msmb@noreply.github.com Cc: Anderson,Brooke Brooke.Anderson@colostate.edu; Comment comment@noreply.github.com Subject: Re: [geanders/csu_msmb] Vocab 11 Pull Request (#62)

Of course! Just did it. I shouldn't need to change the embedding code right? Thanks! Camron

On Wed, Apr 29, 2020 at 1:06 PM Brooke Anderson notifications@github.com wrote:

@camronp https://github.com/camronp : Great, the post is now up! Could you go back to Quizlet and update that with the latest version of the terms? You can copy the latest tsv from here: https://raw.githubusercontent.com/geanders/csu_msmb/master/content/post/vocab_lists/chapter_11.tsv

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/geanders/csu_msmb/pull/62#issuecomment-621404708, or unsubscribe https://github.com/notifications/unsubscribe-auth/AKTCICKBLK6P47UH54CRS7DRPB3D5ANCNFSM4MRP3SFQ .

— You are receiving this because you commented. Reply to this email directly, view it on GitHubhttps://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fgeanders%2Fcsu_msmb%2Fpull%2F62%23issuecomment-621412520&data=02%7C01%7Cbrooke.anderson%40colostate.edu%7C1376e93be26148177f8008d7ec73d095%7Cafb58802ff7a4bb1ab21367ff2ecfc8b%7C0%7C0%7C637237854472098818&sdata=2nybYUg9Juy%2BGQb%2F%2FQncKN0ipwSZx%2F2wyjJl%2BzPDJ%2BY%3D&reserved=0, or unsubscribehttps://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FABKULCBZPWH6SV6PEHFM2N3RPB56LANCNFSM4MRP3SFQ&data=02%7C01%7Cbrooke.anderson%40colostate.edu%7C1376e93be26148177f8008d7ec73d095%7Cafb58802ff7a4bb1ab21367ff2ecfc8b%7C0%7C0%7C637237854472098818&sdata=TylxziSUfv2gekuNqfAKUEDJJ2PyKQyGzy4dhpMGSvA%3D&reserved=0.

geanders commented 4 years ago

@camronp : Alright, I know that the title of this pull request is for the vocab, but I think that it's also got your exercise post in it, which is why I've kept it open, and I see that it looks like you've done quite a bit on that. Is that post ready for me to take a look at and add some suggestions for edits?

camronp commented 4 years ago

@camronp : Alright, I know that the title of this pull request is for the vocab, but I think that it's also got your exercise post in it, which is why I've kept it open, and I see that it looks like you've done quite a bit on that. Is that post ready for me to take a look at and add some suggestions for edits?

@geanders I'm embarrassed to admit that I thought I had submitted this to you nearly a month ago! If you don't mind, lets use this as the exercise submission. Again, I apologize for the super late submission and am open to your edits and suggestions on the excercise! Thank

geanders commented 4 years ago

@camronp : No, that's my fault! I think that the exercise got incorporated into the vocab pull request, so I just wasn't sure if it was ready for review or not. I'll take a look and add my suggestions!

geanders commented 4 years ago

@camronp : Really nice work on this! I have some suggestions for edits. Most are to add discussion or explanations along the way, particularly in Part A. We'll leave this pull request open, and so once you make your edits and push them to your own GitHub repo, they should automatically come through the pull request as well. Please let me know if you have any questions about any of these comments.

sil %>% 
  unclass() %>% 
  as.data.frame() %>% 
  tibble::rownames_to_column(var = "orig_order") %>% 
  arrange(as.numeric(orig_order)) %>% 
  bind_cols(simdat) %>% 
  ggplot(aes(x = x, y = y, shape = as.factor(cluster), color = sil_width)) + 
  geom_point() + 
  facet_wrap(~ class)

image

In this, there's a facet for each of the "true" groupings of the points (the four groups you originally simulated). Then, the shape shows the group it was assigned to by k-means (with the pam call). For the most part, all the points that were part of a true original group are assigned to the same cluster, although occasionally ones along the border with another cluster are mis-assigned. For the silhouette index, you can see from this that it gets close to zero (or even negative) when you get close to those borders between the original groups (around where x and y equal 4--5), but then are nice and high in the middle of the x-y space for a cluster and on edges that are away from any other cluster.

camronp commented 4 years ago

@geanders Thank you so much for all of your suggestions and edits. I spent quite a bit of time changing and improving the exercise 5 document with your recommendations. I just pushed it up to my repository and it looks like it made it the pull request.

geanders commented 4 years ago

@camronp : Really, really nice work on these revisions! It's now live here: https://kind-neumann-789611.netlify.app/post/exercise-solution-for-5-1/