sta-363-s20 / community

Discussion, Q&A, everything you want to say, formatted nicely
1 stars 0 forks source link

Some slide questions before midterm #44

Closed Cindyyyhey closed 4 years ago

Cindyyyhey commented 4 years ago

Dear Professor @LucyMcGowan ,

I have a few questions from slides about the regression part.

  1. What does this equation mean? Is that we draw the variance-bias trade off from this equation? 1

  2. I know hat means predicted values, but what does the hat matrix mean here? 屏幕快照 2020-02-11 下午9 15 31

  3. I'm so confused about this. What is the sigma here? Is the Var(beta) for calculating the variance of the whole regression model, or just one predictor? Also, I don't understand the formula for SE(beta) here. 屏幕快照 2020-02-11 下午9 36 09

  4. For assessing a model, why do we use RSE instead of RSS(since we always calculate RSS for a test dataset to test the model )? What is the difference between RSE and RSS? 屏幕快照 2020-02-11 下午10 06 36

  5. What does the summarise(reduce variables to values) mean here? 屏幕快照 2020-02-11 下午11 28 28

  6. Do we need to memorize or understand these two complicated formulas? 屏幕快照 2020-02-11 下午11 33 22 屏幕快照 2020-02-11 下午10 59 10

  7. Why false positive rate keep decreasing and false negative keep increasing? Is this tendency for all lda models? What does the x-axis threshold mean here? 屏幕快照 2020-02-11 下午11 50 40

  8. The last one!! What does the area under ROC curve mean? Which one is better, higher or lower AUC? (I think low auc is better because both false rates are low.) 屏幕快照 2020-02-11 下午11 51 03

Thanks a lot!!!

LucyMcGowan commented 4 years ago

What does this equation mean? Is that we draw the variance-bias trade off from this equation?

I know hat means predicted values, but what does the hat matrix mean here?

I'm so confused about this. What is the sigma here? Is the Var(beta) for calculating the variance of the whole regression model, or just one predictor? Also, I don't understand the formula for SE(beta) here.

For assessing a model, why do we use RSE instead of RSS(since we always calculate RSS for a test dataset to test the model )? What is the difference between RSE and RSS?

What does the summarise(reduce variables to values) mean here?

Do we need to memorize or understand these two complicated formulas?

Why false positive rate keep decreasing and false negative keep increasing? Is this tendency for all lda models? What does the x-axis threshold mean here?

The last one!! What does the area under ROC curve mean? Which one is better, higher or lower AUC? (I think low auc is better because both false rates are low.)

Cindyyyhey commented 4 years ago

Thanks a lot!!!!!!