mbcann01 commented 10 months ago

Overview

In the Fall of 2023, I moved over a bunch of stuff from PowerPoint slides (nearly) verbatim. I was in a rush, so I told myself to move it just move it over and improve it later.

Go back, reread, and improve. PowerPoint doesn't always translate perfectly to book format.

Left off

2023-09-25

Finished the first draft of the chapter. There is lots of room for improvement.

Tasks

[ ] Add life table and Kaplan-Meier table calculations. There are slides and R code in the cohort studies module.
[ ] In Fall 2023, I was just trying to get everything moved over from PP. It's probably worth rereading for clarity and figuring out if there are places where it would make sense to replace PP slide images with native R images.
[ ] Add a discussion of relative and absolute differences (took this out in Fall 2023). See slides (measures_of_association_tree_01, measures_of_association_tree_02, and measures_of_association_tree_03)
[ ] Add difference between relative and absolute difference example from GPLI presentation
[ ] Use Rothman's investment analogy for absolute vs. relative differences (see below)
[ ] Add a terminology table for probabilities (see below)
[ ] The slides for null values and no association are heavily geared toward incidence measures. However, they could also be about prevalence measures. l should make these more general.
[ ] I deleted the slides about null values (incidence_proportion_difference_null, incidence_proportion_ratio_null, and odds_ratio_null). I don't dislike them. I think they are good, but I was having trouble describing them. You may want to add them back.
[ ] Show readers how to interpret relative measures < 1
[ ] Show readers how to interpret absolute measures < 0
[ ] Show readers how to calculate confidence intervals, p-values, and p-value curves for each measure of association.
[ ] Add a terminology table for measures of association. For example, incidence proportion ratio is also called risk ratio and relative risk.
[ ] Demonstrate the equivalence between exposure OR and outcome OR
[ ] Break probability off into its own chapter

mbcann01 commented 9 months ago

Terms to consider adding

[ ] Absolute differences
[ ] Relative differences
[ ] Exposed
[ ] Unexposed
[ ] Relative Risk (Risk Ratio)
[ ] Odds Ratio (Relative Odds)
[ ] Effectiveness
[ ] Efficacy
[ ] Mean difference
[ ] Attributable risk in exposed
[ ] Etiologic fraction
[ ] Excess fraction
[ ] Percent attributable risk
[ ] Absolute risk reduction
[ ] Number needed to treat
[ ] Population attributable risk
[ ] Point prevalence rate ratio
[ ] Multiple testing, correcting from multiple testing. Correcting for multiple testing in the context of confidence intervals
[ ] Add sensitivity, specificity. Confusion matrix

mbcann01 commented 9 months ago

Rothman's investment analogy for absolute vs. relative differences

•Difference measures such as RD and IRD measure the absolute effect of an exposure. It is also possible to measure the relative effect. As an analogy, consider how to assess the performance of an investment over a period of time. Suppose that an initial investment of $100 became $120 after 1 year. The difference in the value of the investment at the end of the year and the value at the beginning, $20, measures the absolute performance of the investment. The relative performance is obtained by dividing the absolute increase by the initial amount, which gives $20/$100, or 20%. Contrast this investment experience with that of another investment, in which an initial sum of $1000 grew to $1150 after 1 year. For the latter investment, the absolute increment is $150, far greater than the $20 from the first investment, but the relative performance of the second investment is $150/$1000, or 15%, which is worse than the first investment.

•Rothman, Kenneth J.. Epidemiology: An Introduction (p. 59). Oxford University Press. Kindle Edition.

mbcann01 commented 9 months ago

Risk difference

“The 5-year risk difference comparing study participants exposed to the contaminant to those who were unexposed was 4%.” (Epi by Design)
for every 1,000 persons who are observed to be exposed to the contaminant (and if those people resemble those we studied), we would expect to likewise observe an additional 40 new cancer cases above the background cancer rate over 5 years. (Epi by Design)

Several somewhat technical points about the risk difference measure should be called out here. First, the range of the risk difference is from −1 to +1, inclusive (which we express as [−1, 1]). This is because the highest possible risk is 1, while the lowest is 0; if, as in Table 2.2, the risk in the exposed is higher than in the unexposed, then the risk cannot be higher than 1 − 0 = 1. Similarly, when the risk in the exposed is lower than in the unexposed, then the risk cannot be lower than 0 − 1 = −1. A negative risk difference would occur if the exposure was protective; for instance, if we were considering the association of daily aspirin use with risk of heart attack. Whether the exposure is associated with increased or decreased risk, the risk difference is considered relative to the null value. Again, this is the value which reflects no differences between the two groups being compared. No differences here would mean that the risk in exposed participants and the risk in unexposed participants are the same value P. Therefore, the null for the risk difference is P − P = 0.

mbcann01 commented 9 months ago

Terminology recap

prob_def <- "If some process is repeated a large number of times, $n$, and if some resulting event with the characteristic $Y$ occurs, $m$ times, the relative frequency of occurrence of $Y$, $\frac{m}{n}$ will be approximately equal to the probability of $Y$."
conditional_prob_def <- "The probability that some event occurs given that we know that some other event has already occurred."

Our Term	Definition	Equation
Probability	`r prob_def` @Daniel2013-qq	$P(Y) = \frac{m}{n}$
Conditional probability	`r conditional_prob_def`	$P(Y	X) = \frac{P(Y \cap X)}{P(X)}$

mbcann01 commented 9 months ago

Predictions

I took this material out in Fall 2023. I may want to add it back in at some point.

Predictions, especially good ones, can obviously be useful on their own. We may know that people of a certain race/ethnicity are most likely to get a particular form of cancer. Knowing that may allow us to concentrate screening efforts more effectively. We may know that older adults who begin to have trouble managing their finances are more likely to develop dementia. We may be able to use that information as an early indicator of important health problems to come.

However, in epidemiology, we are very often not content with predictions alone. It is extremely common for our questions and studies to either directly ask causal questions or imply causal relationships between variables. The reason we are often more interested in causal associations than mere predictions can be found directly in our definition of epidemiology. We want to control health problems. Said another way, we want to know why ”bad” things happen so that we can stop them from happening and/or why “good” things happen so that we can make them happen more often.

This idea is simultaneously so straightforward and so complex. As we will see throughout the semester.

Notice that in the cases above these predictions may be perfectly valid, but do they get us any closer to our ultimate goal of “controlling health problems?” We can’t change anyone’s race or ethnicity, can we? Even if we could, I’m hard-pressed to think of an example of a health outcome that is caused directly by a person’s race or ethnicity. Race and ethnicity are just a proxy for the true unmeasured cause. Likewise, do you really believe that if we hired an accountant to help an older person manage their finances that they would no longer develop dementia? Of course not.

mbcann01 commented 9 months ago

Relative vs absolute difference example

Example

Smoking and pill
Smoking and smoking cessation program
1,000,000 smokers
100,000 will die
Pill will save 50% who take it, 10% can afford it.
- IP = 0.50, 5,000 lives saved
Smoking cessation program will save 10%, but 100% get it.
- IP = 0.01, 10,000 lives saved

brad-cannell / r4epi

Review and improve the measures of association chapter #107

Overview

Left off

Tasks

Terms to consider adding

Rothman's investment analogy for absolute vs. relative differences

Risk difference

Terminology recap

Predictions

Relative vs absolute difference example