izxi / Learning


Evaluate the relationship between a categorical variable and a numerical variable? #17

Closed izxi closed 6 years ago

izxi commented 6 years ago

Which of the following statistical techniques can be used to evaluate the relationship between a categorical variable and a numerical variable?

Select one:

izxi commented 6 years ago

Analysis of Variance (ANOVA) is a statistical method used to test differences between two or more means. It may seem odd that the technique is called "Analysis of Variance" rather than "Analysis of Means." As you will see, the name is appropriate because inferences about means are made by analyzing variance.
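To make the "means via variance" point concrete, here is a minimal sketch of the one-way ANOVA F statistic in plain Python. The function name `one_way_anova_f` and the height data are made up for illustration and are not part of the original question. The sketch only computes the F ratio, not a p-value.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return the one-way ANOVA F statistic for a list of numeric groups."""
    all_values = [x for g in groups for x in g]
    grand_mean = mean(all_values)
    k = len(groups)        # number of categories
    n = len(all_values)    # total number of observations

    # Between-group sum of squares: spread of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Within-group sum of squares: spread of each value around its own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n - k)
    return ms_between / ms_within


# Hypothetical data: a numerical variable (height in cm) split by a
# categorical variable (major).
heights_by_major = {
    "biology": [160, 162, 164],
    "physics": [170, 172, 174],
    "history": [165, 167, 169],
}
f_stat = one_way_anova_f(list(heights_by_major.values()))
print(f_stat)  # 18.75
```

A large F means the group means differ by more than the within-group variance would explain, which is how a comparison of means ends up being an analysis of variance.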

izxi commented 6 years ago

The Pearson correlation coefficient measures the linear relationship between two datasets. Strictly speaking, the significance test for Pearson's correlation assumes that each dataset is normally distributed. Like other correlation coefficients, this one varies between -1 and +1, with 0 implying no correlation. Correlations of -1 or +1 imply an exact linear relationship. Positive correlations imply that as x increases, so does y. Negative correlations imply that as x increases, y decreases.
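As a rough illustration of the -1 to +1 range, the coefficient can be computed by hand in plain Python. The helper name `pearson_r` and the sample values are assumptions for this sketch. Both inputs must be numerical, which is why it does not fit a categorical variable directly.

```python
import math


def pearson_r(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of co-deviations and the two sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


x = [1, 2, 3, 4, 5]
r_pos = pearson_r(x, [2, 4, 6, 8, 10])  # y rises exactly linearly with x
r_neg = pearson_r(x, [10, 8, 6, 4, 2])  # y falls exactly linearly with x
print(r_pos, r_neg)  # 1.0 -1.0
```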

izxi commented 6 years ago

The root-mean-square deviation (RMSD), also called root-mean-square error (RMSE) or root-mean-squared error, is a frequently used measure of the differences between the values (sample or population values) predicted by a model or an estimator and the values actually observed.
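For reference, a small sketch of the RMSE calculation in plain Python. The function name `rmse` and the numbers are hypothetical. Note that it compares predicted and observed values of the same numerical variable, so it measures model error rather than a relationship between a categorical and a numerical variable.

```python
import math


def rmse(predicted, observed):
    """Return the root-mean-square error between predictions and observations."""
    squared_errors = [(p - o) ** 2 for p, o in zip(predicted, observed)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical values: errors are 1, -1, 3, -3, 0, so the mean squared error is 4.
predicted = [11, 19, 33, 37, 50]
observed = [10, 20, 30, 40, 50]
error = rmse(predicted, observed)
print(error)  # 2.0
```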

izxi commented 6 years ago

There are basically two types of random variables, and they yield two types of data: numerical and categorical. A chi-square (χ²) statistic is used to investigate whether distributions of categorical variables differ from one another. Categorical variables yield data in categories, and numerical variables yield data in numerical form. Responses to such questions as "What is your major?" or "Do you own a car?" are categorical because they yield data such as "biology" or "no." In contrast, responses to such questions as "How tall are you?" or "What is your G.P.A.?" are numerical. Numerical data can be either discrete or continuous. The table below may help you see the differences between these two variables.

johnnieng commented 5 years ago

A. ANOVA *

om-ganesh commented 3 years ago

@johnnieng do you have a reference to verify that ANOVA is the correct answer? I still think it is chi-squared and would go with @izxi.

johnnieng commented 3 years ago

> @johnnieng do you have a reference to verify that ANOVA is the correct answer? I still think it is chi-squared and would go with @izxi.

A chi-square test is used when you want to see if there is a relationship between two categorical variables.
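A minimal sketch of why this is the case: the chi-square statistic is built from a table of counts, so both variables have to be categorical. The function name `chi_square_statistic` and the counts are made up for illustration, and the sketch computes only the statistic, not a p-value.

```python
def chi_square_statistic(table):
    """Return the chi-square statistic for a contingency table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)

    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if the two variables were independent.
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    return stat


# Hypothetical counts: rows are majors (biology, physics),
# columns are answers to "Do you own a car?" (yes, no).
counts = [
    [30, 10],
    [20, 40],
]
chi2 = chi_square_statistic(counts)
dof = (len(counts) - 1) * (len(counts[0]) - 1)  # degrees of freedom
print(round(chi2, 3), dof)  # 16.667 1
```

A numerical variable such as height would first have to be binned into categories to fit this table, which is the step the ANOVA approach avoids.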

iaminebriki commented 2 years ago

Answer: ANOVA. Certainly, one-way ANOVA or an independent t-test. The chi-square test applies when both variables are categorical. Statisticians frown upon chopping numerical variables into groups.

Another answer source: ANOVA
