Closed caglarmert closed 4 years ago
Descriptive Statistics
bimodal : two values appeared same time and more than the rest of the values then the data set multimodel : for n nodes
interquartile range (IQR) : Q3 - Q1
Statistical Distributions
Continuous : A number within a range of values, usually measured
Uniform : a type of probability distribution in which all outcomes are equally likely; each variable has the same probability that it will be the outcome. continuous random variable x has a uniform distribution, denoted U(a, b), if its probability density function is:
f(x) = 1 / (b−a)
> Nominal : the specific form of the normal distribution depends on 2 parameters: the expectation (µ) and variance (σ^2) - N(m,σ ^2) -
µ (expectation) : the distribution center, which corresponds to the maximum height of the graph. σ^2 (variance) : characterizes the range of variation, the “spreading” of the data.
f(x) = 1 / (σ√2π) exp (−(x−μ) ^ 2 / 2 σ ^2)
Discrete : Only take certain values (can’t be decimal), usually counted
Bernoulli (binomial) : the upcoming event (positive outcome) is also called "success." the ratio of positive outcomes to the total number of tests tends to the probability of the occurrence of this event.
f(x) = (n x) p ^ x(1−p) ^ (n−x) n = the number of experiments in the series; x = a random variable (the number of occurrences of event A); p^x = the probability that A happens exactly m times; q = 1 - p (the probability that A does not appear in the test)
>Poisson : is obtained as a limiting case of the Bernoulli distribution, if we push p to zero and n to infinity, but so that their product remains constant: np = a.
f(x) = [(e ^−λ) * (λ^x ) ] / x! The average number of events in an interval is designated λ. λ = the event rate also called the rate parameter. It is also equal to mean and variance.
Descriptive Analysis
Helps you detect outliers and typos, and enable you identify associations among variables, thus preparing you for conducting further statistical analyses.
There are two types
- Descriptive analysis for each individual variable
- Descriptive analysis for combinations of variables
Variable can be classified into quantitative and categorical
- Quantitative variables : represent quantities or numerical values
- Categorical variables : describe quality or characteristics of individuals
Descriptive Analysis for each individual variable
Quantitative variables ;
Categorical variables ;
Descriptive analysis for combination of two variables
- Both variables quantitative: Create a scatter plot.
- One variable categorical and the other quantitative: Calculate summary statistics and box-and-whisker plots of the quantitative variable classified by the categorical variable.
- Both variables categorical: Prepare a contingency table.