## Linear Equations
Two-variable linear regressions take the form:

$$y = a + bx$$

where $x$ is the independent variable, $y$ is the dependent variable, $a$ is the $y$-intercept, and $b$ is the slope.

## Scatterplots
Scatterplots show the direction of a bivariate relationship.
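
## The Regression Equation
The line of best fit, also called the least-squares line, has the form $\hat{y} = a + bx$, where $\hat{y}$ is the estimated value of $y$ for a given $x$.
- The least-squares line always passes through the point $(\bar{x}, \bar{y})$ formed by the $x$ and $y$ sample means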
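
The intercept and slope of a least-squares line can be computed directly from the sample means. The following is a minimal sketch in plain Python, assuming the standard closed-form formulas (slope = sum of cross-deviations divided by sum of squared $x$-deviations, intercept = mean of $y$ minus slope times mean of $x$). The data and the names `least_squares_fit`, `a`, and `b` are illustrative and do not come from the notes.

```python
# Hypothetical sample data, chosen only for illustration.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]


def least_squares_fit(xs, ys):
    """Return (a, b) for the least-squares line y_hat = a + b * x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Sum of cross-deviations and sum of squared x-deviations.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b = sxy / sxx            # slope
    a = y_bar - b * x_bar    # intercept
    return a, b


a, b = least_squares_fit(xs, ys)
x_bar = sum(xs) / len(xs)
y_bar = sum(ys) / len(ys)

print(f"y_hat = {a:.2f} + {b:.2f}x")  # y_hat = 2.20 + 0.60x

# The fitted line evaluated at the mean of x gives the mean of y.
print(abs((a + b * x_bar) - y_bar) < 1e-9)  # True
```

Because the intercept is defined as the mean of $y$ minus the slope times the mean of $x$, the final check holds for any dataset, not only this one.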

## Error/Residuals
Residuals are calculated using the formula $\varepsilon = y - \hat{y}$, the observed value minus the predicted value.
The sum of squared errors (SSE) is calculated as $SSE = \sum \varepsilon^2$. The line of best fit is the line that minimizes the SSE.
When a linear model is appropriate, the residual plot looks random and shows no pattern.
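
To make the residual and SSE definitions concrete, here is a small sketch in plain Python. The data are made up for illustration, and the coefficients `a = 2.2`, `b = 0.6` are the least-squares values for that specific data (obtained from the usual closed-form formulas). The helper name `sse` is an assumption of this example.

```python
# Hypothetical sample data, chosen only for illustration.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]


def sse(a, b, xs, ys):
    """Sum of squared residuals for the line y_hat = a + b * x."""
    return sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))


# Least-squares coefficients for the data above.
a, b = 2.2, 0.6

# Residual = observed y minus predicted y_hat.
residuals = [y - (a + b * x) for x, y in zip(xs, ys)]
best = sse(a, b, xs, ys)
print(f"SSE of least-squares line: {best:.2f}")  # 2.40

# Any other line through the same data has a larger SSE.
others = [sse(2.2, 0.7, xs, ys), sse(2.0, 0.6, xs, ys), sse(2.5, 0.5, xs, ys)]
print(all(o > best for o in others))  # True
```

The residuals of a least-squares line with an intercept also sum to zero (up to rounding), which is one quick sanity check on a fit.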

## Correlation Coefficient
The correlation coefficient $r$ measures the strength of the linear relationship between $x$ and $y$.
- Values of $r$ close to $-1$ or $1$ indicate a strong linear relationship
- $r = 0$ indicates no linear correlation (the line of best fit is horizontal)
- $r = 1$ or $r = -1$ indicates a perfect linear correlation
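
The behaviour of the correlation coefficient at its extremes can be checked numerically. This sketch assumes the Pearson sample correlation formula (cross-deviations divided by the square root of the product of the squared deviations). The function name `correlation` and all data are illustrative only.

```python
import math


def correlation(xs, ys):
    """Pearson correlation coefficient of two equal-length samples."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical data, chosen only for illustration.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]

r_noisy = correlation(xs, [2.0, 4.0, 5.0, 4.0, 5.0])       # scattered points
r_perfect_pos = correlation(xs, [2 * x + 1 for x in xs])   # exact rising line
r_perfect_neg = correlation(xs, [10 - 3 * x for x in xs])  # exact falling line

print(f"{r_noisy:.4f}")         # 0.7746
print(f"{r_perfect_pos:.4f}")   # 1.0000
print(f"{r_perfect_neg:.4f}")   # -1.0000
```

Note that the formula is undefined when either variable is constant, since the denominator is then zero.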

## Coefficient of Determination
The coefficient of determination $r^2$ is typically stated as a percentage.
- $r^2$ as a percentage represents the percent variation in $y$ that can be explained by $x$
- $1 - r^2$ represents the percent variation in $y$ not explained by $x$
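
The "explained variation" reading can be illustrated by computing the coefficient of determination as one minus the ratio of the sum of squared residuals to the total variation of the dependent variable around its mean. This is a sketch under the assumption of a simple least-squares line with an intercept. The data and variable names (`sst`, `r_squared`, `unexplained`) are illustrative only.

```python
# Hypothetical sample data, chosen only for illustration.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 5.0, 4.0, 5.0]

n = len(xs)
x_bar = sum(xs) / n
y_bar = sum(ys) / n

# Least-squares slope and intercept from the closed-form formulas.
b = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sum(
    (x - x_bar) ** 2 for x in xs
)
a = y_bar - b * x_bar

# Variation left over after fitting the line (sum of squared residuals).
sse = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))
# Total variation of y around its mean.
sst = sum((y - y_bar) ** 2 for y in ys)

r_squared = 1 - sse / sst
unexplained = 1 - r_squared

print(f"explained by x:     {r_squared:.0%}")    # 60%
print(f"not explained by x: {unexplained:.0%}")  # 40%
```

For a simple linear regression this value equals the square of the correlation coefficient, which is why the two quantities share a symbol.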