Encyclopedia of General Culture: Regression analysis

Regression analysis[EXTRACT]

From Wikipedia, the free encyclopedia

Regression analysis is any statistical method where the mean of one or more random variables is predicted conditioned on other (measured) random variables. In particular, there is linear regression, logistic regression, Poisson regression, supervised learning, and unit-weighted regression. Regression analysis is more than curve fitting (choosing a curve that best fits given data points); it involves fitting a model with both deterministic and stochastic components. The deterministic component is called the predictor and the stochastic component is called the error term.

The simplest form of a regression model contains a dependent variable (also called "outcome variable," "endogenous variable," or "Y-variable") and a single independent variable (also called "factor," "exogenous variable," or "X-variable").

Typical examples are the dependence of the blood pressure Y on the age X of a person, or the dependence of the weight Y of certain animals on their daily ration of food X. This dependence is called the regression of Y on X.

Regression is usually posed as an optimization problem as we are attempting to find a solution where the error is at a minimum. The most common error measure that is used is the least squares: this corresponds to a Gaussian likelihood of generating observed data given the (hidden) random variable. In a certain sense, least squares is an optimal estimator: see the Gauss-Markov theorem.

The optimization problem in regression is typically solved by algorithms such as the gradient descent algorithm, the Gauss-Newton algorithm, and the Levenberg-Marquardt algorithm. Probabilistic algorithms such as RANSAC can be used to find a good fit for a sample set, given a parametrized model of the curve function.

Regression can be expressed as a maximum likelihood method of estimating the parameters of a model. However, for small amounts of data, this estimate can have high variance. Bayesian methods can also be used to estimate regression models. A prior is placed over the parameters, which incorporates everything known about the parameters. (For example, if one parameter is known to be non-negative a non-negative distribution can be assigned to it.) A posterior distribution is then obtained for the parameter vector. Bayesian methods have the advantages that they use all the information that is available and they are exact, not asymptotic, and thus work well for small data sets. Some practitioners use maximum a posteriori (MAP) methods, a simpler method than full Bayesian analysis, in which the parameters are chosen that maximize the posterior. MAP methods are related to Occam's Razor: there is a preference for simplicity among a family of regression models (curves) just as there is a preference for simplicity among competing theories.

[hide]

1 Purpose and formulation
2 Choice of the regression function
- 2.1 Linear regression
- 2.2 Logistic regression
3 Criterion to optimize
4 Choice of an estimator
5 Confidence interval for estimation assuming normality, homoscedasticiy, and uncorrelatedness
6 Examples
- 6.1 First example
- 6.2 Second example
7 See also
8 References
9 External links

Height (in)	58	59	60	61	62	63	64	65	66	67	68	69	70	71	72
Weight (lbs)	115	117	120	123	126	129	132	135	139	142	146	150	154	159	164

1	x	x^3
1	58	195112
1	59	205379
1	60	216000
1	61	226981
1	62	238328
1	63	250047
1	64	262144
1	65	274625
1	66	287496
1	67	300763
1	68	314432
1	69	328509
1	70	343000
1	71	357911
1	72	373248

Encyclopedia of General Culture

Thursday, February 09, 2006

Regression analysis

From Wikipedia, the free encyclopedia

Contents

Purpose and formulation

Choice of the regression function

Linear regression

Logistic regression

Criterion to optimize

Choice of an estimator

The Gauss-Markov hypothesis

Gauss-Markov least-squares estimation of the coefficients

Alternatives to Gauss-Markov

Confidence interval for estimation assuming normality, homoscedasticiy, and uncorrelatedness

Examples

First example

Second example

See also

References

External links

No comments:

Landkarte Und Stadtplan

Blog Archive