Gasoline consumption in the United States has been steadily increasing. Consumption data from 1994 to 2004 is shown in
[link] .
http://www.bts.gov/publications/national_transportation_statistics/2005/html/table_04_10.html Determine whether the trend is linear, and if so, find a model for the data. Use the model to predict the consumption in 2008.
Year
'94
'95
'96
'97
'98
'99
'00
'01
'02
'03
'04
Consumption (billions of gallons)
113
116
118
119
123
125
126
128
131
133
136
The scatter plot of the data, including the least squares regression line, is shown in
[link] .
We can introduce new input variable,
representing years since 1994.
The least squares regression equation is:
Using technology, the correlation coefficient was calculated to be 0.9965, suggesting a very strong increasing linear trend.
Using this to predict consumption in 2008
The model predicts 144.244 billion gallons of gasoline consumption in 2008.
Scatter plots show the relationship between two sets of data. See
[link] .
Scatter plots may represent linear or non-linear models.
The line of best fit may be estimated or calculated, using a calculator or statistical software. See
[link] .
Interpolation can be used to predict values inside the domain and range of the data, whereas extrapolation can be used to predict values outside the domain and range of the data. See
[link] .
The correlation coefficient,
indicates the degree of linear relationship between data. See
[link] .
A regression line best fits the data. See
[link] .
The least squares regression line is found by minimizing the squares of the distances of points from a line passing through the data and may be used to make predictions regarding either of the variables. See
[link] .
Section exercises
Verbal
Describe what it means if there is a model breakdown when using a linear model.
When our model no longer applies, after some value in the domain, the model itself doesn’t hold.
A regression was run to determine whether there is a relationship between hours of TV watched per day
and number of sit-ups a person can do
The results of the regression are given below. Use this to predict the number of sit-ups a person who watches 11 hours of TV can do.
Step 1: Find the mean. To find the mean, add up all the scores, then divide them by the number of scores. ...
Step 2: Find each score's deviation from the mean. ...
Step 3: Square each deviation from the mean. ...
Step 4: Find the sum of squares. ...
Step 5: Divide the sum of squares by n – 1 or N.
The sample of 16 students is taken. The average age in the sample was 22 years with astandard deviation of 6 years. Construct a 95% confidence interval for the age of the population.
Bhartdarshan' is an internet-based travel agency wherein customer can see videos of the cities they plant to visit. The number of hits daily is a normally distributed random variable with a mean of 10,000 and a standard deviation of 2,400
a. what is the probability of getting more than 12,000 hits?
b. what is the probability of getting fewer than 9,000 hits?
Bhartdarshan'is an internet-based travel agency wherein customer can see videos of the cities they plan to visit. The number of hits daily is a normally distributed random variable with a mean of 10,000 and a standard deviation of 2,400.
a. What is the probability of getting more than 12,000 hits