Multiple Linear Regression: Statistics
y = (6.7, 14.15, 62.11, 7.8, 7.9, 8.1)
x1= (1.2, 4.5, 8.7, 3.3, 6.1, 7.2)
x2= (1.11, 7.5, 4.2, 9.1, 7.4, 8.0)
1). Construct individual 95% CIs on β1 and β2.
2). Construct simultaneous 95% CIs on β1 and β2.
Answer:
Multiple Linear Regression: Statistics
y = (6.7, 14.15, 62.11, 7.8, 7.9, 8.1)
x1= (1.2, 4.5, 8.7, 3.3, 6.1, 7.2)
x2= (1.11, 7.5, 4.2, 9.1, 7.4, 8.0)
1). Construct individual 95% CIs on β1 and β2.
95% CIs on β1 = (-2.540837, 15.285902) and β2= (-11.915604 , 4.339309)
2). Construct simultaneous 95% CIs on β1 and β2.
95% CIs on β1(-5.325072, 18.070137) and β2= (-14.454346, 6.878052)
If we want simultaneous confidence intervals for both the slopes, using the Bonferroni method with joint confidence level α, set the level equal to 1 – α / 2. In our case we set level as 0.975.
R used for computation:
R code:
y=c(6.7,14.15,62.11,7.8,7.9,8.1)
x1=c(1.2,4.5,8.7,3.3,6.1,7.2)
x2=c(1.11,7.5,4.2,9.1,7.4,8.0)
mydata <- data.frame(y,x1,x2)
model <-lm(y~x1+x2, data = mydata)
summary(model)
anova(model)
#Construct individual 95% CIs
confint(model,level=0.95)
#Construct simultanuous 95% CIs
confint(model,level=0.975)
R output:
Call:
lm(formula = y ~ x1 + x2, data = mydata)
Residuals:
1 2 3 4 5 6
-5.167 5.460 14.155 12.818 -11.365 -15.902
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 8.425 19.368 0.435 0.693
x1 6.373 2.801 2.275 0.107
x2 -3.788 2.554 -1.483 0.235
Residual standard error: 16.36 on 3 degrees of freedom
Multiple R-squared: 0.6642, Adjusted R-squared: 0.4404
F-statistic: 2.967 on 2 and 3 DF, p-value: 0.1946
> anova(model)
Analysis of Variance Table
Response: y
Df Sum Sq Mean Sq F value Pr(>F)
x1 1 999.76 999.76 3.7342 0.1488
x2 1 589.06 589.06 2.2002 0.2346
Residuals 3 803.19 267.73
> #Construct individual 95% CIs
> confint(model,level=0.95)
2.5 % 97.5 %
(Intercept) -53.211907 70.060994
x1 -2.540837 15.285902
x2 -11.915604 4.339309
> #Construct simultanuous 95% CIs
> confint(model,level=0.975)
1.25 % 98.75 %
(Intercept) -72.465048 89.314136
x1 -5.325072 18.070137
x2 -14.454346 6.878052
Multiple Linear Regression: Statistics y = (6.7, 14.15, 62.11, 7.8, 7.9, 8.1) x1= (1.2, 4.5, 8.7,...
Customer Waiting Times (in minutes) 7.4 6.9 6.9 6.8 7.4 6.6 7.7 6.5 7.2 7.9 8.7 7.2 7.4 7.9 6.9 6.7 8.9 7.1 7.6 6.1 7.2 7.6 7.3 5.9 7.3 6.8 6.7 6.9 7.8 7.4 6.8 6.9 6.1 6.4 5.3 7.7 7.7 7.8 7.5 6.3 7.6 7.1 6.9 8.7 7.4 6.4 6.8 7.2 8.6 6.1 7.1 6.7 7.5 7.6 7.1 8.1 7.3 7.5 7.8 6.3 The accompanying data table includes weights (in grams) of a simple random sample of 40...
Parametirc test or not:Test statistic:p-value:decision:Is There A Difference Between the Means?6.7 6.2 3.1 310.3 10 5 5.56.9 5.5 3.3 3.110.5 6.3 4.3 5.44.5 4.6 1.8 25.6 5.6 2 2.65.9 6.1 2.1 2.58 11.7 4 4.68 7.4 3.3 3.15.8 5.2 3.1 2.96 7.3 3.0 3.28.7 5.3 2.7 36 5.5 2.1 2.27.2 6.3 3.5 3.25.9 4.6 2.9 3.46 7.4 3 3.37.2 7.8 3.7 3.48.6 9.4 5.1 5.77.2 8.1 2.8 3.15.8 5.4 2.2 1.83.3 4 1.7 1.86.8 5.1 2 1.83.7 3.5 2.2 2.112...
The data on the below shows the number of hours a particular drug is in the system of 200 females. Develop a histogram of this data according to the following intervals: Follow the directions. Test the hypothesis that these data are distributed exponentially. Determine the test statistic. Round to two decimal places. (sort the data first) [0, 3) [3, 6) [6, 9) [9, 12) [12, 18) [18, 24) [24, infinity) 34.7 11.8 10 7.8 2.8 20 9.8 20.4 1.2 7.2...
11.38 Building a multiple linear regression model. Let’s now build a model to predict the life-satisfaction score, LSI. (a) Consider a simple linear regression using GINI as the explanatory variable. Run the regression and summarize the results. Be sure to check assumptions. (b) Now consider a model using GINI and LIFE. Run the multiple regression and summarize the results. Again be sure to check assumptions. (c) Now consider a model using GINI, LIFE, and DEMOCRACY. Run the multiple regression and...
Compute Regression Analysis for following relationship: The relationship between death rate X1 (USD) vs. population density X5. Population as a Predictor, X, then death rate as a Response variable, Y. Get Regression Output, and Scatter plot between these variables and compute Coefficient of Determination, R2, and Interpret your findings. X1 X2 X3 X4 X5 The data (X1, X2, X3, X4, X5) are by city. 8 78 284 9.1 109 X1 = death rate per 1000 residents 9.3 68 433 8.7 144...
Compute the correlation coefficient, r, for all five variables (columns). Interpret your findings whether you have determined any relationship between variables. X1 X2 X3 X4 X5 The data (X1, X2, X3, X4, X5) are by city. 8 78 284 9.1 109 X1 = death rate per 1000 residents 9.3 68 433 8.7 144 X2 = doctor availability per 100,000 residents 7.5 70 739 7.2 113 X3 = hospital availability per 100,000 residents 8.9 96 1792 8.9 97 X4 = annual...
The data contained in the file named StateUnemp show the unemployment rate in March 2011 and the unemployment rate in March 2012 for every state.† State Unemploy- ment Rate March 2011 Unemploy- ment Rate March 2012 Alabama 9.3 7.3 Alaska 7.6 7.0 Arizona 9.6 8.6 Arkansas 8.0 7.4 California 11.9 11.0 Colorado 8.5 7.8 Connecticut 9.1 7.7 Delaware 7.3 6.9 Florida 10.7 ...
An object of weight 1 N is falling vertically. The time vs. speed data can be found here. In this case the effect of air-drag cannot be neglected. Use your critical thinking to estimate the air-drag coefficient . Make sure you include the units in your answer. 0 0 0.1 0.9992 0.2 1.993 0.3 2.978 0.4 3.948 0.5 4.898 0.6 5.826 0.7 6.728 0.8 7.599 0.9 8.438 1 9.242 1.1 10.01 1.2 10.74 1.3 11.43 1.4 12.09 1.5 12.7 1.6 ...
We consider the multiple linear regression with LIFE (y) as the response variable, and MALE, BIRTH, DIVO , BEDS, EDUC, and INCO, as predictors. QUESTION: Plot the standardized residuals against the fitted values. Are there any notable points. In particular look for points with large residuals or that may be influential. # please screenshot the Rcode for the plot. # data information are as follows: "STATE" "MALE" "BIRTH" "DIVO" "BEDS" "EDUC" "INCO" "LIFE" AK 119.1 24.8 5.6 603.3 14.1 4638...
We consider a multiple linear regression model with LIFE (y) as
the response variable, and MALE (x1), BIRTH (x2), DIVO (x3), BEDS
(x4), EDUC (x5), and INCO (x6), as predictors.
"STATE" "MALE" "BIRTH" "DIVO" "BEDS" "EDUC" "INCO" "LIFE"
AK 119.1 24.8 5.6 603.3 14.1 4638 69.31
AL 93.3 19.4 4.4 840.9 7.8 2892 69.05
AR 94.1 18.5 4.8 569.6 6.7 2791 70.66
AZ 96.8 21.2 7.2 536.0 12.6 3614 70.55
CA 96.8 18.2 5.7 649.5 13.4 4423 71.71
CO 97.5...