Question
can someone explain the ols and relationship of age and income. below is the output
DA E , av Normal No Spacing Residuals vs Fitted 40 20 Residuals OOOOO DERBO 0 -20 -40 49 Fitted values Imagss_2018Sage-gss_20
AaBbCcDdEe AaBbCcDdEe I Ă Aa~ PO EEVEEE 21 AO-A E E E -2.8837 Normal No Spacing Call: Im formula = gss_2018$age gss_2018$inco
0 0
Add a comment Improve this question Transcribed image text
Answer #1

Here, linear regression is run between Age and Income using function lm() in R.

Here, multiple R-squared and adjusted R-squared tell us the goodness of fit or the amount of variability Income is capable of explaining in Age. The more it is close to 1 , better is the model and more it is close to 0, poorer is the model. Here,

Multiple R-squared : 0.009128 and adjusted R-squared : 0.0003154 which is approximately 0. Hence, model is worst or cannot explain even 1% variability for Age.

Also, for various indpendent variable like Income16below average, average, above average and far above average, we have the values of corresponding coefficients that explain the model.

Age ~ 49.7037 + 0.1791 * Income16below average -0.2757*  income16average - 4.6685 * income16 above average - 2.8837 * income16far above average

Correponding t-statistics and p-value explain the significance of these individual factors in the model. Here, only income16 above average is significant. Since, only its p-value is less than the signifacnt levels (1%,5%,10%) and makes it important to the model by rejection corresponding Ho of no significance relationship.

Also, the residual plot explains Age is categorial and not linear which cannot have same variability.

Therefore, OLS is not a good idea for this model, since ages form categories or classes, one can run classification or clustering algorithms like decision trees, logistic regression, etc.

Add a comment
Know the answer?
Add Answer to:
can someone explain the ols and relationship of age and income. below is the output DA...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
  • How do I interpret the p-values in terms of rejecting or failing to reject H0 at...

    How do I interpret the p-values in terms of rejecting or failing to reject H0 at a 95% confidence level? What does the intercept column mean in terms of p-value? How does the p-value of the F test compare and what does it mean? In the simple linear regression I'd conclude age isn't related to pulmonary disease (what does intercept p-value mean) but for the multiple regression I'd say age and height aren't related to pulmonary disease but smoking is...

  • Consider the following regression results: Describe how the response y depends on the regressor x. What...

    Consider the following regression results: Describe how the response y depends on the regressor x. What is the formula for the regression line? What is the B0 and B1, and what do these coefficients represent? The Residuals vs. fitted plot is used to assess what assumption? What does the above plot tell you about your data? (remember to round all answers to 3 decimal places) Call: Im(formula = y ~ X, data = d) Residuals: Min 1Q Median 3Q Max...

  • What are the implications of predictability results in Part 2 and 3 for investment decisions? Part...

    What are the implications of predictability results in Part 2 and 3 for investment decisions? Part 2 use log dividend-price ratio to predict the 5-year stock market excess log returns: lm(formula = lnexret[2:t] ~ dp[1:t - 1]) Residuals:      Min 1Q Median 3Q Max -0.54389 -0.07305 0.01977 0.10712 0.34107 Coefficients:             Estimate Std. Error t value (Intercept) 0.58469 0.12768 4.579 dp[1:t - 1] 0.13510 0.03771 3.582                  Pr(>|t|)     (Intercept) 1.58e-05 *** dp[1:t - 1] 0.000567 *** --- Signif. codes:   0 ‘***’ 0.001 ‘**’...

  • A sample of 75 undergraduates were asked to participate in a study to investigate the relationship...

    A sample of 75 undergraduates were asked to participate in a study to investigate the relationship between a person's grip strength (in newtons) and their forearm circumference (in centimeters). The following plot shows a scatterplot of these data. 24 26 28 30 Forearm.Circumference What is the most correct interpretation of this plot? OThere is a no clear relationship between grip strength and forearm circumference OThere is a weak negative relationship between grip strength and forearm circumference OThere is a moderate...

  • A statistician wishes to estimate the price of a used car of a certain brand using...

    A statistician wishes to estimate the price of a used car of a certain brand using linear regression based on the variables Age, Mileages, Crash history, and Owner’s plate number. To do so, she randomly selected a certain number of used cars of the brand and measures AGE, MILEAGE, CRASH HISTORY, and NUMBER OF DIGITS ON PLATE NUMBER. The following is the output from R. Based on the R output, answer the following questions. f) Which variable would you first...

  • It has been established for a long time that height has a positive correlation with weight....

    It has been established for a long time that height has a positive correlation with weight. As people gets taller their weight increases. In a research study, a linear regression model was proposed to predict weight based on height. R output below provides the analysis. Interpret it, list any strengths and limitations of the result. Call: lm(formula = Weight ~ Height) Residuals:     Min      1Q Median      3Q     Max -6.7104 -2.9217 0.4276 2.3973 7.8586 Coefficients:             Estimate Std. Error t value...

  • Regression Analysis Question: is there any relationship between the economic growth rate(GDP) and unemployment rate(Unemp_Rates), Poverty...

    Regression Analysis Question: is there any relationship between the economic growth rate(GDP) and unemployment rate(Unemp_Rates), Poverty (Poverty_Rates), Technological and science workforce(ech_Scien_Wforce), high school graduate (HS_Grad_Rates) , and housing cost( Housing_Cost) This is a regression analysis result : Call: lm(formula = GDP_Rate ~ Unemp_Rates + Poverty_Rates + Tech_Scien_Wforce + HS_Grad_Rates + Housing_Cost) Residuals: Min 1Q Median 3Q Max -1.5562 -0.3988 -0.1126 0.4971 1.6748 Coefficients: Estimate Std. Error t value Pr(>|t|)    (Intercept) 15.822486 5.424619 2.917 0.00555 ** Unemp_Rates -0.053450 0.136427 -0.392...

  • please show your explanation thanks! ## ## Call: ## Im(formula = mpg ~ disp + hp...

    please show your explanation thanks! ## ## Call: ## Im(formula = mpg ~ disp + hp + wt + osec, data = mtcars.train.df) ## ## Residuals: Min 1Q Median ## -4.3442 -1.1687 -0.4033 3Q Max 1.0519 5.9623 ## ## Coefficients: Estimate Std. Error t value Pr>t) ## (Intercept) 31.204891 10.909916 2.860 0.00967 ** ## disp 0.009432 0.012308 0.766 0.45245 ## hp -0.032908 0.025528 -1.289 0.21208 ## wt -4.978374 1.434757 -3.470 0.00242 ** ## qsec 0.434043 0.576267 0.753 0.46011 ## ---...

  • Interpreting regression results 2. This is the result of a regression where goals is the dependent...

    Interpreting regression results 2. This is the result of a regression where goals is the dependent variable and minutes played is the explanatory variable. a. Write out the simple linear regression equation that predicts goals based on time played using the output displayed here. If the average soccer player played one additional game (90 minutes), how many additional goals would you predict them to have scored? b. Call: 1m(formula goalstimeplayed, data -data) Residuals: Min 1Q Median 3Q Max 5.0572-1.6294 -0.3651...

  • A client of yours wants to find out the best microbial environment for C. elegans. In...

    A client of yours wants to find out the best microbial environment for C. elegans. In previous meetings, the client told you that C. elegans feed on bacteria but may also be killed by certain bacteria. Therefore, it is important to figure out what bacteria are beneficial to C. elegans. In particular, the client was interested in studying the association between the density of Gluconobacter and the density of C. elegans. The client had collected some pilot data for this...

ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT