The correlation matrix indicates how the variables Risk, Age, Blood Pressure, and Smoker dummy are related to each other. As a rule of thumb, a pair of variables with a correlation above 0.5 in absolute value is treated as strongly correlated.
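A minimal sketch of how such a correlation matrix can be computed, using plain Python. The six rows are the first six rows of the data listed with the question text further down; with so few rows the values are illustrative only and are not the matrix the answer refers to. The function names `pearson` and `correlation_matrix` are assumptions made for this example.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def correlation_matrix(columns):
    """Return a nested dict: matrix[a][b] is the correlation of columns a and b."""
    names = list(columns)
    return {a: {b: pearson(columns[a], columns[b]) for b in names} for a in names}

# First six rows of the listed data (illustrative subset only).
columns = {
    "Risk": [10, 33, 14, 59, 30, 52],
    "Age": [59, 67, 68, 63, 65, 74],
    "Blood Pressure": [220, 129, 170, 198, 173, 172],
    "Smoker": [1, 0, 0, 0, 0, 1],
}

matrix = correlation_matrix(columns)
for name in matrix:
    print(name, [round(matrix[name][other], 2) for other in matrix])
```

Each diagonal entry is 1 because every variable is perfectly correlated with itself, and the matrix is symmetric, so only the entries below the diagonal need to be read.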
Risk, Y = -91.76 + 1.08 Age + 0.25 Blood Pressure + 8.74 smoker Dummy
The coefficients are the least-squares estimates for the respective variables. Each coefficient is multiplied by the value of its variable, and the products are added to the intercept to produce the forecast.
Multiple R is the multiple correlation coefficient, the correlation between the observed and fitted values of Risk. A value close to 1 indicates a strong positive relationship. R Square conveys that 87.3% of the variation of Risk around its mean is explained by the variables Age, Blood Pressure, and Smoker.
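A short sketch of the two quantities, assuming the usual definition of R Square as 1 - SSE/SST. The `actual` and `fitted` values are hypothetical numbers chosen only to exercise the function; the only figure taken from the answer is the reported R Square of 0.873. For a least-squares fit with an intercept, Multiple R is the square root of R Square, which gives about 0.934 here.

```python
from math import sqrt

def r_squared(actual, fitted):
    """Share of the variation around the mean that the fitted values explain."""
    mean = sum(actual) / len(actual)
    sst = sum((y - mean) ** 2 for y in actual)               # total variation
    sse = sum((y - f) ** 2 for y, f in zip(actual, fitted))  # unexplained variation
    return 1 - sse / sst

# Hypothetical observed and fitted values, not taken from the study.
actual = [10.0, 20.0, 30.0, 40.0]
fitted = [12.0, 18.0, 31.0, 39.0]
example_r2 = r_squared(actual, fitted)

# R Square reported in the regression output discussed above.
reported_r2 = 0.873
multiple_r = sqrt(reported_r2)

print(round(example_r2, 2), round(multiple_r, 3))
```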
Age = 65, Blood Pressure = 164, Smoker dummy = 1 (Smoking)
Risk = -91.76 + (1.08*65) + (0.25*164) +(8.74*1)
Risk = 28.18
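The same substitution can be written as a small function. This is a sketch that hard-codes the coefficients from the fitted equation above; the names `COEFFS` and `predict_risk` are assumptions made for this example.

```python
# Coefficients copied from the fitted equation in the answer.
COEFFS = {"intercept": -91.76, "age": 1.08, "blood_pressure": 0.25, "smoker": 8.74}

def predict_risk(age, blood_pressure, smoker, coeffs=COEFFS):
    """Return the fitted risk (probability times 100) for one patient.

    smoker is the dummy variable: 1 for a smoker, 0 for a nonsmoker.
    """
    return (coeffs["intercept"]
            + coeffs["age"] * age
            + coeffs["blood_pressure"] * blood_pressure
            + coeffs["smoker"] * smoker)

risk = predict_risk(age=65, blood_pressure=164, smoker=1)
print(round(risk, 2))
```

Setting `smoker=0` for the same patient lowers the estimate by exactly the dummy coefficient, 8.74.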
The lower and upper 95% values reported for each coefficient define its 95% confidence interval. These are the confidence intervals for the intercept and the slope coefficients:
Intercept (-124.03,-59.49)
Age (0.72, 1.43)
Blood Pressure (0.16, 0.35)
Smoker dummy (2.38, 15.1)
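Using the lower coefficient limits:
Risk = -124.03 + (0.72*65) + (0.16*164) + (2.38*1) = -48.61
Using the upper coefficient limits:
Risk = -59.49 + (1.43*65) + (0.35*164) + (15.1*1) = 105.96
The point estimate of 28.18 lies close to the midpoint of these two values, (-48.61 + 105.96)/2 = 28.68. This range is only a rough bracket, not a formal 95% interval for the predicted risk: setting every coefficient to its limit at once ignores the correlation between the coefficient estimates, and a risk defined as a probability times 100 cannot fall below 0 or above 100.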
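The plug-in arithmetic for the two limits can be reproduced with a few lines of Python. This is a sketch of the calculation as the answer performs it, not a formal interval for the prediction; `LOWER`, `UPPER`, `PATIENT`, and `plug_in` are names assumed for this example.

```python
# 95% limits for (intercept, Age, Blood Pressure, Smoker dummy) from the answer.
LOWER = (-124.03, 0.72, 0.16, 2.38)
UPPER = (-59.49, 1.43, 0.35, 15.10)

# The leading 1 multiplies the intercept; then Age, Blood Pressure, Smoker dummy.
PATIENT = (1, 65, 164, 1)

def plug_in(coeffs, values):
    """Sum of coefficient * value over all terms of the equation."""
    return sum(c * v for c, v in zip(coeffs, values))

low = plug_in(LOWER, PATIENT)
high = plug_in(UPPER, PATIENT)
midpoint = (low + high) / 2

print(round(low, 2), round(high, 2), round(midpoint, 3))
```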
g) The regression model could be improved by recording the level of smoking. The smoker dummy in the current model is binary, so it cannot distinguish light smokers from heavy smokers and does not fully capture the risk associated with smoking.
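One way to bring the level of smoking into the model is to replace the single dummy with one dummy per level above a baseline. The sketch below assumes three hypothetical levels ("none", "light", "heavy"); the study itself records only smoker versus nonsmoker, so these categories and the function name `encode_smoking` are illustrative assumptions.

```python
# Hypothetical smoking levels; "none" is the baseline and gets no dummy of its own.
LEVELS = ("none", "light", "heavy")

def encode_smoking(level, levels=LEVELS):
    """Return one 0/1 dummy per non-baseline level, in the order of `levels`."""
    if level not in levels:
        raise ValueError(f"unknown smoking level: {level!r}")
    return [1 if level == name else 0 for name in levels[1:]]

for level in LEVELS:
    print(level, encode_smoking(level))
```

Each dummy would then get its own coefficient in the regression, measured relative to the nonsmoker baseline.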
h) Strengths and weaknesses of regression.
1. Strength: regression is a straightforward way to model how several variables jointly relate to a response.
2. Weakness: it assumes a linear relationship in the data, and extreme observations (outliers) can distort the fit.
A recent 10-year study conducted by a research team at the Medical School was conducted to assess how age, blood pressure, and smoking relate to the risk of strokes. Assume that the following data are from a portion of this study. Risk is interpreted as the probability (times 100) that the patient will have a stroke over the next 10-year period. For the smoking variable, define a dummy variable with 1 indicating a smoker and 0 indicating a nonsmoker.

Risk  Age  Blood Pressure  Smoker
10    59   220             1
33    67   129             0
14    68   170             0
59    63   198             0
30    65   173             0
52    74   172             1
9     77   159             0
28    73   173             0
32    68   117             1
20    80   209             1
40    62   176             1
41    82   110             0
25    67   151             1
56    55   191             0
36    61   208             1
32    61   112             1
26    78   125             0
28    75   129             0
18    90   184...
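A minimal sketch of a least-squares fit on the 18 complete rows listed above, solving the normal equations in plain Python. The listing is truncated and may come from a different variant of the question, so the coefficients from these rows are not expected to match the equation in the answer. The names `ROWS`, `solve`, and `fit_ols` are assumptions made for this example.

```python
# (Risk, Age, Blood Pressure, Smoker) for the 18 complete rows of the listing.
ROWS = [
    (10, 59, 220, 1), (33, 67, 129, 0), (14, 68, 170, 0), (59, 63, 198, 0),
    (30, 65, 173, 0), (52, 74, 172, 1), (9, 77, 159, 0), (28, 73, 173, 0),
    (32, 68, 117, 1), (20, 80, 209, 1), (40, 62, 176, 1), (41, 82, 110, 0),
    (25, 67, 151, 1), (56, 55, 191, 0), (36, 61, 208, 1), (32, 61, 112, 1),
    (26, 78, 125, 0), (28, 75, 129, 0),
]

def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x

def fit_ols(rows):
    """Return [intercept, b_age, b_bp, b_smoker] from the normal equations X'X b = X'y."""
    xs = [[1.0, age, bp, smoker] for _, age, bp, smoker in rows]
    ys = [risk for risk, _, _, _ in rows]
    k = len(xs[0])
    xtx = [[sum(x[i] * x[j] for x in xs) for j in range(k)] for i in range(k)]
    xty = [sum(x[i] * y for x, y in zip(xs, ys)) for i in range(k)]
    return solve(xtx, xty)

coeffs = fit_ols(ROWS)
residuals = [risk - (coeffs[0] + coeffs[1] * age + coeffs[2] * bp + coeffs[3] * smoker)
             for risk, age, bp, smoker in ROWS]
print([round(c, 3) for c in coeffs])
```

Because the model includes an intercept, the residuals of a least-squares fit sum to zero, which is a quick check that the solution is consistent.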