Measurements of length and weight for a sample of 20 low birth weight infants are contained in the data set twenty. The length measurements are saved under the variable name length, and the corresponding birth weights under weight.
(a) Construct a two-way scatter plot of birth weight versus length for the 20 infants in the sample. Without doing any calculations, sketch your best guess for the least-squares regression line directly on the scatter plot.
(b) Now compute the true least-squares regression line. Draw this line on the scatter plot. Does the actual least-squares line concur with your guess? Based on the two-way scatter plot, it is clear that one point lies outside the range of the remainder of the data. This point corresponds to the ninth infant in the sample of size 20. To illustrate the effect that the outlier has on the model, remove this point from the data set.
(c) Compute the new least-squares regression line based on the sample of size 19, and sketch this line on the original scatter plot. How does the least-squares line change? In particular, comment on the values of the slope and the intercept.
(d) Compare the coefficients of determination (R2) and the standard deviations from regression (sylx) for the two least-squares regression lines. Explain how these values changed when you removed the outlier from the original data set. Why did they change?
| length | weight |
| 41 | 1360 |
| 40 | 1490 |
| 38 | 1490 |
| 38 | 1180 |
| 38 | 1200 |
| 32 | 680 |
| 33 | 620 |
| 38 | 1060 |
| 30 | 1320 |
| 34 | 830 |
| 32 | 880 |
| 39 | 1130 |
| 38 | 1140 |
| 39 | 1350 |
| 37 | 950 |
| 39 | 1220 |
| 38 | 980 |
| 42 | 1480 |
| 39 | 1250 |
| 38 | 1250 |
*data in a copiable format
Measurements of length and weight for a sample of 20 low birth weight infants are contained...
3. The data set lowbwt.csv contains information for a sample of 100 low birth weight infants born in two teaching hospitals in Boston, Massachusetts. Measurements of systolic blood pressure are saved under the variable name sbp and indicators of gender – with 1 representing a male and 0 a female – under the name sex. Answer the following questions using SAS. Please provide your SAS code and relevant SAS output. a. Construct a histogram of systolic blood pressure measurements for...
The average gestational age (time from conception to birth) of a newborn infant is 40 weeks. The following table presents the gestational age in weeks (?) and the corresponding birth weight in pounds (?) for 14 female infants born in the US. x 36 37 32 36 41 39 38 40 42 32 31 37 39 41 y 7.2 7.5 5.1 5.9 8.9 7.8 7.2 8.0 8.8 4.2 3.9 6.9 7.8 9.1 (a) Compute the least-squares regression line for predicting...
Question 3 A researcher is interested in the relationship between the birth weights of infants and mothers' smoking habits. He uses the birth weight of an infant (ounces) and the average number of cigarettes the mother smokes per day during the pregnancy as the dependent and independent variables, y and x, respectively. Using a sample of size (1388 the following model is obtained by the method of least squares: y-119.770.514.x (3.15) (0.13) SE e the figures in brackets are the...
A popular, nationwide standardized test taken by high-school juniors and seniors may or may not measure academic potential, but we can nonetheless exarmiune the relationship between scores on thos test and peroracen measure academie potental. but We have chosen a random sample of fifteen students just finishing their first year of college, and for each student we've recorded her score on this standardized test (from 400 to 1600) and her grade point average (from 0 to 4) for her first...
4. The data set below shows the length of service (in years)and salaries in thousands of dollars) for 10 randomly selected employees. Length of service, 2 Salary, y 0.5 2 4 6 8 10 .75 39 41 41 40.5 42 41 40 1 6 8 38 41 42 (a) Construct a scatter plot for the data showing the regression line. (b) Find the equation of the regression line for the data. (c) Find the value of the correlation coefficient, r....
Birth weight and gestational age. The Child Health and Development Studies considered pregnancies among women in the San Francisco East Bay area. Researchers took a random sample of 50 pregnancies and used statistical software to construct a linear regression model to predict a baby's birth weight in ounces using the gestation age (the number of days the mother was pregnant). A portion of the computer output and the scatter plot is shown below. Round all calculated results to four decimal...
please answer question 7 (confidence interval).
(14 points) Birth weight and gestational age. The Child Health and Development Studies considered pregnancies among women in the San Francisco East Bay area. Researchers took a random sample of 50 pregnancies and used statistical software to construct a linear regression model to predict a baby's birth weight in ounces using the gestation age (the number of days the mother was pregnant). A portion of the computer output and the scatter plot is shown...
-+ Ft to page Page view 2. A doctor wanted to determine whether there is a Age versus HDL Cholesterol relation between a male's age and his HDL (so-called good) cholesterol. He randomly selected 17 of his patients and determined their HDL cholesterol. He obtained the following data: (Use Minitab only for graphs. Show all the steps) HDL HDL Age, x Cholesterol,y Age, x Cholesterol, y 38 42 46 32 57 54 34 56 35 40 42 38 47 38...
In the United States, tire tread depth is measured in 32nds of an inch. Car tires typically start out with 10/32 to 11/32 of an inch of tread depth. In most states, a tire is legally worn out when its tread depth reaches 2/32 of an inch. A random sample of four tires provides the following data on mileage and tread depth: Mileage (10,000 miles) Tread Depth (32nds of an inch) Tire You vwill use the sample data to find...
Route 1 Route 2 Route 3 30 20 3 33 38 41 31 31 32 28 25 28 35 30 31 (1) 2A. The number of groups = k = __3__ (1) 2B. The sample size is n = _15___ (3) 2C. Complete the ANOVA table. Source of Variation SS Df MS F Factor (Between) 48.933 2 24.467 0.28616 Error (Within) 1026 12 85.5 Total 1074.933 14 (4) 3. What is an outlier? A. How many standard...