Question

Step 1

Data Set A- 7,7,7,9,9,9,10

Data Set B- 4,6,6,6,8,9,9,9,10,10,10

Step 1

1. Find the sample mean, median (if it exists) mode for each set of data
2. Next find the sample standard deviations for each set of data
3. Eliminate any outliers from the samples. Redo part 1 for each variable (if necessary, if not explain why). Compare your two sets of results. What happened to your averages? Standard deviations? Etc.
4. Write a brief explanation of what you found in the data.

Step 2

Data Set A

 x 1 2 3 4 5 6 7 y 7 7 7 9 9 9 10

Data Set B

 x 1 2 3 4 5 6 7 8 9 10 11 y 4 6 6 6 8 9 9 9 10 10 10

Step 2

1. Using the two variables from make a scatter plot.
2. Find the equation for the least-squares line, and graph the line on the scatter plot.
3. Find the sample correlation coefficient r and the coefficient of determination r2. Is r significant? What criteria did you use to determine if it was significant? For each set of data
4. What predictions can you make using this data sets? Write a brief explanation of what you found in the data sets.

A= c(7,7,7,9,9,9,10)
B= c(4,6,6,6,8,9,9,9,10,10,10)
mean(A)=sum of the observations/7 =  8.285714

standard deviation(A)=1.253566
median(A)= 9 (the middle most observation)

mode= 3 median - 2 mean=10.42857=10 (approx)

Q1(A)=7 (25% th i.e. the 2 nd observation)

Q3(A)=9 (75%th i.e the 6th observation)

QD(A)={Q3(A)-Q1(A)}/2=(9-2)/2=3.5

inner fences(A) are {Q3(A)+5.25, Q1(A)-5.25} i.e. {14.25,1.75}

i.e. the boundaries of our outlier fences for data set A is (1.75, 14.25)

Now Data set A= c(7,7,7,9,9,9,10)

Clearly no data of data set A lies outside the boundary .
similarly,

mean(B)=7.909091
standard deviation(B)= 2.071451
median(B)= 9

mode= 3 median - 2 mean=11.18182=11 (approx)

Q1(B)=6

Q3(B)=10

QD(B)={Q3(B)-Q1(B)}/2=(10-6)/2 =2

inner fences(B) are {Q3(B)+3, Q1(B)-3} i.e. {13,3}

i.e. the boundaries of our outlier fences for data set B is (3,13)

B= c(4,6,6,6,8,9,9,9,10,10,10)

Clearly, In data set B no data lies outside the boundary (3, 13).

So clearly not a single data is found neither in data set A nor in the data set B. So all the mean, sd, median, quantiles  will be same as we calculate already above.

Please come again with the other questions. Thank you

#### Earn Coins

Coins can be redeemed for fabulous gifts.

Similar Homework Help Questions
• ### 1. Describe the trend of the data, if any. 2. Calculate the linear correlation coefficient and is the linear correlatio... 1. Describe the trend of the data, if any. 2. Calculate the linear correlation coefficient and is the linear correlation coefficient significant? Why/why not? 3. Find the least-squares line of regression. 4. Graph the regression line on the scatter plot 5. Plot the residuals (give it your own title and labels for the axes!) with lines for 2 standard deviations of the residuals. 6. Predict the gas mileage of a 2000, 3000 and 4000 lb car. Make a scatter plot...

• ### 1) Find the percentage of data within 2 sample standard deviations from the mean: (12,14,67,46,35,84,67,85,93,104 2)... 1) Find the percentage of data within 2 sample standard deviations from the mean: (12,14,67,46,35,84,67,85,93,104 2) Find the sample mean and the sample standard deviation, and determine the skew of the data set: 01112355689 1133466899 2122556788999 3133888 3) An extreme outlier is defined to be any value outside of 3 standard deviations from the mean. Does the following data have any extreme outliers? (4,33,45,75,27,66,101,151,133,279) 4) Suppose that a data set has the following 5-number summary: min 0, Qi 5, Med=5,Q3=5,max...

• ### Consider the following two sample data sets. Set 1: 6 39 8 7 0 Set 2:... Consider the following two sample data sets. Set 1: 6 39 8 7 0 Set 2: 2 16 18 7 4 a. Calculate the coefficient of variation for each data set. b. Which data set has more variability? a. The coefficient of variation for set 1 is %. (Round to one decimal place as needed.)

• ### Need help with solving problem (b) to find r = For the accompanying data set, (a)... Need help with solving problem (b) to find r = For the accompanying data set, (a) draw a scatter diagram of the data, (b) by hand, compute the correlation coefficient, and (c) determine whether there is a linear relation between x andy Data set er er (a) Draw a scatter diagram of the da 4 A. 10 12 20 PrintDone 10 10 10 10 (b) By hand, compute the correlation coefficient. The correlation coefficient is rRound to three decimal places...

• ### Find some "real world" data. This data needs to have two sets of numbers that correspond... Find some "real world" data. This data needs to have two sets of numbers that correspond to each other in an exponential pattern. Choose data with at least 10 pairs of data values. Caution: Do not select specific problems already in your text! Also, please be careful not to select exponential growth models that do not need data fitting such as doubling models. For example, if you already know a particular model grows by doubling each time period, then you...

• ### Question # 3. The following data set records the cost of advertising (in thousands of dollars)...

Question # 3. The following data set records the cost of advertising (in thousands of dollars) and the number of prescriptions written for a new drug (in thousands). Cost (x) 9 2 3 4 2 6 9 10 Number of prescriptions (y) 85 61 64 67 60 75 83 87 Using the above data: (Please use Minitab for parts a, b, d) a) A bivariate (scatter) plot for the data set. (5 points) b) Find the correlation (r) and its...

• ### Name: Instructor Date: Section 6. The regression line equation for a set of data is given... Name: Instructor Date: Section 6. The regression line equation for a set of data is given by y-23x+. -10. The value of for this same data set is 10.1. Use α .05 a. If the lincar correlation coefficient for this data is -521, what is the best y -value for x 57 b. If the linear correlation coefficient for this data is r 972, what is the best predicted y-value for x=5? size n 15 with a 01 7. What...

• ### Below are four bivariate data sets and the scatter plot for each. (Note that each scatter... Below are four bivariate data sets and the scatter plot for each. (Note that each scatter plot is displayed on the same scale.) Each data set is made up of sample values drawn from a population. x y 1.0 7.9 2.0 5.1 3.0 10.1 4.0 6.4 х 11 10+ 9+ 8+ 7+ 6+ 5- X X 1.0 7.3 117 10+ 2.0 9.0 9+ 3.0 7.3 8+ 7 4.0 5.6 6+ 5.0 7.9 5 4 6.0 5.3 2 7.0 4.8 5.0...

• ### from part 5-12 plz dont do 1-4 BUNUS Assignment! A Sample Data AnalySIS III Licer Please... from part 5-12 plz dont do 1-4 BUNUS Assignment! A Sample Data AnalySIS III Licer Please answer all of the following questions. Make sure you show all your work, providing snapshots from Excel as well as listing all formulas and relevant explanation, as needed. We are interested in exploring the relationship between the income of professors (measured in thousands of dollars) and the number of years they have been employed by the university. We collect the following data from eight...

• ### The following two sample data sets both have sample mean 6. Set I 13.4 3.7 0.8... The following two sample data sets both have sample mean 6. Set I 13.4 3.7 0.8 103 1.8 Set II 2.6 8.2 5.2 7.2 6.8 (a) If μ is the population mean perform t tests for each set to test Ho : ,,-10 against H 11 #10. (b) Do you consider the conclusions of the tests reasonable? Do you have any reservations about Use significance level a 0.05. using a t-test for either of these data sets?