Use Random number generator (under Data Analysis) to simulate the following data set. Create 10 columns, each 20 points long and use the following parameters: Number of variables (10), number of data point (20), Distribution (Normal), Mean (40), Standard Deviation (10), Random seed (1234). The data should be in columns: A,B,C,….,I,J. Randomly pick two columns (say Column B and Column H) and perform 2-sided t-test on these two data columns. Record the P-value and repeat this procedure several times (at least 5 times). That is, each time randomly pick two columns, perform 2-sided t-test and record the P-value. And answer the questions. (Pick the closest answer) 17. What did you observe? a. Most of the P-values are very small, and some below 5%. b. P-values are very different, some small and some large, but very few, if any, below 5% c. Most of the P-values are very large, around 0.9 and 0.95 range. d. Essentially all P-values are below 5% and some even below 1% range. 18. What is the Statistical interpretation? a. Since data are created randomly one expect to see small P-value for t-test. b. The t-test worked as designed, since in most cases it detected the difference, sometimes even with 1% threshold. c. The t-test worked as designed, in most cases it did not detect the difference since the data are created with equal means (equal averages). d. None of the above Create one more random column of data. This time use the following parameters: Number of variables (1), number of data point (20), Distribution (Normal), Mean (50), Standard Deviation (10), Random seed (3434). CUT &PASTE this data in the same sheet as the previous 10 columns and put it in column M. Randomly pick one column out of A,B,…,J (say Column F) and perform 2-sided t-test based on this randomly picked column and the newly created column M. Record the P-value and repeat this procedure several times (at least 5 times). That is, each time randomly pick one data from the ten previously created and perform 2-sided t-test versus the newly created column M. Record the P-value. And answer the questions. (Pick the closest answer) 19. What did you observe? a. Most of the P-values are very small, and some below 5%. b. P-values are very different, some small and some large, but very few, if any, below 5% c. Most of the P-values are very large, around 0.9 and 0.95 range. d. Essentially all P-values are below 5% and some even below 1% range. 20. What is the Statistical interpretation? a. Since data are created randomly one expect to see small P-value for t-test. b. The t-test worked as designed, since in most cases it detected the difference, sometimes even with 1% threshold. c. The t-test worked as designed, in most cases it did not detect the difference since the data are created with equal means (equal averages). d. None of the above
ANSWER:
'Given that,
17)
option B is correct.
18)
option C is correct.
part -2:
> set.seed(3434)
> M = rnorm(20,50,10)
> M
[1] 45.93904 35.87163 60.34832 44.38388 49.84033 49.61631 48.19858
46.23254 46.82688 67.83189 62.74389 50.82694 50.86437 57.84251
68.30389 28.44567 51.51725 44.33325 37.06757 47.23892
T-Test:
> t.test(A,M)
Welch Two Sample t-test
t = -4.3517, df = 19.384, p-value = 0.0003296
> t.test(I,M)
Welch Two Sample t-test
t = -4.1386, df = 19.497, p-value = 0.0005327
> t.test(F1,M)
Welch Two Sample t-test
t = -4.3046, df = 19.142, p-value = 0.0003767
> t.test(D,M)
Welch Two Sample t-test
t = -4.1123, df = 19.425, p-value = 0.0005699
> t.test(H,M)
Welch Two Sample t-test
t = -4.1446, df = 19.247, p-value = 0.0005379
19)
option D is correct.
20)
option B is correct.
Use Random number generator (under Data Analysis) to simulate the following data set. Create 10 columns, each 20 points...
Use Random number generator (under Data Analysis) to simulate the following data set. Create 10 columns, each 20 points long and use the following parameters: Number of variables (10), number of data point (20), Distribution (Normal), Mean (40), Standard Deviation (10), Random seed (1234). The data should be in columns: A,B,C,….,I,J. Randomly pick two columns (say Column B and Column H) and perform 2-sided t-test on these two data columns. Record the P-value and repeat this procedure several times (at...
Use Random number generator (under Data Analysis) to create THREE columns of data each 20 points long. -For the FIRST data set use the following parameters: number of variables (2), number of data point (20), Distribution (Normal), Mean (20), Standard Deviation (5), Random seed (1234). Keep these data in columns labeled A and B. -For the SECOND data set use the following parameters: number of variables (1), number of data point (20), Distribution (Normal), Mean (30), Standard Deviation (5), Random...
Use Random number generator (under Data Analysis) to create two sets of data each 20 points long. Use the following parameters: number of variables (2), number of data point (20), Distribution (Normal), Mean (20), Standard Deviation (5), Random seed (4565). Perform t-test on these two data sets. And answer the questions for Assessment. (Pick the closest answer) 1. What is the range P-value? a. Between 0% and 1% b. Between 1% and 2% c. Between 2% and 3% d. None...
DATA IN COMMENT BELOW In what follows use any of the following tests/procedures: Regression, multiple regression, confidence intervals, one sided T-test or two sided T-test. All the procedures should be done with 5% P-value or 95% confidence interval.Some answers are approximated, choose the most appropriate answer. Open Cars04 data. SETUP: Is it true that average (retail) cost for the cars with 6 cylinders is higher than $35000? Given the data your job is to help answer this question. I. What...
data: https://www.limes.one/Content/DataFiles/Brains.txt In what follows use any of the following tests/procedures: Regression, confidence intervals, one-sided t-test, or two-sided t-test. All the procedures should be done with 5% P-value or 95% confidence interval Open Brain data. SETUP: Common sense dictates that a person’s IQ and Weight should not be related. However, one never knows until one examines the data. Given the data your job is to check if the common sense assumption is reasonable or maybe it is not. 13. What...
Based on the question what is the answer to number one
In what follows use any of the following tests/procedures: Regression, confidence intervals, one-sided t-test, or two-sided t-test. All the procedures should be done with 5% P-value or 95% confidence interval The empirical literature suggests that class absenteeism undermines students' academic performance and that an enforced mandatory attendance policy may be beneficial Open Attendance data. Using data on 90 second-year students enrolled in a Statistics course, test students who are...
TRY IT 1.11 You are going to use the random number generator to generate different types of samples from the data. This table displays six sets of quiz scores (each quiz counts 10 points for an elementary statistics class #1 #2 #3 34 #5 #6 5 7 10 9 8 10 5 9 8 7 6 9 10 8 6 7 9 9 10 10 9 8 9 7 8 9 5 7 4 9 9 9 10 8 7...
8. A random sample of 25 college males was obtained and each was
asked to report their actual height and what they wished as their
ideal height. A 95% confidence interval for μd= average difference
between their ideal and actual heights was 0.8" to 2.2". Based on
this interval, which one of the null hypotheses below (versus a
two-sided alternative)can be rejected?
A. H0: μd= 0.5
B. H0: μd= 1.0
C. H0: μd= 1.5
D. H0: μd= 2.0
9. The...
In what follows use any of the following tests/procedures: Regression, multiple regression, confidence intervals, one sided T-test or two sided T-test. All the procedures should be done with 5% P-value or 95% confidence interval.Some answers are approximated, choose the most appropriate answer. Open Brains data. SETUP: Are the IQ’s for male and female subjects different? Given your job is to confirm or disprove this assertion. I. What test/procedure did you perform? (6.66 points) a. One sided T-test b. Two sided...
In what follows use any of the following tests/procedures: Regression, multiple regression, confidence intervals, one sided T-test or two sided T-test. All the procedures should be done with 5% P-value or 95% confidence interval.Some answers are approximated, choose the most appropriate answer.Open Brains data. SETUP: Some people believe that the first born children are more intelligent. Given the data your job is to confirm or disprove this assertion. I. What test/procedure did you perform? (6.66 points) a. One sided T-test...