A survey is carried out to study the number of hours, X, per day spent on using the Internet by the customers of an Internet Service Provider (ISP). Responses from 15 randomly selected customers give the following data:
3 2 5 6 1 5 3 4 5 2 4 5 9 5 1
(a) Determine the mean and variance of the data set.
(b) Is there an outlier in the data set? Justify your answer.
(c) Suppose a selected customer is found to spend at most 5 hours on using the Internet. What is the
probability that this customer spends more than 3 hours on using the Internet?
(d) Suppose the customer service manager of the ISP discovers that one data point has been mistakenly recorded to 4, and that data point should be less than 4. State whether the mode, median and mean of the correct sample become smaller than, greater than or the same as those of the incorrect sample.
Explain your answer.
Consider a random variable X representing a number of hours per day spend on using the Internet by the customers. The 15 randomly selected customers give the following values:
X : 3, 2, 5, 6, 1, 5, 3, 4, 5, 2, 4, 5, 9, 5 and 1.
(a):
The mean of a set of observations is the sum of the observations
divided by the total number of observations. It is denoted by x̅.
Mathematically, it is defined as,
where ∑x_{i }denotes the sum of all data points and n denotes the total number of observations.
As there are 15 records thus the value of n is 15.
Add all the observations and divide it by the total number of observations.
Substitute the respective values to obtain the value of x̅.
Thus, the mean of the given data set is 4.
The sample standard deviation indicates how far are the data points from the mean value in the sample and is denoted by s. The square value of s i.e. s^{2} is the variance of the data set.
Mathematically, it is defined as,
, where ∑x_{i}^{2}
are the sum of the square of data points, x̅ is the mean, and 'n' is the total number of observations.
Substitute the respective values to obtain the value of s^{2}.
Thus, the value of variance, s^{2} is 4.429.
(b):
Box plot also known as box-and-whisker diagrams represents the distribution of data based on the five measures like minima, maxima, first quartile, median and third quartile of the data.
It contains a box whose bottom side represents the first quartile and the top side represents the third quartile.
The desired box plot is shown below of the given observations.
Minitab Technology: Graph-> Boxplot-> Select desired variable under graph variables-> OK.
The obtained Box Plot is:
The above box plot suggests that there is no outlier in the given data (Hours).
The distribution of hours is skewed as it has long whisker on the right side.
The five measure summary of the box plot listed below:
Minimum | 1 |
Q_{1} | 2 |
Median | 4 |
Q_{3} | 5 |
Maximum | 9 |
(c):
As the continuous random variable X denotes the rate i.e. hours per day thus it follows the exponential distribution with parameter λ per hour in a day.
Since the mean value of X is 4 hours thus the value of λ will be 1/4 per hour. Then, the pdf of X will be,
Thus, the probability that a customer spends more than 3 hours given that he spends at most 5 hours on the using the internet i.e..
Obtian the value of and respectively.
and
Divide 0.185 and 0.713 to obtain the desired probability.
That is,
Thus, the value of desired probability is 0.259.
(d):
Mode is the most frequent value of the data, thus the value of mode is 5 as it occurs most times as compared to others.
The value of both, mean and median is 4.
If the record 4 of the data was incorrect and it was actually less than 4 then the value of mean and median will affect as both depend on incorrect record i.e. 4.
Since the actual record is less than 4 thus the value of mean and median will be less than 4 (or less than the mean and median based on the incorrect sample).
As the mode of the given data is 5 which is not based on the incorrect record thus the value of the model will remain unchanged.
A survey is carried out to study the number of hours, X, per day spent on using the Internet by the customers of an Internet Service Provider (ISP). Responses from 15 randomly selected customers give the following data:
survey is carried out to study the number of hours, X, per day spent on using the Internet by the customers of an Internet Service Provider (ISP). Responses from 15 randomly selected customers give the following data: 3 2 5 6 1 5 3 4 6 2 4 5 6 5 1 (a) Find out the five‐number summary and sketch a boxplot. Describe the distribution shape. (b) Suppose a selected customer is found to spend at most 5 hours...
An internet service provider states that the average number of hours its customers are online each day is 4.01. Suppose a random sample of 14 of the companys customers is selected and the average number of hours that they are online each day is measured. The sample results are shown in the accompanying table. _____________________ 3.85. 3.02. 6.16. 2.59. 3.71. 4.26. 8.07 3.63. 6.42. 1.37. 6.38. 7.89. 7.31. 1.19 _____________________ Based on the sampling of 14 customers, how much sampling...
An Internet service provider states that the average number of hours its customers are online each day is 4.08. Suppose a random sample of 14 of the company's customers is selected and the average number of hours that they are online each day is measured. The sample results are shown in the accompanying table. 5.76 6.08 4.26 5.85 6.82 1.77 7.66 5.11 3.89 2.67 6.89 6.68 3.58 1.12 Based on the sample of 14 customers, how much sampling error exists?...
iPad 11:04 AM 59% (1 point) An internet service provider advertises that customers on its "high speed" plan will experience average download speeds of 20 Mbps. A dissatisfied customer believes his download speeds are significantly less than advertised. To test this claim, he checks his download speed at 49 randomly selected times over the course of one month. Assume this can be treated as a Simple Random Sample. Grades 1. Let u be the true average download speed (in Mbps)...
gnt wll take 20 or less minutes to get 6. (10 marks) AbeNet, an Internet service provider 0SP), has esperieeed rapid guowth in and past five years. As a part of its marketing strategy, AbcNet promists dependable service. To achieve its obiectives. AbcNet constantl its servers. On fast connections and y evaluates the capacity of e component of its evaluation is an analysis of the average amount of time connected and using the Internet daily. A random sample of 12...
An Internet service provider has three different subscription packages for its customers: Package A: For $15 per month with 50 hours of access provided. Additional hours are $2.00 per hour over 50 hours. Assume usage is recorded in one-hour increments. Package B: For $20 per month with 100 hours of access provided. Additional hours are $1.50 per hour over 100 hours. Package C: For $25 per month with 150 hours access is provided. Additional hours are $1.00 per hour...
QUESTION 1: Nine randomly selected customers at a local fast-food restaurant ordered meals having the following calorie counts: 975, 520, 1560, 872, 1028, 533, 362, 954, and 1016. Let y denote the calorie content of a meal ordered at this restaurant. Find the following sum. ∑y2= Enter your answer in accordance to the question statement. QUESTION 2: The following data give the time (in minutes) that each of 20 students waited in line at their bookstore to pay for their...
Need help asap!!! You are given the following dataset about the number of hours spent at the library on one day. 1.05 1.051.08 1.10 1.11 1.13 1.18 1.20 1.20 1.24 1.28 1.34 1.45 1.50 1.52 1.90 1.99 2.55 1.09 1.09 1.14 1.16 1.23 1.24 1.38 1.39 1.53 1.86 3.03 4.19 1. Using your dataset, make a frequency distribution table that displays intervals. Fill in the table below. Start your interval at 1.00. Use 2 decimals to help make your intervals,...
1. A baseball team claims that the mean length of its games is 2.8 hours. State H0 and Ha in words and in symbols. Then determine whether the hypothesis test for this claim is left-tailed, right-tailed, or two-tailed. Explain your reasoning. State the null hypothesis in words and in symbols. Choose the correct answer below. 2. An Internet provider is trying to gain advertising deals and claims that the mean time a customer spends online per day is greater than...
3. Waiting times (in minutes) of customers in a bank where all customers enter a single waiting line and a bank where customers wait in individual lines at three different teller windows are listed below. Find the mean and median for each of the two samples, then compare the two sets of results. Single Line 6.36.3 6.66.6 6.76.7 6.86.8 6.96.9 7.17.1 7.67.6 7.87.8 7.87.8 7.87.8 Individual Lines 4.14.1 5.55.5 6.06.0 6.16.1 6.26.2 7.87.8 7.87.8 8.78.7 9.29.2 10.010.0 The mean waiting...
> How about using normal distribution in (c)?
john7450333 Sun, Oct 10, 2021 11:14 PM