
1. Which variables are ordinal? Which are nominal?
Nominal variables: and
Ordinal Variables: (although it is coded as continuous)
Answer Options:
1a. Nominal variables selections: Calories, Type, Protein or
Potassium, MFG
1b. Ordinal Variables selections: Calories, Shelf, Protein
2. Use Cols>Columns Viewer to obtain summary statistics.
Which, if any, of the variables is missing values?
The variables , , and have missing values. The data set has missing
values in total.
Answer Options:
2a. Calories, Carbo, Protein & Fat
2b. Fiber, Fat, Sodium & sugar
2c. Potassium, Vitamins, weight Shelf
2d. 3,4,5
3. Use Analyze > Distribution to plot a histogram for each of the continuous variables and create summary statistics. Based on the histograms and summary statistics, choose the correct answers:
and have the largest standard deviations, and so are the most variable.
The variables , , , and seem (right) skewed.
Answer Options:
3a. fat, sodium, calories
3b. fibe, rating potassium
3c. shelf, fat, cups
3d. names, cups, fiber
3e. potassium, carbs, sugars
3f. protein, calories, rating
4. Use the Graph Builder to plot a side-by-side box-plot comparing the calories in hot vs. cold cereals. What does this plot show us?
We see that in cold cereals, the different cereals vary in the amount of calories mainly between approximately , whereas all 3 of the hot cereals have 100 calories.
Answer Options:
100-140
90-120
50-100
5. Use the Graph Builder to plot a side-by-side box-plot of consumer rating as a function of the shelf height (the variable shelf). If we were to predict consumer rating from shelf height, does it appear that we need to keep all three categories of shelf height?
The following conclusion is :
Since the distribution of ratings seems to differ at each of the three shelf heights, it appears that we need to keep all three categories.
Answer Options:
True
False
Nominal variables are those which do not have numerical value or cannot be ranked whereas ordinal values can be ranked.
1> Based on this, the type is nominal and calorie, shelf, potassium, protein amount can be measured and thus they are ordinal. In this way, we can classify variables.
1. Which variables are ordinal? Which are nominal? Nominal variables: and Ordinal Variables: (although it is...
11. Quantitative variables are also referred to as continuous or interval variables. 12. Categorical variables consist of separate, indivisible categories. 13. Categorical variables may also be refered to as nominal, ordinal, discrete, or qualitativ 14. A dichotomous variable is one that has only two possible levels or categories 15. Age is a quantitative variable, but one could recode the values so that it would be transformed into a dichotomous variable 16. When conducting a multivariate analysis, the best recommendation is...
I can't attach the data due to the file being real large i can email it to you so i can have your help on it # Assignment 1 # R Programming Language # ---- Why do Exploratory Data Analysis (EDA)? ---- # We will be looking at ## identifying outliers ## null values ## generating plots ## examining correlations # -------------------------------------------------------------- # In this video we will cover: ## univariate plots for continuous variables (boxlots, historgrams) ## bivariate plots...
Question 1. Choose one of the three options described below. Analyze the data using what you have learned in class. Instructions: Your paper will include the following: A statement of the research question A description of the source of the data A description of the variables used in the analysis and an explanation of why you chose these variables Graphs (with clear labels) and numerical summaries to support your analysis Explanations that reflect the use of Unit 2 concepts An...
1. State whether the following variables are nominal, ordinal, interval or ratio. (7 pts.) Variable Type (nominal, ordinal, interval or ratio) Gender Race Patient Satisfaction Scores Temperature Weight Height Blood pressure 3. Define and provide examples of the following terms. Use 3-5 sentences for each term. (30 points) a. Health Informatics b. Data Quality Management c. Interoperability d. Data Lake e. Data f. Information g. Standardized h. Unstandardized i. Data Standard j. Health Information Exchange (HIE) k. Relational Database l....
What does the five number tell us about the time spent on email
(Hint, interpret the five number summary in plain English) and what
does the Boxplot and the normality test show? Explain.
Use the 1.5xIQR rule to identify possible outliers. List the
cutoff points for outliers, Show your workings. Explain what you
found out. (Hint: Are there any excessive time spent on email for
Male(1) or Female(2) or both).
GET DATA /TYPE-XLS /FILE='C: \Users\rmanda 1 \ Desktop\homework! . xls'...
For each of the following variables, identify the type of
variable (categorical vs. numeric).
(I) Number of auto insurance claims in a month
(II) Film genre (e.g. comedy, horror, drama, etc)
Question 1 options:
1)
(I) Categorical , and (II) Categorical
2)
There is no correct match.
3)
(I) Categorical , and (II) Numeric
4)
(I) Numeric , and (II) Numeric
5)
(I) Numeric , and (II) Categorical
Question 2 (1 point)
Saved
For each of the following variables, identify...
Managers at the Turquoise Oasis Spa are interested in knowing whether there is a relationship between the age of their clients and the amount of money they spend in the spa. Upon analyzing the data, they found a correlation coefficient of .672. Thus, the relationship between the two variables is _________. strong moderate weak very weak Calculating the correlation coefficient using the_________ function is useful to determine the type and strength of a relationship between two variables. CAUSATION COVARIANCES CORREL...
can i get help answering these questions please i cant seem to
undestand how to solve
Find the equation of the regression line for the given data. The construct a scatter plot of the data and draw the regression in Each of the has a signifantomation) Then use the regression in to pred value ofy for each of the given to meaning. The color content and the sodium content in migrans) forte he dogs are shown in the late to...
identify the 25 correct statements in the set below
identify 25 correct statements
Please identify the 25 correct statements in the set below: Discovery analytics focuses on the question "Why did it happen?" Predicting a presidential candidate's percentage of the statewide vote from a sample of 800 voters would be an example of inferential statistics. This year, Oxnard University produced two football All-Americans. This is an example of continuous data. Sturges' Rule is merely a suggestion, not an ironclad requirement....
Question 1 ASW, a regional shoe chain, has recently launched an online store. Sales via the Internet have been sluggish compared to their brick and mortar stores, and management suspects that its regular customers have concerns regarding the security of online transactions. To determine if this is the case, they plan to survey a random sample of their regular customers. Under consideration are several plans for selecting the sample. Name the sampling strategy for each. Plan A - Regular customers...