Analysis and Experimental Design

Question 1
Marks : +2 | -2
Pass Ratio : 100%
Which of the following is commonly referred to as ‘data fishing’?
Data bagging
Data booting
Data merging
None of the mentioned
Explanation:
Data dredging is sometimes referred to as “data fishing”.
Question 2
Marks : +2 | -2
Pass Ratio : 100%
Which of the following is the top most important thing in data science?
answer
question
data
none of the mentioned
Explanation:
The second most important is the data.
Question 3
Marks : +2 | -2
Pass Ratio : 100%
Which of the following is a good way of performing experiments in data science?
Measure variability
Generalize to the problem
Have Replication
All of the mentioned
Explanation:
Experiments on causal relationships investigate the effect of one or more variables on one or more outcome variables.
Question 4
Marks : +2 | -2
Pass Ratio : 100%
Which of the following approach should be used if you can’t fix the variable?
randomize it
non stratify it
generalize it
none of the mentioned
Explanation:
If you can’t fix the variable, stratify it.
Question 5
Marks : +2 | -2
Pass Ratio : 100%
Which of the following design term is perfectly applicable to the below figure?
Correlation
Confounding
Causation
None of the mentioned
Explanation:
Confounding can be dealt with either at the study design stage, or at the analysis stage.
Question 6
Marks : +2 | -2
Pass Ratio : 100%
Which of the following figure correctly shows approximate order of difficulty?
All of the mentioned
Explanation:
Predictive analysis is the practice of extracting information from existing data sets.
Question 7
Marks : +2 | -2
Pass Ratio : 100%
Which of the following data mining technique is used to uncover patterns in data?
Data bagging
Data booting
Data merging
Data Dredging
Explanation:
Data dredging, also called as data snooping, refers to the practice of misusing data mining techniques to show misleading scientific ‘research’.
Question 8
Marks : +2 | -2
Pass Ratio : 100%
If X predicts Y, it does mean X causes Y.
True
False
Explanation:
If X predicts Y, it does not mean X causes Y.
Question 9
Marks : +2 | -2
Pass Ratio : 100%
Point out the correct statement.
If equations are known but the parameters are not, they may be inferred with data analysis
If equations are not known but the parameters are, they may be inferred with data analysis
If equations and parameter are not, they may be inferred with data analysis
None of the mentioned
Explanation:
Usually the random component of data is measurement error.
Question 10
Marks : +2 | -2
Pass Ratio : 100%
Point out the wrong statement.
Randomized studies are not used to identify causation
Complication approached exist for inferring causation
Causal relationships may not apply to every individual
All of the mentioned
Explanation:
Randomized studies are usually used to identify causation.