Home
Courses
About
School
Notes
Questions
MCQ
Contact
Vocational Courses
Data Science
Data Science
caret
Question 1
Marks :
+2
|
-2
Pass Ratio :
100%
Which of the following can also be used to find new variables that are linear combinations of the original set with independent components?
ICA
SCA
PCA
None of the mentioned
Explanation:
ICA stands for independent component analysis.
Question 2
Marks :
+2
|
-2
Pass Ratio :
100%
Which of the following function is a wrapper for different lattice plots to visualize the data?
levelplot
featurePlot
plotsample
none of the mentioned
Explanation:
featurePlot is used for data visualization in caret.
Question 3
Marks :
+2
|
-2
Pass Ratio :
100%
The preProcess class can be used for many operations on predictors.
True
False
Explanation:
Operations include centering and scaling.
Question 4
Marks :
+2
|
-2
Pass Ratio :
100%
Which of the following function can be used to flag predictors for removal?
searchCorrelation
findCausation
findCorrelation
none of the mentioned
Explanation:
Some models thrive on correlated predictors.
Question 5
Marks :
+2
|
-2
Pass Ratio :
100%
Which of the following function can create the indices for time series type of splitting?
newTimeSlices
createTimeSlices
binTimeSlices
none of the mentioned
Explanation:
Rolling forecasting origin techniques are associated with time series type of splitting.
Question 6
Marks :
+2
|
-2
Pass Ratio :
100%
The function preProcess estimates the required parameters for each operation.
True
False
Explanation:
predict.preProcess is used to apply them to specific data sets.
Question 7
Marks :
+2
|
-2
Pass Ratio :
100%
caret does not use the proxy package.
True
False
Explanation:
caret uses the proxy package.
Question 8
Marks :
+2
|
-2
Pass Ratio :
100%
Point out the wrong statement.
Simple random sampling of time series is probably the best way to resample times series data.
Three parameters are used for time series splitting
Horizon parameter is the number of consecutive values in test set sample
All of the mentioned
Explanation:
Simple random sampling of time series is probably not the best way to resample times series data.
Question 9
Marks :
+2
|
-2
Pass Ratio :
100%
The advantage of using a model-based approach is that is more closely tied to the model performance.
True
False
Explanation:
Model-based approach is able to incorporate the correlation structure between the predictors into the importance calculation.
Question 10
Marks :
+2
|
-2
Pass Ratio :
100%
For most classification models, each predictor will have a separate variable importance for each class.
True
False
Explanation:
The exceptions are classification trees, bagged trees and boosted trees.
1
2
3
4
5
6
7
8
9
10
Time Left:
Submit
Score
:
Rank
:
Accuracy
:
%
Time
:
Total Question
:
Attempted
:
Correct
:
Wrong
:
User Name
Email Id
Mobile No
Submit