Basics of Data Science

Question 1
Marks : +2 | -2
Pass Ratio : 100%
Which of the following is performed by Data Scientist?
Define the question
Create reproducible code
Challenge results
All of the mentioned
Explanation:
A data scientist is a job title for an employee or business intelligence (BI) consultant who excels at analyzing data, particularly large amounts of data.
Question 2
Marks : +2 | -2
Pass Ratio : 100%
Which of the following is characteristic of Processed Data?
Data is not ready for analysis
All steps should be noted
Hard to use for data analysis
None of the mentioned
Explanation:
Processing includes merging, summarizing and subsetting data.
Question 3
Marks : +2 | -2
Pass Ratio : 100%
Which of the following is the most important language for Data Science?
Java
Ruby
R
None of the mentioned
Explanation:
R is free software for statistical computing and analysis.
Question 4
Marks : +2 | -2
Pass Ratio : 100%
Raw data should be processed only one time.
True
False
Explanation:
Raw data may only need to be processed once.
Question 5
Marks : +2 | -2
Pass Ratio : 100%
Point out the correct statement.
Raw data is original source of data
Preprocessed data is original source of data
Raw data is the data obtained after processing steps
None of the mentioned
Explanation:
Accounting programs are prototypical examples of data processing applications.
Question 6
Marks : +2 | -2
Pass Ratio : 100%
Which of the following is one of the key data science skills?
Statistics
Machine Learning
Data Visualization
All of the mentioned
Explanation:
Data visualization is the presentation of data in a pictorial or graphical format.
Question 7
Marks : +2 | -2
Pass Ratio : 100%
Which of the following would be more appropriate to be replaced with question mark in the following figure?
Data Analysis
Data Science
Descriptive Analytics
None of the mentioned
Explanation:
Data Science is a multidisciplinary which involves extraction of knowledge from large volumes of data that are structured or unstructured.
Question 8
Marks : +2 | -2
Pass Ratio : 100%
Point out the wrong statement.
Merging concerns combining datasets on the same observations to produce a result with more variables
Data visualization is the organization of information according to preset specifications
Subsetting can be used to select and exclude variables and observations
All of the mentioned
Explanation:
Data formatting is the organization of information according to preset specifications.
Question 9
Marks : +2 | -2
Pass Ratio : 100%
Which of the following is a key characteristic of a hacker?
Afraid to say they don’t know the answer
Willing to find answers on their own
Not Willing to find answers on their own
All of the mentioned
Explanation:
Hacker is an expert at programming and solving problems with a computer.
Question 10
Marks : +2 | -2
Pass Ratio : 100%
Which of the following approach should be used to ask Data Analysis question?
Find only one solution for particular problem
Find out the question which is to be answered
Find out answer from dataset without asking question
None of the mentioned
Explanation:
Data analysis has multiple facets and approaches.