Data Science MCQ: Multiple Choice (Objective Type) Questions Set

Kanchan

Unit 1
1) KDD is also called …?
1) Data visualization
2) Data science
3) Data mining
4) Database
Ans- 3
2) Which of the following techniques is most important in classification?
1) Big data
2) Machine learning
3) All of the above
4) None of the above
Ans- 2
3) Linear regression is a supervised machine learning algorithm.
1) True
2) False
Ans- 1
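Study note: a minimal sketch of why linear regression counts as supervised learning. The fit uses labelled pairs, where every input x comes with a known output y. The function name `fit_line` and the sample data are assumptions made for illustration only.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (unnormalised).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Labelled training data: hours studied (input) and score (known output).
hours = [1, 2, 3, 4]
scores = [3, 5, 7, 9]
slope, intercept = fit_line(hours, scores)

# Use the learned line to predict the score for an unseen input.
predicted = slope * 5 + intercept
```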
4) What is data visualization?
1) It is a numerical representation of information and data.
2) It is a character representation of information and data.
3) It is a graphical representation of information and data.
4) All of the above
Ans- 3
5) What are the common types of data visualization?
1) Charts
2) Tables
3) Infographics
4) All
Ans- 4
6) Which of the following are software engineering trends and techniques?
I) Big data
II) LCNC
III) Veracity
IV) OLAP (online analytical processing)
V) Power and energy
VI) IoT
1) I,II,IV
2) I,II,VI
3) III,IV,VI
4) III,V,I
Unit 2
1) What type of data does a database store?
1) real time information.
2) historical data.
3) both 1 and 2
4) none
Ans- 1
2) Which of the following is about simulating human intelligence?
1) ANN
2) AI
3) Both
4) None
Ans-2
3) ANN is inspired by…?
1) Human intelligence
2) Human neurons
3) Human language
4) None
Ans-2
4) Which of the following is not a step in the hypothesis process?
1) Make assumptions
2) Set acceptance criteria
3) Evaluate results
4) Future investigation
Ans-4
5) Which type of data takes longer to search but has a predictable memory size?
1) Structured data
2) Unstructured data
3) Scalable data
4) Non scalable data
Ans-4
Unit 3
1) Why do research?
1) To solve unsolved and challenging problems.
2) To analyze the process
3) To identify the cause and effect relationship
4) All
Ans-1
2) Which process is the important turning point in data science?
1) Data visualization
2) Data analysis
3) Big data
4) Data mining
Ans-2
3) Which of the following is not a factor of big data?
1) Volume
2) Velocity
3) Variety
4) Vacancy
Ans-4
4) The process of taking a single problem and breaking it into smaller parts that are worked on simultaneously is called?
1) Big data
2) Data analysis
3) Parallel computing
4) All
Ans- 3
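Study note: a minimal sketch of the idea behind parallel computing, where one problem (summing a list) is broken into chunks that are handled by separate workers and then combined. The helper names `chunked` and `parallel_sum` are illustrative assumptions. Threads are used here only to keep the sketch short; in standard CPython, CPU-bound work generally needs processes to run truly in parallel.

```python
from concurrent.futures import ThreadPoolExecutor

def chunked(values, size):
    """Split a list into consecutive pieces of at most `size` items."""
    return [values[i:i + size] for i in range(0, len(values), size)]

def parallel_sum(values, workers=4):
    """Sum a list by summing chunks in separate workers, then combining."""
    if not values:
        return 0
    size = max(1, len(values) // workers)
    chunks = chunked(values, size)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Each worker computes the partial sum of one chunk.
        partial_sums = list(pool.map(sum, chunks))
    # Combine the partial results into the final answer.
    return sum(partial_sums)

total = parallel_sum(list(range(1, 101)))
```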
5) Which of the following is not a type of artificial intelligence?
1) Reactive machines
2) Limited memory
3) Theory of mind
4) Increase the revenue
Ans-4
Unit 4
1) What is the motive of data science?
1) Turn data into actionable value
2) High performance
3) Storage of data
4) All
Ans-1
2) Which of the following is not an application of data science?
1) Health care
2) Keras
3) Internet search
4) All
Ans-2
3) Which of the following is an example of data science?
1) Army and weapons
2) Volume
3) Data analysis
4) Data mining
Ans-1
4) SAS stands for
1) Statistical analysis system
2) Science about software
3) Both
4) None
Ans-1
5) When was the term data science first used?
1) 2005
2) 2006
3) 2001
4) 2002
Ans-3
Unit 5
1) Which of the following are aspects of experimentation?
1) Planning
2) Factors
3) Result
4) All
Ans-4
2) Anaconda also includes a package manager called?
1) Ana
2) Conda
3) Keras
4) None of the above
Ans-2
3) What is PyCharm?
1) Popular IDE for Python
2) Standardizing on Keras
3) Extensive collection
4) All
Ans-1
4) Which of the following is an area of statistics?
1) Segmentation
2) Clustering
3) Predictive analytics
4) All
Ans-3
5) What is the study of computational systems?
1) EDA
2) Informatics
3) Mathematics
4) Segmentation
Ans-2
6) Data cleansing is an important step of EDA.
1) True
2) False
Ans-1
Unit 6
1) What is Hadoop?
1) open source framework
2) programming language
3) Both
4) None
Ans-1
2) Which of the following is not a responsibility of a data scientist?
1) Management
2) Analytics
3) Acquiring
4) Collaboration
Ans-3
3) Which of the following is not a point in the data acquisition process?
1) Production is undertaken
2) A need for a data must be identified
3) Required data is carried out
4) Solitary role
Ans-4
4) Which of the following is a broad category of statistics?
1) Describe
2) Information
3) Inferential
4) None
Ans-3
5) The data science pipeline is also called
1) Data science pipe
2) Data science array
3) Data science life cycle
4) Data science algorithm
Ans-3

Maya

Q6. Regression analysis is a form of________ modeling technique. 
A. Machine learning
B. AI
C. Predictive
D. None of the above
Ans. C. 
Q7. Stochastic gradient descent and k-nearest neighbours are classification techniques.
A. True
B. False
Ans. A. 
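Study note: a minimal sketch of k-nearest-neighbour classification, assuming two-dimensional points and Euclidean distance. A query point receives the majority label among its k closest training points. The function name `knn_predict` and the toy data are illustrative assumptions.

```python
import math
from collections import Counter

def knn_predict(train, query, k=3):
    """Return the majority label among the k training points nearest to query.

    train is a list of (point, label) pairs, where point is a tuple of numbers.
    """
    by_distance = sorted(train, key=lambda item: math.dist(item[0], query))
    nearest_labels = [label for _, label in by_distance[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]

# Toy labelled data: two well-separated groups.
train = [((1, 1), "low"), ((1, 2), "low"), ((2, 1), "low"),
         ((8, 8), "high"), ((8, 9), "high"), ((9, 8), "high")]

label = knn_predict(train, (2, 2))
```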
Q8. Raw data is also known as
A. Secondary
B. Permanent
C. Temporary
D. Eggy
Ans. D. 
Q9. A data warehouse is which of the following? 
A. Can be updated by end users.
B. Contains numerous naming conventions and format. 
C. Organised around important subject areas. 
D. Contains only current data. 
Ans. C.
Q10. OLTP stands for
A. Online transactional processing. 
B. Online transactional program.
C. Online transport processing. 
D. None of the above. 
Ans. A.
Q11. A data warehouse stores________ data about business. 
A. Future
B. Historical
C. Both A and B
D. None of the above
Ans. B. 
Q12. Learning, reasoning and self correction are the skills of AI programming. 
A. True
B. False
Ans. A. 
Q13. The _________ technique analyses complete data or a sample of summarised numerical data.
A. Descriptive
B. Diagnostic
C. Statistical
D. Predictive
Ans. A.
Q14. __________ has numerical values and can perform more than one task simultaneously.
A. AI
B. ANN
C. Both A and B
D. None of the above
Ans. B. 
Q15. _________ is one of the key data science skills.
A. Machine Learning
B. Statistics
C. Data Visualization
D. All of the above
Ans. D.
Q16. Which among the following is the top most important thing in data science?
A. Answer
B. Question
C. Data
D. None of the above
Ans. B.
Q17. CLI stands for ________.
A. Command Line Intercom
B. Command Line Interface
C. Command Language Interface
D. None of the above
Ans. B.
Q18. The applications of Data Science are __________.
A. Healthcare
B. Fraud and Risk Detection
C. Airline Route Planning
D. All of the above
Ans. D.
Q19. The data science applications in healthcare are _______.
A. Data Science for Medical Imaging
B. Data Science for Genomics
C. Drug Discovery with Data Science
D. All of the above
Ans. D.
Q20. The main classes in predictive modeling are ________.
A. Parametric Predictive Modeling
B. Non-Parametric Predictive Modeling
C. Both A and B
D. None of the above
Ans. C.
Q21. The kernels used in SVM are ___________.
A. Gaussian Kernel
B. Polynomial Kernel
C. Sigmoid Kernel
D. All of the above
Ans. D.
Q22. Data that summarize all observations in a category are called __________ .
A. summarized data
B. frequency data
C. Both A and B
D. None of the above
Ans. A.
Q23. Which of the following is an essential process in which intelligent methods are applied to extract data patterns?
A. Warehousing
B. Data Mining
C. Text Mining
D. Data Selection
Ans. B.
Q24. _________ is a high level API built on TensorFlow.
A. Flask
B. Keras
C. Pycharm
D. Theano
Ans. B. 
Q25. Predictive analytics uses statistics and _______ to determine future performance.
A. Algorithmic techniques
B. Modeling techniques
C. System development and design techniques
D. None of the mentioned above
Ans. B
Q26. Which of the following are types of predictive analytics techniques?
A. Predictive models
B. Descriptive models
C. Decision models
D. All of the mentioned above
Ans. D. 
Q27. In descriptive statistics, data from the entire population or a sample is summarized with which of the following?
A. Decimal descriptive
B. Floating descriptive
C. Integer descriptive
D. Numerical descriptive
Ans. D. 
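Study note: a minimal sketch of numerical descriptive statistics using Python's standard `statistics` module. The `marks` list is made-up sample data used only for illustration.

```python
import statistics

# Made-up sample of exam marks.
marks = [42, 55, 61, 61, 70, 83]

# Numerical descriptors that summarise the whole sample.
summary = {
    "mean": statistics.mean(marks),      # average value
    "median": statistics.median(marks),  # middle value
    "mode": statistics.mode(marks),      # most frequent value
    "stdev": statistics.stdev(marks),    # sample standard deviation (spread)
}

for name, value in summary.items():
    print(f"{name}: {value}")
```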
Q28. What was Hadoop written in?
A. C
B. C++
C. Java
D. JCP
Ans. C.
Q29. OLAP stands for
A. Online analytical processing
B. Online analysis processing
C. Online analyst programming
D. None of the above.
Ans. A.
Q30. Regression analysis ______ .
A. Establishes a relationship between two variables.
B. Establishes cause and effect.
C. Measures growth.
D. Measures the demand for a good.
Answer: A. 
Q31. __________ is data that depends on a data model and resides in a fixed field within a record.
A : Structured data
B : Un-Structured data
C : Semi-Structured data
D : Scattere
Ans. A.

Sharada

Sharda Ramchandra Adkine
Class: B.Sc. (C.S.), third year
UNIT 1
Q.1.Which of the following statements is correct regarding data science?
1.Data science provides meaningful information based on large amounts of complex data or big data
2.Data science is related to data mining, machine learning and big data
3.both 1&2
4.none
Ans: 3.both 1&2
Q.2.Which one of the following is the most important language for data science?
1.Ruby
2.Java
3.R
4.none of these
Ans: 3.R
Q.3.The goal of data mining includes which of the following?
1.To explain some observed event or condition
2.To confirm that data exists
3.To analyze data for expected relationships
4.To create a new data warehouse
Ans: 1.To explain some observed event or condition
Q.4.Time and space are the two measures of the complexity of an algorithm
1.True
2.False
Ans:1.True
Q.5.Which of the following techniques is used to differentiate the collected data?
1.Regression
2.clustering
3.classification
4.data mining
Ans: 3.classification
Q.6.Which of the following focuses on the discovery of unknown properties of data?
1.data collecting
2.classification
3.big data
4.data mining
Ans: 4.data mining
Q.7.PWA stands for
1.process web app
2.progress web application
3.Process windows application
4.progressive web app
Ans: 4.progressive web app

UNIT 2
Q.1. A data warehouse is which of the following?
1.can be updated by end user
2.contains numerous naming conventions and formats
3.organized around important subject areas
4.contains only current data
Ans: organized around important subject areas
Q.2.A database stores real-time information about one particular part of a business.
1.True
2.False
Ans: True
Q.3.Who is the father of AI?
1.Fisher Ada
2.Alan Turing
3.John McCarthy
4.Allen Newell
Ans: John McCarthy
Q.4.Artificial intelligence is about
1.playing a game on computer
2.making a machine intelligent
3.programming on machine with your own intelligence
4.putting your intelligence in machine
Ans: making a machine intelligent
Q.5.What are the advantages of ANN?
1.Parallel processing capability
2.Storing data on entire network
3.Capability to work with incomplete knowledge
4.All of the above
Ans: All of the above
Q.6.A structure that can grow further, and in which searching for an element is still very quick, is
1.Unstructured data
2.Non scalable data
3.scalable data
4.structured data
Ans: scalable data
Q.7.Non-scalable data takes longer to search but has a predictable memory size
1.False
2.True
Ans: True
Q.8.What is the start or initial point of further investigation?
1.Data mining
2.Classification
3.AI
4.hypothesis technique
Ans: hypothesis technique
Q.9.What is true about a hypothesis?
1.It is an imperative process to simplify & deconstruct a problem
2.It provides an idea made from limited evidence
3.It is the start or initial point of further investigation
4.All of the above
Ans: All of the above

UNIT 3
Q.1. Research means 
1.Research is careful investigation or inquiry specifically through a search for new facts
2.It is an original contribution to the existing stock of knowledge, making for its advancement
3.It is the task of searching available data to modify a certain result
4.All of the above
Ans: All of the above
Q.2.What is the main role of research in education?
1.To upsurge one's social status
2.To increase one's job prospects
3.To augment one's personal growth
4.To help an applicant in becoming a renowned educationalist
Ans: To help an applicant in becoming a renowned educationalist
Q.3.Data analysis is defined as a process of
1.Data mining
2.Data collecting
3.Data analysis
4.Data cleaning
Ans: Data analysis
Q.4.Which analysis combines the insights from previous analyses to decide what action to take on a current problem?
1.Text analysis
2.Predictive analysis
3.Prescriptive analysis
4.Descriptive analysis
Ans: Prescriptive analysis
Q.5.Data analysis is the process of 
1.Inspecting data
2.cleaning data
3.transforming data
4.All of the above
Ans: All of the above
Q.6.Which of the following is not a major data analysis approach?
1.Data mining
2.Predictive analysis
3.business intelligence
4.Text analysis
Q.7.Which of the following characteristics of big data is relatively more concerned to data science?
1.velocity
2.value
3.variety
4.volume
Ans: variety
Q.8.Paradigm is the way to classify programming languages based on their features
1.False
2.True
Ans: True
Q.9.Which are the two types of statistics used in data analysis?
1.Text analysis
2.Descriptive
3.Inferential
4.both 2&3
Ans: both 2 & 3
Q.10.What is true about machine learning?
1.Machine learning is a field of computer science
2.ML is a type of artificial intelligence that extracts patterns out of raw data by using an algorithm or method
3.The main focus of ML is to allow computer systems to learn from experience without being explicitly programmed or human intervention
4.All of the above
Ans: All of the above

UNIT 4
Q.1. SAS stands for 
1.system analysis specification
2.system analysis system
3.statistical analysis system
4.statistical analysis specification
Ans: statistical analysis system
Q.2.Which of the following application are used in data science?
1.Health care
2.Fraud &risk detection
3.Target advertisement
4.All of the above
Ans: All of the above
Q.3.How much data does Google process every day?
1.1000 MB
2.20 petabytes
3.50 gigabytes
4.1.5 GB
Ans: 20 petabytes
Q.4. The best example of speech recognition is
1.voice
2.Siri
3.Cortana
4.All of the above
Ans: All of the above
Q.5.Unstructured data is also called organized data
1.False
2.True
Ans:False

UNIT-5
Q.1.Which of the following techniques produces multiple results?
1.classification
2.Transformation
3.Regression
4.Experimentation
Ans: Experimentation
Q.2.How many types of project deployment tools are there?
1.3
2.6
3.7
4.2
Ans: 6
Q.3.IDE stands for
1.integrated deployment environment
2.international development environment
3.integrated development environment
4.interested development environment
Ans: integrated development environment
Q.4.Which of the following factors affect experiment results?
1.seasonality
2.newness effects
3.segments
4.All of the above
Ans: All of the above
Q.5.Keras is a great place to start exploring deep learning
1.True
2.False
Ans: True
Q.6.What is the high-level API built on TensorFlow?
1.PyCharm (Python IDE)
2.Anaconda
3.keras
4.cloud platform
Ans: keras
Q.7.EDA stands for 
1.Exploratory Data Analysis
2.European Defence Agency
3.Enterprise Digital Assistant
4.Engineering Design Automation
Ans: Exploratory Data Analysis
Q.8.What is the most important step in EDA?
1.data mining
2.data collecting
3.data cleaning
4.organise data
Ans: data cleaning
Q.9.Which statement below is true related to EDA?
1.Getting a better understanding of data
2.identifying various data patterns
3.getting a better understanding of the problem statement
4.All of the above
Ans: All of the above
Q.10.Which process is used to analyze current as well as historical facts to make predictions about future events?
1.data analysis
2.data
3.predictive analysis
4.data visualization
Ans: predictive analysis
Q.11.Which model of predictive analysis describes the relationship between all the elements of the decision?
1.predictive model
2.decision model
3.descriptive model
4.None
Ans: decision model

UNIT-6
Q.1.Which of the following is not a goal of HDFS?
1.Fault detection and recovery
2.Handle huge dataset
3.prevent deletion of data
4.provide high network bandwidth for data movement
Ans: prevent deletion of data
Q.2.In which language is Hadoop written?
1.c#
2.Java
3.c++
4.python
Ans: java
Q.3.Who is the founder of Hadoop?
1.Gosling
2.Cloudera
3.Doug Cutting
4.James Cutting
Ans: Doug Cutting
Q.4.Which of the following is the most popular high level API in hadoop ecosystem
1.scalding
2.cascading
3.hcatalog
4.cascalog
Ans: cascading
Q.5.All of the following accurately describe Hadoop except
1.Java based
2.Distributed computing approach
3.Real time 
4.Open source
Ans:Real time
Q.6.In HDFS the files cannot be
1.Read
2.deleted
3.executed
4.None of these
Ans: executed
Q.7.HDFS stands for
1.Harley Davidson Financial service
2.Hadoop Distribution File System
3.Hubble Deep Field South
4.Hadoop Distributed File System
Ans: Hadoop Distributed File System
Q.8.Big data analysis does the following except
1.collects data
2.organizes data
3.spreads data
4.Analyzes data
Ans: spreads data
Q.9.The new source of big data that will trigger a big data revolution in the years to come is
1.Business transaction
2.social media
3.Transactional data and sensor data
4.RDBMS
Ans: Transactional data and sensor data
Q.10.Which of the following is correct skills for a data scientist?
1.Probability and statistics
2.machine learning
3.data wrangling
4.All of the above
Ans: All of the above

Mayuri

Q1. Which of the following is an essential process in which intelligent methods are applied to extract data patterns?
A. Data mining
B. Data science
C. Data storage
D. None of above
Ans. A. 
Q2. What is KDD in data mining?
A. Knowledge data house
B. Knowledge data definition
C. Knowledge discovery in databases
D. None of the above
Ans. C. 
Q3. Data analysis is the process of
A. Inspecting
B. Cleansing
C. Transforming
D. All of the above
Ans. D. 
Q4. The process of extracting information to identify patterns, trends and useful data, which allows businesses to make data-driven decisions from huge sets of data, is called data mining.
A. True
B. False
Ans. A. 
Q5. Which of the following is most important language for data science? 
A. Java
B. R
C. Python
D. All of the above
Ans. B.


Gaurav

1. Unit I

1) What is KDD in data mining?

a) Discovery Database

b)Knowledge Discovery Data

c)Knowledge Data definition

d)Knowledge data house

Answer : b

2) What are the functions of Data Mining?

a) Association and correlation analysis, classification

b)Prediction and characterization

c)Cluster analysis and Evolution analysis

d)All of the above

Answer : d

3) True-False: It is possible to design a Linear regression algorithm using a neural network?

a) TRUE

b) FALSE

Answer : a

4) Which of the following technologies is used in classification?

a) Classifier

b) classification model

c) Both a & b

d) None of these

Answer : c

5) Which of the following is not a type of regression?

a) polynomial regression

b) stepwise regression

c) Fundamental regression

d) Logistic regression

Answer : c

6) Bar chart , pie chart and column chart are used in........  .

a) Classification

b) Regression

c) Data visualisation

d) All of these

Answer : c

7) PWA stands for

a) Progressive web app

b) Progressive windows application

c) Power web app

d) Programming web app

Answer : a

2. Unit II

8) The full form of OLAP is

a) Online Application Programming

b) Online Advance processing

c) Online analytical processing

d) online analytical programming

Answer : c

9) The data is stored, retrieved and updated in ………………..

a) OLAP

b) OLTP

c) SMTP

d) FTP

Answer : b
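
Study note: a minimal sketch contrasting OLTP-style and OLAP-style work, using Python's built-in `sqlite3` module and an in-memory database. The `sales` table and its rows are made-up assumptions for illustration; a real data warehouse would use a separate analytical system rather than the same small database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales (id INTEGER PRIMARY KEY, region TEXT, amount REAL)"
)

# OLTP-style work: small transactions that store and update individual rows.
with conn:
    conn.executemany(
        "INSERT INTO sales (region, amount) VALUES (?, ?)",
        [("north", 100.0), ("south", 250.0), ("north", 50.0)],
    )
    conn.execute("UPDATE sales SET amount = 120.0 WHERE id = 1")

# OLAP-style work: a read-only aggregate over the whole table.
totals = dict(
    conn.execute("SELECT region, SUM(amount) FROM sales GROUP BY region")
)
conn.close()
```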

10) Analysis in a database is slow and painful due to ........

a) small volume

b) large volume

c) Both a & b

d) None of these

Answer : b

11) What is Artificial Intelligence?

a) A field that aims to make humans more intelligent

b) A field that aims to improve the security

c) A field that aims to develop intelligent machines

d) A field that aims to mine the data

Answer: c

12) Which of the following is a component of Artificial Intelligence?

a) Learning

b) Training

c) Designing

d) Puzzling

Answer: a

13) ANN stands for

a) Artificial Network Nation

b) Article Neural Network

c) Artificial Neural Network

d) Additional Neuro Network

Answer : c

14) Which of the following steps is not used in the hypothesis technique?

a) Make assumptions

b) Set acceptance criteria

c) Self awareness

d) Evaluate results

Answer : c

3. Unit III

15) Research methodology uses......

a) Hypothesis formulation

b) Data collection

c) Analysis

d) All of these

Answer : d

16) Volume, velocity, variety, veracity,value are related to.....

a) Research methodology

b) Big data management

c) Hypothesis technique

d) Data analysis

Answer : b

17) Which of the following are Big data management challenges

a) Growing Data stores

b) Data and architectural complexity

c)Ensuring data quality

d) All of these

Answer : d

18) Reactive machines, Limited memory,self awareness are the types of

a) artificial intelligence

b) machine learning

c) big data

d) none of these

Answer : a

19) Data Analysis is a process of?

a) transforming data

b) inspecting data

c)cleaning data

d)All of above

 Answer : d

20) Which of the following is the type of data analysis?

a) Text analysis

b) Statistical analysis

c) Diagnostic Analysis

d) All of these

Answer : d

4. Unit IV

21) CTR stands for

a) call to rate

b) click-through rate

c) common through rate

d) None of these

Answer : b

22) Which of the following are examples of the bright future of data science?

a) Automobile Industry

b) IT

c) Banking & Finance

d) All of these

Answer :d

23) "Drew Conway" published the diagrammatic explanation of data science basics in........ .

a) 2012

b) 2008

c) 2013

d) 2010

Answer : c

24) In banking and finance, to identify fraudulent activities before they can actually cause damage, the biggest innovation of that time is ............ .

a) scripto currency

b) crypto currency

c) cripto currency

d) None of these

Answer : b

25) The term data science was first time used in .....

a) 2000

b) 2004

c) 2001

d) 2003

Answer : c

5. Unit V

26) ............ is a technique which explores vital & valuable information.

a) Data science

b) Data mining

c) Both a & b

d) None of these

Answer : b

27) Which of the following is the work of data science?

a) Analysing the data

b) visualising the data

c) predictions

d) All of these

Answer : d

28) Which of the following is not a project development tool?

a) Anaconda

b) keras

c) segments

d) Flask

Answer : c

29) Which of the following are types of predictive analytics?

a) Predictive Model

b) Descriptive Model

c) Decision Model

d) All of these

Answer : d

30) ______ is simplest class of analytics.

a) Descriptive

b) Predictive

c) Prescriptive

d) Summarization

Answer : a

6. Unit VI

31) What was Hadoop written in?

a) python

b) Perl

c) Java

d) Ruby

Answer: c

32) What is the full form of HDFS?

a) Hadoop Differentiate File System

b) Hadoop Developer File System

c) Hadoop Distributed File System

d) Hadoop Data File system

Answer : c

33) RHIPE stands for...

a) R and Hadoop Integrated programming Environment .

b) R and Hadoop Integrated progressive Environment.

c) Real and Hadoop Integrated programming Environment

d) R and Hadoop Integrated progressive Environment

Answer : a

34) Management, analytics, strategy and collaboration are the responsibilities of a data scientist.

a) true

b) false

Answer : a

35) "Java" can be used for data acquisition.

a) True

b) False

Answer: a

36) Which of the following is not a common programming paradigm?

a) Mathematical

b) functional

c) reactive

d) None of these

Answer : d

37) The data science life cycle is also called as.......  .

a) pipe line

b) linear line

c) horizontal line

d) None of these

Answer : a

 

Kalyani


Chapter-1

1. Which of the following is not a technique of software engineering?
A) Containerization
B) Virtual Reality
C) Android Computing
D) Data Repositories
Ans. D

2. Which of the following is not a tool in data mining?
A) Weka
B) RapidMiner
C) Teradata
D) TensorFlow
Ans. D

3. KDD stands for ---
A) Knowledge Discovery in Data
B) Knowledge Discovery Database
C) Knowledge Derived Data
D) Knowledge Discovery Database
Ans. A

4. KDD process includes
A) Data Selection
B) Data Integration
C) Data Classification
D) All of above
Ans. D

5. Which of the following is a type of Regression?
A) Linear Regression
B) Logistic Regression
C) Polynomial Regression
D) All of above
Ans. D

Chapter-2

6. OLTP stands for ___
A) Online Translate Processing
B) Online Transactional Processing
C) Online Transport Process
D) Online Testing Process
Ans. B

7. Which of the following is not true about a database?
A) Database stores real time information
B) Database uses OLAP processing
C) Database is optimised to update data with maximum speed
D) Simple transactional queries are used in database
Ans. B

8. Which of the following is not true about a data warehouse?
A) Data warehouse stores historical data
B) Redundancy may happen in data warehouse
C) Data warehouse uses OLTP processing
D) In warehouse analysis is fast and easy
Ans. C

9. OLAP stands for ___
A) Online Advertising Process
B) Online Application Processing
C) Online Analytical Processing
D) Online Airline Process
Ans. C

10. ANN stands for ___
A) Activated Neural Network
B) Artificial Neural Network
C) Artificial Neural Node
D) Artificial Neural Number
Ans. B

Chapter-3

11. Which of the following is a type of Artificial Intelligence?
A) Reactive Machines
B) Limited Memory
C) Theory of Mind
D) All of Above
Ans. D

12. Data analysis is process of ___
A) Cleaning Data
B) Transforming Data
C) Modeling Data
D) All of Above
Ans. D

13. Which of the following is not a type of data analysis?
A) Text Analysis
B) Statistical Analysis
C) Predictive Analysis
D) Data Mining
Ans. D

14. Which of the following is a factor of big data?
A) Volume
B) Value
C) Velocity
D) All of Above

15. Self Awareness is a type of ___
A) Big Data
B) Data Mining
C) Artificial Intelligence
D) Machine Learning
Ans. C

Chapter-4

16. The term data science was used in ___
A) 2001
B) 1962
C) 1985
D) 1998
Ans. A

17. Which of the following is the application of data science?
A) Health Care
B) Targeted Advertising
C) Speech Recognition
D) All of Above
Ans. D

18. Which of the following is true about data science?
A) Data science is helpful to fulfill the demand of large population
B) Data science is helpful to make process of quality service easy and quick
C) Data science helps to create hassle-free communication across the world
D) All of Above
Ans. D

19. A way to classify programming languages based on their features is called ___
A) Inferential Statistics
B) Programming Paradigm
C) R
D) Python
Ans. B

Chapter-5

20. Which of the following is not a tool of data science?
A) Anaconda
B) Keras
C) Weka
D) Flask
Ans. C

21. SAS stands for ___
A) Statistical Analysis System
B) Statistical Application Service
C) System Analysis Service
D) Statistical Analysis Source
Ans. A

22. Seasonality and Segments are types of ___
A) Result
B) Factors
C) Keras
D) Planning
Ans. B

23. Which of the following is the technique of Data Science?
A) Cleaning
B) Visualization
C) Scraping
D) All of Above
Ans. D

24. EDA stands for ___
A) Evolutionary Data Analysis
B) Evaluating Data Analysis
C) Exploratory Data Analysis
D) Explore Data Application
Ans. C

Chapter-6

25. Which of the following is not a responsibility of data scientist?
A) Management
B) Analytics
C) Collaboration
D) Data Cleaning
Ans. D

26. Hadoop was founded by ___
A) ASF
B) Data Scientist
C) OLAP
D) ORCH
Ans. A

27. ASF stands for ___
A) American Standard Foundation
B) Apache Software Foundation
C) Apache Software Framework
D) Apache Standard Framework
Ans. B

28. Which of the following is not a skill of data scientist?
A) Machine Learning
B) Deep Learning
C) Data Mining
D) Data Merging
Ans. D

29. HDFS stands for ___
A) Hadoop Distribution File Software
B) Hadoop Distributed File System
C) Hadoop Data File System
D) Hadoop Developing File System
Ans. B

30. RHIPE stands for ___
A) R and Hadoop Integrated Programming Environment
B) R and Hadoop Interactive Programming Environment
C) R and Hadoop Interconnected Programming Environment
D) R and Hadoop Intermediate Programming Environment
Ans. A



Neha


Chapter 1 :

1) What are the functions of Data Mining?

a) Association and correlation analysis, classification

b) Prediction and characterization

c) Cluster analysis and Evolution analysis

d) All of the above

Ans. D)

2) The following statement can be considered an example of _________

 Relationship between rash driving and the number of road accidents by a driver

a) Prediction 

b) Clustering

c) Regression

d) Classification

Ans. C)
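
Note: to see why the statement in question 2 is treated as regression, it may help to fit a straight line relating one numeric quantity to another. The sketch below is only an illustration using ordinary least squares; the helper name `fit_line` and the sample numbers are made up for this example and do not come from any dataset.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (unnormalised).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: rash-driving incidents per month vs. accidents per year.
rash_incidents = [0, 1, 2, 3, 4]
accidents = [1, 3, 5, 7, 9]

slope, intercept = fit_line(rash_incidents, accidents)
print(f"accidents ~ {slope:.1f} * rash_incidents + {intercept:.1f}")
```

The fitted slope estimates how the number of accidents changes with each additional rash-driving incident, which is the kind of relationship regression describes.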

3) Which of the following is used as the first step in the knowledge discovery process?

a) Data selection

b) Data cleaning

c) Data transformation

d) Data integration

Ans. B)

4) Which of the following is an essential process in which intelligent methods are applied to extract data patterns?

a) Warehousing

b) Data Mining

c) Text Mining

d) Data Selection

Ans. B)

5) What are the common types of data visualizations?

a) Chart

b) Table

c) Graphs

d) All of the above

Ans. D)

6) Which of the following is a linear data structure?

a) Queue

b) Tree

c) Stack

d) Both a & c

Ans. D)
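
Note: both a stack and a queue keep their elements in a single sequence, which is why they count as linear; they differ only in which end is removed first. The short sketch below uses a Python list as a stack and `collections.deque` as a queue. The variable names are illustrative only.

```python
from collections import deque

# Stack: last in, first out (LIFO).
stack = []
for item in (1, 2, 3):
    stack.append(item)      # push onto the top
top = stack.pop()           # removes the most recently added item

# Queue: first in, first out (FIFO).
queue = deque()
for item in (1, 2, 3):
    queue.append(item)      # enqueue at the back
front = queue.popleft()     # removes the oldest item

print("popped from stack:", top)
print("dequeued from queue:", front)
```

A tree, by contrast, branches, so its elements cannot be laid out in one sequence in the same way.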

7) LCNC stands for ____________?

a) Large class number classification

b) Low Code No Code

c) London Community Neighbourhood Co

d) None 

Ans. B)

Chapter 2 :

1) A data warehouse is which of the following?

a) Can be updated by end users 

b) Contains numerous naming conventions and formats

c) Organized around important subject areas

d) Contains only current data

Ans. C)

2) Is the statement below correct or not:

“A database is a collection of related data that represents some elements of the real world, whereas a data warehouse is an information system that stores historical and cumulative data from single or multiple sources.”

a) True

b) False

Ans. A)

3) What is an ANN used for?

a) Pattern Recognition

b) Classification

c) Clustering

d) All of these

Ans. D)

4) Which of the following is the common language for Artificial Intelligence?

a) Python

b) C

c) Java

d) Ruby

Ans. A)

5) Which of the following is not an application of AI?

a) Computer Vision

b) Digital Assistant

c) Database Management System

d) Natural language processing

Ans. C)

Chapter 3 :

1) What are the four V’s of big data?

a) Volume

b) Velocity

c) Veracity

d) All

Ans. D)

2) What license is Hadoop distributed under ?

a) Apache License 

b) Commercial

c) Mozilla Public License

d) Shareware

Ans. A)

3) What is Machine Learning?

i) Artificial Learning

ii) Deep Learning

iii) Data statistics

a) Only i

b) i & ii

c) All

d) None

Ans. B)

4) Data analysis is a process of?

a) Inspecting data

b) cleaning data

c) transforming data

d) All of the above

Ans. D)

5) Which of the following is not a major data analysis approach?

a) Text analysis

b) Diagnostic analysis

c) Predictive Intelligence

d) Statistical analysis

Ans. C)

Chapter 4 :

1) Which of the following is not an application of data science?

a) Internet Search

b) Speech Recognition

c) Gaming

d) Privacy

Ans. D)

2) A Programming paradigm includes:

a) Problem Solving

b) Program language design

c) Problem solving & Program design

d) None

Ans. D)

3) This paradigm is relatively simple:

a) Object – oriented

b) Scripting

c) Procedural

d) Functional

Ans. A)

4) Which of the following uses data on the same object to predict the values for that object?

a) Fast

b) Accuracy

c) Scalable

d) All

Ans. D)

5) Which of the following is not a tool of data science?

a) Matlab

b) Tensorflow

c) Apache Spark

d) Jinja

Ans. D)

Chapter 5 :

1) Data Mining System Classification consists of?

a) Database Technology

b) Machine Learning

c) Information Science

d) All of the above

Ans. D)

2) What is the use of data cleaning?

a) to remove the noisy data

b) correct the inconsistencies in data

c) transformations to correct the wrong data.

d) All of the above

Ans. D)
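
Note: the three uses listed in question 2 can be pictured with a small sketch. The example below is a simplified illustration in plain Python, not a standard cleaning routine: `clean_records`, the field names and the age limits are all assumptions made for this example.

```python
def clean_records(records, min_age=0, max_age=120):
    """Return a cleaned copy of a list of {'name', 'age'} records."""
    cleaned = []
    for rec in records:
        name = rec.get("name")
        age = rec.get("age")
        # Drop records with missing values.
        if name is None or age is None:
            continue
        # Correct inconsistencies: stray spaces and mixed letter case.
        name = name.strip().title()
        # Remove noisy data: ages outside a plausible range.
        if not (min_age <= age <= max_age):
            continue
        cleaned.append({"name": name, "age": age})
    return cleaned


# Hypothetical raw data with a missing value, odd formatting and an outlier.
raw = [
    {"name": "  alice ", "age": 30},
    {"name": "BOB", "age": None},
    {"name": "carol", "age": 999},
    {"name": "dave", "age": 41},
]
print(clean_records(raw))
```

In practice the rules (which values count as noise, how names are normalised) depend on the dataset; the point is only that cleaning combines removal and correction.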

3) Which of the following is not a task of data mining?

a) Selection of data

b) Conversion of data

c) Analyzing the data

d) Integration of data

Ans. C)

4) Which of the following is an aspect of experimentation?

a) Segments

b) Anaconda

c) Keras

d) ORCH

Ans. A)

5) What is removed by EDA?

a) Null Value

b) Garbage

c) Zero

d) All of the above

Ans. A)

Chapter 6 :

1) HDFS stands for _________

a) Hadoop development foundation source

b) Hadoop Distributed File System

c) Hadoop Data File system

d) Hadoop Development for Software

Ans. B)

2) Which of the following is not a component of data science?

a) Data Mining

b) Big data

c) Machine Learning

d) Artificial Intelligence

Ans. A)

3) Which of the following is the third phase of the data science life cycle?

a) Capture

b) Prepare & maintain

c) Processes

d) Communicate

Ans. C)

4) Which of the following is a big part of a data scientist's role?

a) Modeling 

b) Clustering

c) Data visualization 

d) All of the above

Ans. D)

5) _______ is the process of bringing data that has been created by a source outside the organization into the organization for production use.

a) Data Visualization

b) Data Mining

c) Data Acquisition

d) Clustering

Ans. C)


Mohan


UNIT-I

[1] Who is the “Turing Award” winner ?

(A) DJ Patil

(B) Jeff Hammerbacher

(C) C.F. Jeff

(D) Jim Gray

Ans: Jim Gray

[2] In 1962, who described a field that he called “Data Analysis”, which resembles modern data science ?

(A) DJ Patil

(B) Jeff Hammerbacher

(C) John Tukey

(D) Jim Gray

Ans: John Tukey

[3] The term “Data Science” has been traced back to ________, when Peter Naur proposed it as an alternative name for computer science.

(A) 1972

(B) 1973

(C) 1974

(D) 1975

Ans: 1974

[4] The professional title of ________ has been attributed to DJ Patil and Jeff Hammerbacher in 2008.

(A) Data Analysis

(B) Data Scientist

(C) Data Analyst

(D) None of the above

Ans: Data Scientist

[5] Data can be used effectively to make smart decisions; this is achieved by __________.

(A) Artificial Intelligence

(B) Identifying Problems

(C) Both (A) and (B)

(D) None of the above

Ans: Artificial Intelligence

UNIT-II

[6] Cloud servers are located in _______ all over the world.

(A) Data Warehouses

(B) Big Companies

(C) Big Cities

(D) Data Centers

Ans: Data Centers

[7] Databases use online transactional processing (OLTP) to _______ a large number of short online transactions quickly.

(A) Insert

(B) Update

(C) Delete

(D) All of the above 

Ans: All of the above

[8] The word “hypothesis” originates from the _______ words hupo (under) and thesis (placing).

(A) Latin

(B) American

(C) Greek

(D) French

Ans: Greek

[9] Without any assumption or consideration, we cannot start any _______ .

(A) Research

(B) Investigation

(C) Both (A) and (B)

(D) None of the above

Ans: Both (A) and (B)

[10] Classification helps to ________ .

(A) Combine the data

(B) Collect the data

(C) Merge the data

(D) Differentiate the data

Ans: Differentiate the data

UNIT-III

[11] Predictive analysis depends on _______ .

(A) Results

(B) Predictions

(C) Execution 

(D) Statistics

Ans: Predictions

[12] Volume, Velocity, Variety, Veracity and Value are the factors of _______ .

(A) Data Science

(B) Data Analysis

(C) Big Data Management

(D) Artificial Intelligence

Ans: Big Data Management

[13] Big Data Management benefits are ________ .

(A) Increased Revenue

(B) Improved Customer Service

(C) Increased Efficiency

(D) All of the above

Ans: All of the above

[14] What are the main components of Big Data ?

(A) MapReduce

(B) HDFS

(C) YARN

(D) All of the above

Ans: All of the above
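
Note: of the components in question 14, MapReduce is the easiest to picture in a few lines. The sketch below imitates the map, shuffle and reduce phases for a word count using plain in-memory Python. It is not the Hadoop API, and the function names are made up for this illustration.

```python
from collections import defaultdict


def map_phase(lines):
    """Map: emit a (word, 1) pair for every word in every line."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce: combine the values of each key into one result."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(lines):
    return reduce_phase(shuffle_phase(map_phase(lines)))


print(word_count(["big data", "Big Data tools"]))
```

In a real cluster the map and reduce steps run in parallel on many machines, with HDFS holding the data and YARN scheduling the work; here everything runs in one process only to show the data flow.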

[15] Machine Learning is a field of AI consisting of learning 

algorithms that _______ .

(A) At executing some task

(B) Over time with experience

(C) Improve their performance

(D) All of the above

Ans: All of the above

UNIT-IV

[16] In which year was the term data science used ?

(A) 2001

(B) 2002

(C) 2003

(D) 2004

Ans: 2001

[17] Which are the applications of data science ?

(A) Health care

(B) Fraud and Risk detection

(C) Internet search

(D) All of the above

Ans: All of the above

[18] ________ is defined as a process of cleaning, transforming and modelling data to discover useful information which helps in decision making.

(A) Data Analysis

(B) Data Analyst

(C) Data Mining

(D) Data Science

Ans: Data Analysis

[19] Which of the following is a data visualization method ?

(A) Line

(B) Circle

(C) Pie chart and Bar chart

(D) Pentagon

Ans: Pie chart and Bar chart

[20] Data Analysis was defined by which statistician ?

(A) DJ Patil

(B) Jeff Hammerbacher

(C) John Tukey

(D) None of the above

Ans: John Tukey

UNIT-V

[21] Which is the most significant language for Data Science ?

(A) R

(B) Ruby

(C) Java

(D) All of the above

Ans: R

[22] What is the use of data cleaning ?

(A) to remove the noisy data

(B) correct the inconsistencies in data

(C) transformations to correct the wrong data

(D) All of the above

Ans: All of the above

[23] ________ are project deployment tools.

(A) Anaconda

(B) Keras

(C) Flask

(D) All of the above

Ans: All of the above

[24] Which of the following clustering methods requires a merging approach ?

(A) Partitional

(B) Hierarchical

(C) Naïve Bayes

(D) None of the above

Ans: Hierarchical
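
Note: the "merging approach" in question 24 refers to the bottom-up (agglomerative) form of hierarchical clustering, where every point starts as its own cluster and the two closest clusters are merged repeatedly. The sketch below is a minimal illustration on one-dimensional points with single-linkage distance; `agglomerate` and the sample points are assumptions made for this example.

```python
def agglomerate(points, k):
    """Merge the two closest clusters until only k clusters remain."""
    if k < 1:
        raise ValueError("k must be at least 1")
    clusters = [[p] for p in points]    # every point starts alone
    while len(clusters) > k:
        best = None                     # (distance, i, j) of closest pair
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                # Single linkage: distance between the closest members.
                d = min(abs(a - b) for a in clusters[i] for b in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]   # merge j into i
        del clusters[j]
    return clusters


print(agglomerate([1, 2, 9, 10, 25], k=3))
```

Partitional methods such as k-means instead assign points to a fixed number of clusters directly, without this step-by-step merging.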

[25] What is the long form of EDA ?

(A) Exploratory Data Analysis

(B) Exploratory Data Analyst 

(C) Explanning Data Analysis

(D) Explorer Data Analyst

Ans: Exploratory Data Analysis

UNIT-VI

[26] Hadoop is an opensource framework which was founded 

by ________ .

(A) AFS

(B) ASF

(C) FSA

(D) SAF

Ans: ASF

[27] ASF stands for _________ .

(A) Apache Software Foundation

(B) Apache Software Function

(C) Apply Software Foundation

(D) Applying Software Foundation

Ans: Apache Software Foundation

[28] In which language is Hadoop created ?

(A) Python 

(B) Perl

(C) Java

(D) Ruby

Ans: Java

[29] HDFS stands for ________ .

(A) Hadoop District File System

(B) Haddop District File System

(C) Hadoop Distributed File System

(D) Hadoop Distance File System

Ans: Hadoop Distributed File System

[30] Which industries are dependent on data science ?

(A) Agriculture

(B) Digital Economy

(C) Health Care

(D) All of the above

Ans: All of the above

[31] Which among the following is the top most important 

thing in data science ?

(A) Answer

(B) Data

(C) Question

(D) None of the above

Ans: Question.


Asmita

Data science

Unit-1

1. _______is the most important language for data science

1. Java

2. Python

3. R

4. Ruby

Ans. R

2. _______is one of the key data science skills

1. Machine learning

2. Statistics

3. Data visualization

4. All of the above

Ans. All of the above

3. _______ data mining technique is used to uncover patterns in data

1. Data merging

2. Data booting

3. Data dredging

4. All of the above

 Ans. Data dredging

4. Which of the following has many features of what is now known as cloud computing

1. Web services

2. Software

3. All of the above

4. Internet

Ans. Internet

• What is visualization?

1. It is the graphical representation of information and data

2. It is the textual representation of information and data

3. None

4. All of the above

Ans. It is the graphical representation of information and data

• Classification rules are extracted from

1. Root node

2. Decision tree

3. Siblings

4. Branches

Ans. Decision tree
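
Note: classification rules come out of a decision tree by reading each path from the root to a leaf as one "if ... then ..." rule. The sketch below shows the idea on a tiny hand-written tree stored as nested dictionaries. The tree, its feature names and the helper `extract_rules` are all hypothetical and only serve as an illustration.

```python
def extract_rules(node, conditions=()):
    """Return one (conditions, label) rule per root-to-leaf path."""
    if "label" in node:                      # leaf: the path so far is a rule
        return [(list(conditions), node["label"])]
    rules = []
    for value, child in node["branches"].items():
        test = (node["feature"], value)      # condition taken on this branch
        rules += extract_rules(child, conditions + (test,))
    return rules


# Hypothetical tree: decide whether to play outside.
tree = {
    "feature": "outlook",
    "branches": {
        "sunny": {
            "feature": "windy",
            "branches": {
                "yes": {"label": "stay in"},
                "no": {"label": "play"},
            },
        },
        "rainy": {"label": "stay in"},
    },
}

for conditions, label in extract_rules(tree):
    tests = " AND ".join(f"{feature} = {value}" for feature, value in conditions)
    print(f"IF {tests} THEN {label}")
```

Each leaf yields exactly one rule, so a tree with three leaves gives three rules.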

Unit-2

• AI is about________

1. Playing a game on computer

2. Making a machine intelligent

3. None

4. All of the above

Ans. Making a machine intelligent

• AI programming skills are________

1. Learning

2. Reasoning

3. Self correction

4. All of the above

Ans. All of the above

• Which of the given languages is not used for AI

1. Python

2. R

3. Java

4. Perl

Ans. Perl

• ANN stages_____

1. Input

2. Nodes

3. Output

4. All of the above

Ans. All of the above

• Which of the following is a Branch of statistics

1. Descriptive

2. Inferential

3. Industry

4. Both 1&2

Ans. Both 1&2

Unit-3

• Data analysis is a process of

1. Inspecting data

2. Cleaning data

3. Transforming data

4. All of the above

Ans. All of the above

• How many types of data analysis are there

1. 7

2. 5

3. 8

4. 4

Ans. 7

• What are the 'V's of big data

1. Volume

2. Velocity

3. Variety

4. All of the above

Ans. All of the above

• AI stands for

1. Machine intelligence

2. Artificial intelligence

3. Artificial neural network

4. None

Ans. Artificial intelligence

• _______ are the types of data analysis methods

1. Descriptive analysis

2. None

3. Diagnostic analysis

4. Both 1&3

Ans. Both 1&3

Unit-4

• Unstructured data is not organised

1. False

2. True

Ans. True

• A column is a_____ representation of data

1. Horizontal

2. Diagonal

3. Vertical

4. Top

Ans. Vertical

• Data science feature ___

1. Automobile industry

2. IT

3. Army and weapons

4. All of the above

Ans. All of the above

• Which of the following approaches should be tried when asking a data analysis question

1. Finding the question which is to be answered

2. Finding only one solution for a specific problem

3. Finding out the answer from the dataset without enquiring any questions

4. None

Ans. Finding the question which is to be answered

Unit-5

• _______ is a high-level API built on TensorFlow

1. PyCharm for python IDE

2. Anaconda

3. Keras

4. Cloud platform

Ans. Keras

• Two types of experimentation factors

1. Seasonality

2. Segment

3. Nin

4. Both 1&2

Ans. Both 1&2

• What is KDD in data mining

1. Knowledge discovery in databases

2. Knowledge discovery data

3. None

4. Knowledge data definition

Ans. Knowledge discovery in databases

• ______ is the simplest class of analytics

1. Descriptive

2. Predictive

3. Prescriptive

4. Summarization

Ans. Descriptive

• Which is not a type of predictive analytics

1. Predictive model

2. Descriptive model

3. Division model

4. Flask

Ans. Flask

Unit-6

• The Hadoop framework is written in

1. C++

2. Python

3. Java

4. C

Ans. Java

• Hadoop is open source

1. Only for Apache Hadoop

2. Only for Apache & Cloudera

3. True

4. False

Ans. Only for Apache Hadoop

• What license is Hadoop distributed under

1. Apache license 2.0

2. Mozilla public license

3. Shareware

4. Commercial

Ans. Apache license 2.0

• What are the different features of big data analytics

1. Open source

2. Scalability

3. Data recovery

4. All of the above

Ans. All of the above

• ______ has the world's largest Hadoop cluster

1. Apple

2. Datamatics

3. Facebook

4. None

Ans. Facebook


Shweta


• What license is Hadoop distributed under

1. Apache license 2.0

2. Mozilla public license

3. Shareware

4. Commercial

Ans. Apache license 2.0

• What are the different features of big data analytics

1. Open source

2. Scalability

3. Data recovery

4. All of the above

Ans. All of the above

• ______ has the world's largest Hadoop cluster

1. Apple

2. Datamatics

3. Facebook

4. None

Ans. Facebook


