Big Data Solutions MCQ With Answer
1. How many V’s are present in the Big
Data?
a. 3
b. 4
c. 5
d. 6
Answer: c
Data?
a. 3
b. 4
c. 5
d. 6
Answer: c
2. Data in a Relational Database is:
a. Structured
b. Un-Structured
c. Semi Structured
d. Meta Data
Answer: a
3. Data is found in the big data, in how many
forms?
a. 2
b. 3
c. 4
d. 5
Answer: b
4. What kind of data is in Log files?
a. Structured
b. Un-Structured
c. Semi Structured
d. Meta Data
Answer: c
5. What is the overall percentage of the
world’s total data created within the past
two years is:
a. 80%
b. 85%
c. 90%
d. 95%
Answer: c
6. What are the main components present in
the Big Data Analytics?
a. MapReduce
b. HDFS
c. YARN
d. All of the above
Answer: d
6. What are the major benefits of Big Data
Processing?
a. Businesses can utilize outside
intelligence while taking decisions
b. Improved customer service
c. Better operational efficiency
d. All of the above
Answer: d
7. The Hadoop is written in which
programming language?
a. C
b. C++
c. Java
d. Python
Answer: c
8. Which of the following option given are
NOT related to the big data problem(s)?
a. Parsing 5 MB XML file every 2
minutes
b. Processing the twitter data
c. Processing online banking
transactions
d. both (a) and (c)
Answer: d
9. What does the characteristics “Velocity”
in Big Data represents?
a. Speed of input data generation
b. Speed of individual machine
processors
c. Speed of ONLY storing data
d. Speed of storing and processing
data
Answer: d
10. Which of the following are example(s) of
Real Time Big Data Processing?
a. Complex Event Processing (CEP)
platforms
b. Stock market data analysis
c. Bank fraud transactions detection
d. both (a) and (c)
Answer: d
11. Hadoop is open source.
a. ALWAYS True
b. True only for Apache Hadoop
c. True only for Apache and
Cloudera Hadoop
d. ALWAYS False
Answer: b
12. Which of the following is not an example
of Social Media?
a. Twitter
b. Google
c. Insta
d. Youtube
Answer: b
13. By 2027, the volume of data produced
digitally will reach to
a. TB
b. YB
c. ZB
d. EB
Answer: c
14. For Drawing insights for Business what
are need?
a. Collecting the data
b. Storing the data
c. Analysing the data
d. All the above
Answer: d
15. Does Facebook uses “Big Data ”
to determine the behavior of its
users? Is this True or False.
a. TRUE
b. FALSE
Answer: a
16. The Process of describing the data that
is huge and complex to store and
process is known as
a. Analytics
b. Data mining
c. Big Data
d. Data Warehouse
Answer: c
17. Data generated from online
transactions is one of the example for
volume of big data. Is this true or
False.
a. TRUE
b. FALSE
Answer: a
18. Velocity is the speed at which the data is
processed
a. TRUE
b. FALSE
Answer: b
19. have a structure but cannot be
stored in a database.
a. Structured
b. Semi-Structured
c. Unstructured
d. None of these
Answer: b
20. . refers to the ability to turn your
data useful for business.
a. Velocity
b. Variety
c. Value
d. Volume
Answer: c
21. Value tells the trustworthiness of data in
terms of quality and accuracy.
a. TRUE
b. FALSE
Answer: b
22. Files are divided into sized
Chunks.
a. Static
b. Dynamic
c. Fixed
d. Variable
Answer: c
23. ____________is an open source
framework for storing data and
running application on clusters of
commodity hardware.
a. HDFS
b. Hadoop
c. MapReduce
d. Cloud
Answer: b
24. Hadoop MapReduce allows you to
perform distributed parallel processing
on large volumes of data quickly and
efficiently: statement is True or False
a. TRUE
b. FALSE
Answer: a
25. In Relational database Management
System the property of Scaling is
apploicable.
a. TRUE
b. FALSE
Answer: b
26. Which of the following options is not the
example of NoSql ?
a. Google
b. NetFlix
c. Amazon
d. CERN
Answer: c
27. Scalability and better performance of No
SQL is Achieved by sacrificing ACID
Compatibility Is it TRUE?
a. TRUE
b. FALSE
Answer: a
28. For Scalability and better performance of
No SQL is attained by compromising
ACID Compatibility Is it TRUE?
a. TRUE
b. FALSE
Answer: a
29. is a programming
model for writing applications that
can process Big Data in parallel on
multiple nodes.
a. HDFS
b. MAP REDUCE
c. HADOOP
d. HIVE
Answer: b
30. Which of the following is a widely used and
effective machine learning algorithm based
on the idea of bagging?
a. Decision Tree
b. Regression
c. Classification
d. Random Forest
Answer: d
31. Data Set is the:
a. Tweets stored in a flat file
b. A collection of image files in a
directory
c. An extract of rows from a database
table stored in a CSV formatted file
d. All the above
Answer: d
32. Data analysis is the process of:
a. Examining data to find facts
b. Relationships,
c. Patterns, insights and/or trends.
d. All the above
Answer: d
33. What are the general categories of
analytics that are distinguished by the
results they produce:
a. Descriptive analytics
b. Diagnostic analytics
c. Predictive analytics
d. All the above
Answer: d
34. BI enables an organization to gain insight
into the performance of an enterprise
a. By analyzing data generated by its
business processes and information
systems.
b. By examining data to find facts
c. From relationships,
d. All the above
Answer: a
35. Data variety refers to;
a. Multiple schemas
b. Multiple formats and types of data
c. Multiple Data Models
d. None of above
Answer: b
36. Unstructured Data Consists of:
a. Text file, Audio Files.
b. Video files, Text data
c. Tagged Data
d. a) and b)
Answer: d
37. Multiple internal and external data in the big
data comes from the multiple sources as :
a. Sensors, Social network sites
b. Email, Xml, Multimedia
c. a) and b)
d. None of the above
Answer: c
38. Ingestion Layer Should have the capability to:
a. validate, cleanse, transform, reduce
b. integrate
c. Preprocess the data
d. a) and b)
Answer: d
39. According to analysts, for what can traditional
IT systems provide a foundation when they’re
integrated with big data technologies like
Hadoop?
a. Big data management and data mining
b. Data warehousing and business
intelligence
c. Management of Hadoop clusters
d. Collecting and storing unstructured data
Answer: a
40. What are the main components of Big Data?
a. MapReduce
b. HDFS
c. YARN
d. All of these
Answer: (d)
41. What are the different features of Big Data
Analytics?
a. Open-Source
b. Scalability
c. Data Recovery
d. All the above
Answer: (d)
42. What are the four V’s of Big Data?
a. Volume
b. Velocity
c. Variety
d. All the above
Answer: (d)
43. All of the following accurately describe
Hadoop, EXCEPT:
a. Open-source
b. Real-time
c. Java-based
d. Distributed computing approach
Answer: (b)
44. ___________ is general-purpose computing
model and runtime system for distributed data
analytics.
a. Mapreduce
b. Drill
c. Oozie
d. None of the above
Answer: (a)
45. The examination of large amounts of data to see
what patterns or other useful information can be
found is known as
a. Data examination
b. Information analysis
c. Big data analytics
d. Data analysis
Answer: (c)
46. Big data analysis does the following except
a. Collects data
b. Spreads data
c. Organizes data
d. Analyzes data
Answer: (b)
47. What makes Big Data analysis difficult to
optimize?
a. Big Data is not difficult to optimize
b. Both data and cost effective ways to
mine data to make business sense
out of it
c. The technology to mine data
d. All of the above
Answer: (b)
48. The new source of big data that will trigger a
Big Data revolution in the years to come is
a. Business transactions
b. Social media
c. Transactional data and sensor data
d. RDBMS
Answer: (c)
49. The unit of data that flows through a Flume
agent is
a. Log
b. Row
c. Event
d. Record
Answer:( c)
50. Listed below are the three steps that are
followed to deploy a Big Data Solution except
a. Data Ingestion
b. Data Processing
c. Data dissemination
d. Data Storage
Answer: (c)
51. Check below the best answer to “which
industries employ the use of so-called “Big
Data” in their day to day operations?
a. Weather forecasting
b. Marketing
c. Healthcare
d. All of the above
Answer: (d)
52. There are almost as many bits of information in
the digital universe as there are stars in the
actual universe?
a. True
b. False
Answer: (a)
53. The word ‘Big data’ was coined by
a. Roger Mougalas
b. John Philips
c. Simon Woods
d. Martin Green
Answer: (a)
54. The word ‘Big Data’ was coined in the year
a. 2000
b. 1970
c. 1998
d. 2005
Answer: (c)
55. Concerning the Forms of Big Data, which one
of these is odd?
a. Structured
b. Unstructured
c. Processed
d. Semi-Structured
Answer: ( c )
56. Big Data applications benefit the media and
entertainment industry by
a. Predicting what the audience wants
b. Ad targeting
c. Scheduling optimization
d. All of the above
Answer: (d)
57. The feature of big data that refers to the quality
of the stored data is ______
a. Variety
b. Volume
c. Variability
d. Veracity
Answer: (d)
58. ______ is a framework for performing remote
procedure calls and data serialization.
a. Drill
b. BigTop
c. Avro
d. Chukwa
Answer: c
59. Which of the following is a characteristic of Big
Data?
a. Huge volume of data
b. Complexity of data types and structures
c. Speed of data creation and growth
d. All of the mentioned
Answer: d
60. Concurrent access to shared data may result in
_________________
a. Data consistency
b. Data insecurity
c. Data inconsistency
d. None of the mentioned
Answer: c
61. Mutual exclusion implies that :
a. If a process is executing in its critical
section, then no other process must be
executing in their critical sections
b. If a process is executing in its critical
section, then other processes must be
executing in their critical sections
c. If a process is executing in its critical
section, then all the resources of the
system must be blocked until it finishes
execution
d. None of the mentioned
Answer: a
62. In the memory hierarchy, as the speed of
operation increases the memory size also
increases.
a. True
b. False
Answer: b
63. To use a _______network service, the service
user first establishes a connection, uses the
connection, and terminates the connection.
a. Connection-oriented
b. Connection-less
c. Service-oriented
d. Service-less
Answer: a
64. Which layer is responsible for the process-toprocess delivery ?
a. Network
b. Transport
c. Application
d. Physical
Answer: b
65. ________________refers to the biases, noise
and abnormality in data, trustworthiness of
data.
a. Value
b. Veracity
c. Velocity
d. Volume
Answer: b
66. _____________refers to the connectedness of
big data.
1. Value
2. Veracity
3. Velocity
4. Valence
Answer: d
0 Comments