Which statement is TRUE concerning optimizing load performance?
A. You can improve the performance by increasing the number of map tasks assigned to the load
B. When loading large files, the number of files that you load does not impact the performance of the LOAD HADOOP statement
C. You can improve the performance by decreasing the number of map tasks that are assigned to the load and adjusting the heap size
D. It is advantageous to run the LOAD HADOOP statement pointing directly to large files located in the host file system, as opposed to copying the files to the DFS prior to the load
Which of the following statements is TRUE regarding search visualization with Apache Hue?
A. Hue submits MapReduce jobs to Oozie
B. No additional setup is required to secure your session cookies
C. Hue applications require some code to be installed on the client
D. The File Browser application allows you to perform keyword searches across your Hadoop data
Using Bulk Load in HBase to load a large volume of data will result in which of the following?
A. It will use less CPU but more network resources
B. It will use fewer network resources but more CPU
C. It will behave the same way as using the HBase API to load a large volume of data
D. None of the above
Which of the following statements is TRUE about BigSheets?
A. You can create any type of sheet from a parent workbook
B. You must create a child workbook in order to create a chart
C. You can delete a parent workbook without deleting the child workbooks
D. You must run the workbook on the data to get the full results of the analysis
Consider the following properties:
1. Automated creation of target database schema and bulk extract and load
2. Real-time replication subscriptions (with CDC)
3. Managed workload for optimized performance of potentially thousands of artifacts
4. Ensured governance for both data access and metadata capture (to support data lineage and impact assessment)
Which tool supports all of the above?
A. Pig
B. JAQL
C. Data Click
D. BigSheets
Which keyword can be inserted before a Jaql statement to understand how it may be rewritten into MapReduce?
A. explain
B. describe
C. transcribe
D. mapreduce
Which Big SQL file format could be expected to result in a longer-running query?
A. Text
B. Avro
C. Parquet
D. Sequence_text
What keyword is used when loading data into Hive from a single file on the local file system?
(Hint: If this keyword is omitted from the LOAD command, Hive assumes the location we are referring to is on HDFS)
A. LOCAL
B. INSERT
C. DIRECTORY
D. OVERWRITE
Which component of BigInsights is able to mask data items so as to restrict viewing of sensitive data?
A. Flume
B. HDFS
C. Oozie
D. Big SQL
A large retailer is planning to create a data cloud using BigInsights. They would like to run queries using data that is stored in BigInsights as well as in other databases such as DB2 and Oracle.
Which one of the following would enable them to write the queries without having to move data into one common data store?
A. Hive
B. HBase
C. Big SQL
D. BigSheets