Exam4Training

Huawei H13-723-ENU HCIP-Big Data Developer Online Training

Question #1

When implementing the business logic for writing data to HBase, which of the following interfaces or classes is not needed?

  • A . Put
  • B . HTable
  • C . HBaseAdmin
  • D . PutList

Correct Answer: C
Question #2

In MapReduce application development, which of the following Java classes is responsible for managing and running a computing task?

  • A . Job
  • B . Context
  • C . FileSystem
  • D . Configuration

Correct Answer: A
Question #3

In FusionInsight HD, Oozie needs to upload the configuration files and jar packages that the job relies on to HDFS before submitting the job.

  • A . True
  • B . False

Correct Answer: A
Question #4

In the FusionInsight HD product, which of the following descriptions of the Kafka secure and non-secure ports is wrong?

(Note: The old API refers to the old Producer API and the old Consumer API. The old Consumer API is the interface defined in kafka.consumer.ConsumerConnector; the old Producer API is the interface defined in kafka.producer.Producer.)

  • A . Kafka cluster security access port defaults to 21007, non-secure access port defaults to 21005.
  • B . The premise for the old API to access a topic through port 21005 is that the server parameter allow.everyone.if.no.acl.found is set to true, and the topic does not have ACL attributes set
  • C . The old API only supports access to port 21005; the new API is compatible with access to non-secure port 21005 and secure port 21007.
  • D . For topics with ACL set, you can use the old API to connect to port 21005 for access.

Correct Answer: D
Question #5

Which of the following methods can generate DStream objects?

  • A . KafkaUtils.createStream(…)
  • B . KafkaUtils.createDirectStream(…)
  • C . StreamingContext.socketStream(…)
  • D . StreamingContext.fileStream(…)

Correct Answer: ABCD
Question #6

Regarding the file format provided by Hive in the FusionInsight HD system, which is not a columnar file?

  • A . ORC
  • B . Parquet
  • C . RCFile
  • D . TextFile

Correct Answer: D
Question #7

In FusionInsight HD, for the creation and read-write permissions of various resources in Solr, which of the following statements is wrong?

  • A . Solr user group users can create a configuration set and specify the configuration set when creating a Collection
  • B . Users of the Solr user group can create Collection
  • C . Collection creator, role with read and write access to the collection, and Solr admin role can update the collection
  • D . Only the collection creator and Solr admin roles can delete a collection

Correct Answer: A
Question #8

In Spark, which of the following is the operator that takes the intersection of the DataFrame?

  • A . filter(conditionExpr: String): DataFrame
  • B . join(right: DataFrame, joinExprs: Column): DataFrame
  • C . intersect(other: DataFrame): DataFrame
  • D . dropDuplicates():DataFrame

Correct Answer: C
Question #9

To create a table through HBase’s createTable method, what parameters must be passed in?

  • A . Table name
  • B . Table name and columns
  • C . Table name and column family
  • D . Can be empty

Correct Answer: C
Question #10

The system time of the node where the HDFS client is located must be consistent with the system time of the FusionInsight HD cluster. If there is a time difference, it must be less than how many minutes?

  • A . 5
  • B . 10
  • C . 20
  • D . 30

Correct Answer: A

Question #11

In Flume cascade transmission, failover mode can be used so that if the next-hop Flume node fails or receives data abnormally, transmission automatically switches to another path and continues.

  • A . True
  • B . False

Correct Answer: A
Question #12

In FusionInsight HD, what are the roles that belong to the Streaming service? (multiple choice)

  • A . Nimbus
  • B . Supervisor
  • C . Broker
  • D . quorumpeer

Correct Answer: AB
Question #13

Regarding Streaming’s topology, which of the following descriptions is wrong?

  • A . A Topology is a directed acyclic graph (DAG) connected by a set of Spout components and Bolt components through Stream Groupings
  • B . Topology will keep running until it is explicitly killed
  • C . Business logic is encapsulated in Topology
  • D . A Topology can only specify to start one Worker process

Correct Answer: D
Question #14

When the HDFS client writes files to HDFS with N copies, if one of the copies fails to write, all copies will return the write failure.

  • A . True
  • B . False

Correct Answer: B
Question #15

In FusionInsight HD, use the Streaming command line to submit the om.huawei.example.WordCount task in example.jar. The task name is wcTest. Which of the following commands is correct?

  • A . storm jar example.jar wcTest om.huawei.example.WordCount
  • B . storm jar example.jar wcTest WordCount
  • C . storm jar example.jar om.huawei.example.WordCount wcTest
  • D . storm jar example.jar WordCount wcTest

Correct Answer: C
Question #16

For an application running on the MapReduce platform, where will the jar package that the application depends on be placed?

  • A . HIVE
  • B . HBASE
  • C . HDFS
  • D . DB

Correct Answer: C
Question #17

As the core object of Spark, which of the following characteristics does RDD have? (multiple choice)

  • A . Read only
  • B . Partition
  • C . Fault tolerance
  • D . Efficient

Correct Answer: ABCD
Question #18

In FusionInsight HD, what are the computing frameworks that can be used in real-time processing scenarios? (multiple choice)

  • A . Spark Streaming
  • B . Streaming
  • C . MapReduce
  • D . HDFS

Correct Answer: AB
Question #19

In FusionInsight HD, which method of the OozieClient class does the Oozie client's Java API call to run a task?

  • A . suspend
  • B . getJobInfo
  • C . submit
  • D . run

Correct Answer: D
Question #20

Suppose the data block size is set to 128M. When the HDFS client writes a 100M file, how much storage space is actually occupied?

  • A . 128M
  • B . 100M
  • C . 64M
  • D . 50M

Correct Answer: B
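
The arithmetic behind this answer can be sketched in a few lines of Python. This is only an illustration: `hdfs_usage` is a hypothetical helper, not an HDFS API, and it assumes a single replica (the question does not state a replication factor) and ignores checksum and metadata overhead. The point it shows is that the block size is an upper bound, so a partial last block occupies only the bytes actually written.

```python
import math

MB = 1024 * 1024

def hdfs_usage(file_size, block_size=128 * MB, replication=1):
    """Return (number_of_blocks, bytes_on_disk) for one file.

    Hypothetical helper for illustration only.
    """
    blocks = math.ceil(file_size / block_size) if file_size else 0
    # Blocks are not padded: usage follows the file size, not blocks * block_size.
    return blocks, file_size * replication

blocks, used = hdfs_usage(100 * MB)
print(blocks, used // MB)  # 1 100
```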

Question #21

Which of the following reasons will cause HDFS NameNode to enter safemode (safe read-only mode)? (multiple choice)

  • A . The disk space where the metadata of the active and standby NameNodes resides is insufficient.
  • B . The number of lost blocks exceeds the threshold.
  • C . The missing copy exceeds the threshold.
  • D . The damaged copy exceeds the threshold.

Correct Answer: AB
Question #22

For the HBase component of the FusionInsight HD platform, what attributes of the secondary index need to be defined when adding a secondary index? (multiple choice)

  • A . Index name
  • B . Index column
  • C . Index column type
  • D . The name of the column family to which the index column belongs

Correct Answer: ABCD
Question #23

When a MapReduce application is executed, which of the following actions occurs before the map phase?

  • A . split
  • B . combine
  • C . partition
  • D . sort

Correct Answer: A
Question #24

In FusionInsight HD, regarding the secondary development of Hive UDF, which of the following descriptions is correct? (multiple choice)

  • A . Before the user-defined UDF is used, it needs to be created in the Hive system.
  • B . User-defined UDF is not allowed to add information such as summary and status.
  • C . User-defined UDF can add deterministic and stateful annotations according to the actual situation.
  • D . In a secure cluster, user-defined UDFs are recommended to be created before each use.

Correct Answer: AC
Question #25

In the FusionInsight HD product, what is the role name of the Kafka service?

  • A . Producer
  • B . Broker
  • C . Consumer
  • D . ZooKeeper

Correct Answer: B
Question #26

Which of the following are ways to set parameters for a Spark application? (multiple choice)

  • A . Configure in the application configuration file spark-defaults.conf
  • B . When submitting the application, set it through --conf
  • C . In the application code, set by the setProperty method of SparkContext
  • D . In the application code, set through the SparkConf object

Correct Answer: ABD
Question #27

In a FusionInsight HD cluster, Flume does not support writing collected data to which service in the cluster?

  • A . HDFS
  • B . HBase
  • C . Kafka
  • D . Redis

Correct Answer: D
Question #28

In Spark, assuming that lines is a DStream object, which of the following statements can periodically count the number of words on this stream?

  • A . lines.flatMap(_.split(" ")).map(word => (word, 1)).reduce(_ + _).print()
  • B . lines.flatMap(_.split(" ")).map(word => (word, word.length())).reduceByKey(_ + _).print()
  • C . lines.flatMap(_.split(" ")).map(word => (word, 1)).reduceByKey(_ + _).print()
  • D . lines.flatMap(_.split(" ")).flatMap(word => (word, 1)).groupByKey(_ + _).print()

Correct Answer: C
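
To see what the flatMap, map and reduceByKey chain in option C computes for each batch, here is a plain-Python sketch of the same three steps applied to one micro-batch of lines. It does not use Spark; `word_count` is a hypothetical function, and the comments map each step back to the Scala operators in the option. Option B, by contrast, would sum word lengths rather than occurrences.

```python
from collections import defaultdict
from itertools import chain

def word_count(batch_lines):
    """Count words in one micro-batch, mirroring flatMap -> map -> reduceByKey."""
    # flatMap(_.split(" ")): each line becomes many words
    words = chain.from_iterable(line.split(" ") for line in batch_lines)
    # map(word => (word, 1)): pair every word with a count of one
    pairs = ((word, 1) for word in words)
    # reduceByKey(_ + _): sum the counts for each distinct word
    counts = defaultdict(int)
    for word, n in pairs:
        counts[word] += n
    return dict(counts)

print(word_count(["a b a", "b c"]))  # {'a': 2, 'b': 2, 'c': 1}
```
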
Question #29

HBase filters can use column names or column values as filter conditions, and multiple filters can be used together.

  • A . True
  • B . False

Correct Answer: A
Question #30

Regarding the disaster tolerance of Streaming, which of the following statements is correct? (multiple choice)

  • A . After the Supervisor process exits, it can be automatically recognized and pulled up by Nimbus without affecting the running business.
  • B . Worker can be automatically pulled up by Supervisor after abnormal exit without manual intervention.
  • C . When a node fails, tasks on that node will be reassigned to other normal nodes without manual intervention.
  • D . After Nimbus fails, the standby Nimbus will automatically take over, without affecting the running business.

Correct Answer: ABCD

Question #31

Rowkey design is a critical part of developing and designing an HBase table.

Suppose the most frequent query looks up the historical call records of a given mobile phone number by month and by six-month period. Which of the following rowkey designs is the best?

  • A . Name + mobile phone number
  • B . Date + mobile phone number
  • C . Mobile number + date
  • D . Mobile phone number + name

Correct Answer: C
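
HBase keeps rows sorted by rowkey, so putting the phone number first and the date second makes all records for one number in a date window a single contiguous range. The Python sketch below imitates that with a sorted list and `bisect`; the key layout (11-digit number followed by yyyyMMdd), the sample keys and the `scan` helper are all assumptions made for illustration, not HBase API calls.

```python
import bisect

# Hypothetical rowkeys: <11-digit phone number><yyyyMMdd>, kept sorted
# the way HBase keeps rows sorted by rowkey.
rowkeys = sorted([
    "13800000001" + "20240105",
    "13800000001" + "20240117",
    "13800000001" + "20240203",
    "13800000002" + "20240110",
])

def scan(keys, start_row, stop_row):
    """Return keys in [start_row, stop_row), like a start/stop-row range scan."""
    lo = bisect.bisect_left(keys, start_row)
    hi = bisect.bisect_left(keys, stop_row)
    return keys[lo:hi]

# All January 2024 calls for one number form one contiguous range.
january = scan(rowkeys, "13800000001" + "20240101", "13800000001" + "20240201")
print(len(january))  # 2
```

With the date first (option B), the same query would have to touch every number's records for each day in the window.
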
Question #32

When indexing in Solr, which of the following code snippets can significantly improve indexing performance? (multiple choice)

  • A . lbClient.setRequestWriter(new BinaryRequestWriter());
  • B . if (updateRequest.getDocumentsMap().size() >= 10000) {
    cloudSolrClient.request(updateRequest, collName);
    updateRequest.clear();
    }
  • C . if (updateRequest.getDocuments().size() >= 1000) {
    cloudSolrClient.request(updateRequest, collName);
    updateRequest.clear();
    }
  • D . SolrInputDocument doc = new SolrInputDocument();
    doc.addField("id", i.toString());
    doc.addField("name", "Zhang San" + i);
    doc.addField("features", "test" + i);
    doc.addField("price", (float) i * 1.01);

    cloudSolrClient.add(doc);
    cloudSolrClient.commit();

Correct Answer: AB
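
The batching pattern in option B can be sketched generically in Python. This is not SolrJ code: `index_in_batches` is a hypothetical function and `send` is a stand-in for one bulk update request. It only illustrates how buffering documents and flushing at a threshold reduces the number of requests compared with adding and committing every document individually, as option D does.

```python
def index_in_batches(docs, send, batch_size=10000):
    """Send docs in batches; `send` is a hypothetical stand-in for one bulk request."""
    batch = []
    requests = 0
    for doc in docs:
        batch.append(doc)
        if len(batch) >= batch_size:
            send(batch)
            requests += 1
            batch = []  # start a fresh batch; the list already sent is untouched
    if batch:  # flush the final partial batch
        send(batch)
        requests += 1
    return requests

sent = []
n = index_in_batches(({"id": str(i)} for i in range(25000)), sent.append)
print(n, [len(b) for b in sent])  # 3 [10000, 10000, 5000]
```
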
Question #33

In FusionInsight HD, which of the following components can be used for data collection? (multiple choice)

  • A . Flume
  • B . Loader
  • C . HBase
  • D . MapReduce

Correct Answer: AB
Question #34

When using Solr for full-text search, you can specify the response format of the query result through the wt parameter.

Regarding the response format of Solr query results, which of the following statements is wrong?

  • A . Support CSV and JSON
  • B . Support CSV, JSON and HTML
  • C . Support CSV, JSON and XML

Correct Answer: B
Question #35

During the Solr application development of FusionInsight HD products, you can verify the Collection through the Solr Admin UI.

Which of the following statements about Solr Admin UI is correct? (multiple choice)

  • A . Click Tree under Cloud to view metadata information such as Collections, configuration sets and live_nodes in Solr Cloud
  • B . Click Cloud to view the distribution and status of each Shard Replica under each Collection
  • C . On the Core Overview page of a Collection's Shard Replica, you can view the actual number of documents, the storage size and the location of the Replica's index
  • D . Solr user group, Solr admin role and Super group users can access Solr Admin UI

Correct Answer: ABCD
Question #36

A project needs to store the online records of a certain area and run full-text searches over them for sensitive information, in order to prevent crime in the area.

In this scenario, which of the following is the best solution?

  • A . Create an index on Solr, save the data, and return all the data when searching the full text.
  • B . Store data on HBase, and satisfy fuzzy matching queries through HBase’s filtering features.
  • C . When storing data, create an index on Solr and store the complete data in HBase; when querying, obtain key information of records through Solr full-text search, and obtain complete records on HBase through key information.

Correct Answer: C
Question #37

In FusionInsight HD’s Hive, the user-defined UDF can have the same name as the Hive built-in UDF. In this case, the user-defined UDF will be used.

  • A . True
  • B . False

Correct Answer: B
Question #38

Suppose an application has 10 tables, each with tens of millions of records and about 20 fields.

Now Redis is used to cache the data of these 10 tables. For the design of the data structure, which of the following is the best design?

  • A . Using a hash structure, each table uses one hash KEY, and each row of the table is stored as one field of that hash KEY.
  • B . Using a hash structure, each row record of each table uses a hash KEY, the field of the hash KEY corresponds to the field of the table record, and each table is designed with a different prefix to distinguish it.
  • C . Using string structure, each field in each row of each table uses a KEY.
  • D . Using string structure, each row of each table uses a KEY, and value is the concatenated value of all the fields in a row of the table.

Correct Answer: B
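
The layout described in option B can be pictured with plain Python dicts standing in for Redis. No Redis client is used here, and the `table:row_id` key format, the function names and the sample data are assumptions for illustration only. Each row gets its own hash, the hash fields mirror the table columns, and the table name is the key prefix.

```python
# Plain dicts stand in for Redis here; no Redis client is used.
store = {}

def row_key(table, row_id):
    """Build a per-row hash key with the table name as prefix (hypothetical scheme)."""
    return f"{table}:{row_id}"

def put_row(table, row_id, record):
    # One hash per row: hash fields mirror the table's columns.
    store.setdefault(row_key(table, row_id), {}).update(record)

def get_field(table, row_id, field):
    # Read one column without fetching or parsing the whole row.
    return store[row_key(table, row_id)][field]

put_row("user", 42, {"name": "Zhang San", "city": "Shenzhen"})
put_row("order", 42, {"amount": "19.90"})
print(get_field("user", 42, "city"))  # Shenzhen
```

Compared with option D, a single field can be read or updated without splitting a concatenated string; compared with option C, a row needs one key instead of about 20.
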
Question #39

In Hive applications on FusionInsight HD, some scenarios require high compression efficiency for data storage files, and most queries involve only some of the fields in the file. Such scenarios are suitable for ORC file storage.

  • A . True
  • B . False

Correct Answer: A
Question #40

In FusionInsight HD, which of the following source types does Flume support? (multiple choice)

  • A . Avro
  • B . HBase
  • C . Kafka
  • D . HTTP

Correct Answer: ACD