Exam4Training

Hitachi HCE-5920 Hitachi Vantara Certified Specialist – Pentaho Data Integration Implementation Online Training

Question #1

You have a PDI job with an Oozie JobExecutor entry and you want this job entry to finish before the nextjob entry starts.

How do you accomplish this task?

  • A . Use a Simple Evaluationentry
  • B . Add the timeout property in the Advanced Options pane.
  • C . Use a Wait For entry
  • D . Set the Enable Blocking option

Reveal Solution Hide Solution

Correct Answer: C
Question #2

You need to process data on the nodes within a Hadoop cluster. To accomplish this task, you write a mapper and reducer transformation and use the Pentaho MapReduce entry to execute the MapReduce job on the cluster.

In this scenario, which two steps are required within the transformations? (Choose two.)

  • A . the Madoop Fie Input step
  • B . the Hadoop File Output step
  • C . the MapReduce Input step
  • D . the MapReduce Output step

Reveal Solution Hide Solution

Correct Answer: A,C
Question #3

Which script will execute jobs stored in a Pentaho server from a command line?

  • A . pan
  • B . carle
  • C . kitchen
  • D . spoon

Reveal Solution Hide Solution

Correct Answer: A
A

Explanation:

https://help.hitachivantara.com/Documentation/Pentaho/8.2/Products/Data_Integration/Co mmand_Line_Tools

Question #4

You have a PDI job where you want to dynamically pass a table name to the Table input step of a transformation. You have replacedthe table name reference in the transformation’s Table input step with‘$(table_name)’ but when the transformation runs the table name is shown as ‘$(table_name)’

Which action will correct this issue?

  • A . Add a transformation entry in the job that uses theGet Variables step.
  • B . Add a transformation entry in the job that sets the table_name variable.
  • C . Change the reference in the Table input step to ‘?’
  • D . Add a Set Variable step before the Table input step in the transformation.

Reveal Solution Hide Solution

Correct Answer: D
Question #5

Which two PDI steps are used to parse XML content? (Choose two.)

  • A . the ‘XML Join’ step
  • B . the ‘Modified Java Script Value’ stop
  • C . the ‘Get data from XML’ step
  • D . the ‘Add XML’step

Reveal Solution Hide Solution

Correct Answer: A,B
Question #6

You need to load data from many CSV files into a database and you want to minimize the number of PDI jobs and transformations that need to be maintained.

In which two scenarios is Metadata injection the recommend option? (Choose two.)

  • A . When the files have a different structure andhave different target tables.
  • B . When the files have a different structure and have the same target table.
  • C . When the files have the same structure and have different target tables.
  • D . When the files have the same structure and have the same target table.

Reveal Solution Hide Solution

Correct Answer: B,D
Question #7

You are encryption your database connection password to use in the kettle. properties file.

The output of the encr script is: Encrypted XYZABC123

In this scenario, which syntax is correct?

  • A . DB_PASSWORD=Encrypted (‘XYZABC 123’)
  • B . DB_PASSWORD=String(‘XYZABC123’)
  • C . DB_PASSWORD=XYZABC123
  • D . DB_PASSWORD=Encrypted XYZABC123

Reveal Solution Hide Solution

Correct Answer: A
Question #8

You have slow-running steps in a PDI transformation and you notice that it is taking a long time for subsequent steps to get data and begin processing.

Which action will help solve the problem?

  • A . Reduce the value in the Nr’ of rows in rowset’ option on the Miscellaneous tab in the Transformation properties.
  • B . Select the ‘Enable step performance monitoring?’ option on the Monitoring tab in the Transformation properties.
  • C . Right click on the slow-running steps and select the Load Balance option from the Data Movement submenu.
  • D . Select the ‘Execute for every input row? Option on the Advanced tab of the Transformation properties from the parent job.

Reveal Solution Hide Solution

Correct Answer: A
Question #9

What are two ways to schedule a PDI job stored in the repository? (Choose two.)

  • A . Write a loginscript to start the timer and execute a kitchen script specifying a job in the repository.
  • B . Use the pan script specifying a job in the repository and schedule it using cron.
  • C . Use the kitchen script specifying a job in the repository and schedule it using cron.
  • D . Use Spoon connected to the Pentaho repository and choose Action > Schedule in the menu.

Reveal Solution Hide Solution

Correct Answer: B,C
B,C

Explanation:

https://help.hitachivantara.com/Documentation/Pentaho/8.1/Products/Data_Integration/Sch edule_Perspective#:~:text=or%20kitchen%20commands.-,Schedule%20a%20Transformation%20or%20Job,The%20Schedule%20window%20appe ars.

Question #10

Which two statements are correct about the Repository Explorer? (Choose two.)

  • A . You can create new jobs and transformations.
  • B . You can move. rename, and delete jobs and transformations.
  • C . You can upload JDBC drivers when defining JOBC connections.
  • D . You can lock and unlock jobs and transformations.

Reveal Solution Hide Solution

Correct Answer: B,D
B,D

Explanation:

https://help.hitachivantara.com/Documentation/Pentaho/7.0/0L0/0Y0/040/010

Question #11

In a PDI transformation you are retrieving data from a large lookup table using a Database Lookup step from improve performance, you enable caching in the stepand use the Load all data from table option.

In this scenario, which three statement s are correct about the data flow of the ‘Database Lookup step? (Choose three.)

  • A . When caching is enable, only rows with matching lookup values will passed through.
  • B . There must be enough allocated heap space to store the lookup fields allocated heap space to store the lookup fields in memory.
  • C . Cached comparisons are case sensitive.
  • D . Every input row must have only one matching row in the lookup table
  • E . Only one matching row is used from the Lookup table.

Reveal Solution Hide Solution

Correct Answer: A,B,C
Question #12

You need to design a PDI job that will execute a transformation andthen send an e-mail with an attached log of the transformation’s execution.

Which two sets of actions will accomplish this task? (Choose two.)

  • A . In the mail entry’s options, select the ‘Attach files to message’ option and select the filetype ‘Log’
  • B . In the Transformation entry option, select the ‘Specify logfile’ option and enter a name and extension for the file.
  • C . In the log tab of job properties, configure the Log Connection and the Log table options for the Job entry log tablesection.
  • D . In the Mail entry’s options, select the ‘Attach files to message’ option and select the file type General’

Reveal Solution Hide Solution

Correct Answer: B,D
Question #13

You have a PDI input step that generates data within a transformation.

Which two statements are true about downstream steps in this scenario? (Choose two.)

  • A . The steps will receive a stream of data from the input as soon as it is a available.
  • B . Only one step can receive data from the input step.
  • C . The steps will receive the data once the input step fully fetches it.
  • D . Multiple steps can receive data from the input step.

Reveal Solution Hide Solution

Correct Answer: C,D
Question #14

A client is developing a Web application to implement a wizard-like application used by many users. At several pointsin the workflow, the application needs to execute jobs and transformations that are stored in the Pentaho server. Execution will include user-specific parameters. Upon completion of the job or transformation, the Web application will continue to the next UI page.

What is the recommended approach to accomplish this task?

  • A . Use the Web service API to execute the jobs and transformations on the Pentaho server.
  • B . Use the scheduling tool of the OS to execute the job and transformation on the pentaho server.
  • C . Use the Kitchen and pan scripts to execute the jobs and transformations on the Pentaho server.
  • D . Use the Job and Transformation steps to execute the jobs and transformations on the Pentaho server.

Reveal Solution Hide Solution

Correct Answer: C
C

Explanation:

https://help.hitachivantara.com/Documentation/Pentaho/8.2/Products/Data_Integration/Co mmand_Line_Tools

Question #15

A customer’s transformation Is running slowly in a lest environment. You have access to Spoon and you can run and monitor the job.

How do you troubleshoot this problem?

  • A . Execute the transformation via the pan script and pass the performance gathering parameter.
  • B . Ensure there is enough memory on the Pentaho server and that there are no "Out Of Memory’ errors in the log.
  • C . Make sure the customer is using data partitioning to ensure parallel processing for fasterexecution
  • D . Verify that there are no bottleneck slaps m the transformation by comparing the amount of rows in the input buffer versus the output buffer within the Step Metrics tab

Reveal Solution Hide Solution

Correct Answer: A
Question #16

You want to manage a slowly changing dimension as type II.

Which step in PDI should you use to accomplish this task.

  • A . The ‘Shared Dimension’ step
  • B . The ‘Dimension Lookup/update’ step
  • C . The ‘Combination lookup/update’ step
  • D . The ‘Table output step

Reveal Solution Hide Solution

Correct Answer: D
Question #17

you want to make a dynamic PDI transformation that is driven with variables that areloaded from a properties file.

Which free form text fields within a step can be configured with variables?

  • A . Any free form text field with a ”V” next to it
  • B . any free form text field with a ‘@’ sign next to it.
  • C . any free form text field with a ‘$’ sign next to it.
  • D . Any free form text field with the variable name entered in all caps

Reveal Solution Hide Solution

Correct Answer: B
Question #18

You have multiple transformations that read and process data from multiple text files. You identity a series of steps that are common across transformations and you want to re-use them to avoid duplication of code.

How do you accomplish this?

  • A . Use the "Mapping (sub-transformation)’ step containing the series of steps.
  • B . Use the ETL Metadata Infection’ stop containing the series of steps.
  • C . Use the "Get data from XML’ step to read the series of steps
  • D . Use the ‘Job Executor1 step containing the series of steps

Reveal Solution Hide Solution

Correct Answer: B
Question #19

You are connecting to a secure Hadoopclusterfrom Pentaho and want to use impersonation.

Which Pentaho tool should you use?

  • A . Pentaho Report Designer
  • B . Pentaho Spoon
  • C . Pentaho Security Manager
  • D . Pentaho Server

Reveal Solution Hide Solution

Correct Answer: A
Question #20

You have instated the Pentaho server using an archive installation. You now want to change the server port.

Which file do you modify?

  • A . server.xml
  • B . repositories.xml
  • C . context.xml
  • D . slave-server-config.xml

Reveal Solution Hide Solution

Correct Answer: A
A

Explanation:

Follow the instructions below to change the port through which the Pentaho Server runs:

✑ Stop the Pentaho Server.

✑ Navigate to the /pentaho-server/tomcat/conf/ directory.

✑ Open the server.xml file with any text editor, and search for the value for Define a non-SSL HTTP/1.1 Connector.

Change the port number in the connector port element below from 8080 to your preferred portnumber.

https://help.hitachivantara.com/Documentation/Pentaho/9.0/Setup/Customize_the_Pentaho_Server#:~:text=Change%20the%20port%20numbers%20for%20the%20Pentaho%20Server,-Follow%20the%20instructions&text=Navigate%20to%20the%20%2Fpentaho%2Dserver,to%20your%20preferred%20port%20number.

Exit mobile version