Tuesday , February 27 2024
Breaking News

RapidMiner Data Engineering Professional Certification Exam Quiz Answers

Why would you use the Read Excel operator instead of the Import Data wizard to import data from an Excel spreadsheet into RapidMiner Studio? (Select one)

  • To prevent RapidMiner from overwriting data in the Excel spreadsheet
  • To import data from other spreadsheets with the same metadata without going through the Import Data wizard again
  • To share the spreadsheet data with a colleague
  • To ensure that RapidMiner reads the data as Excel, rather than CSV

While working with a data set, you wish to create a new attribute which subtracts 1 week from an attribute named “date”. There are no time changes to be concerned with in the data:

Which of the following are valid methods to create this new attribute in RapidMiner Studio? (Select ANY correct answer)

  • use the Adjust Date operator with adjustment=-7 and date unit=Day
  • use the Generate Attributes operator with the function ‘ date_add(date,-1,DATE_UNIT_WEEK) ‘
  • use the Date to Numerical operator, subtract the number of milliseconds in one week, and then use the Numerical to Date operator
  • none of the above methods are able to create this new attribute in RapidMiner Studio

What is the function of the Process Documents operator with TF-IDF vectors selected? (Select one)

  • to transform a text attribute into multiple numerical attributes for future modeling
  • to identify common topics from the entire corpus
  • to create an association rule graph
  • to ascertain the quality of the text prior to modeling

You have a data set containing first and last names like this:

You now want to create a new attribute called “Full Name” which will look like this:

Pick any valid way to accomplish this task (Select ANY correct answer)

  • use the Reorder Attributes and the Generate Aggregation operator.
  • use the Generate Concatenation operator.
  • use the Generate Attributes operator and the concat function.
  • use the Pivot operator.

Question: You have two ExampleSets and wish to join them into one ExampleSet as shown below:

Which type of join will produce the desired result? (Select ANY correct answer)

  • inner
  • left
  • right
  • outer

Question: You have an input table with direct mailing transactions.  Two of the columns are label and earnings (indicated by red boxes below) and you want to aggregate the data into a 2×2 table as shown:

  • Which of the two processes shown below will product this result? (Select one)
  • Option A
  • Option B

Question: The following text is a movie review of the film “Rat Race”:

After watching “Rat Race” last week, I noticed my cheeks were sore.

The text is entered into Create Document, then into Tokenize (by word), and then Stem (Porter) as shown below:

Which of the following is the correct output of this process? (Select one)

  • watching Rat Race week I noticed cheeks sore
  • after watch rat race last week i notic my cheek were sore
  • After After_watching watching watching_Rat Rat Rat_Race Race Race_last last last_week week week_I I I_noticed noticed noticed_my my my_cheeks cheeks cheeks_were were were_sore sore
  • After watching Race last week noticed cheeks were sore

Question: You observe your colleague dragging a process called “Normalization” from her Local Repository onto the Process panel and connecting it as shown:

What will this new operator named “Execute Normalization” do with the output from Filter Examples? (Select one)

  • It will create a new visualization called “Normalization”.
  • It will execute a Jupyter notebook called “Normalization”.
  • It will create a new building block called “Normalization”.
  • It will execute “Normalization” as an embedded process.

Question: You have imported a data set into RapidMiner Studio containing sales transactions which look like this:

Unfortunately the date attribute was imported as type nominal. To convert this attribute to a date-time type, you could (Select ANY correct answer)

  • use the Nominal to Date operator with date format = ‘yyyy-MM-dd HH:mm:ss’
  • use the Numerical to Date operator with no time offset
  • use the Generate Attributes operator with the expression = ‘date_parse_custom(date,”yyyy-MM-dd HH:mm:ss”) ‘
  • add a new Turbo Prep subprocess with a ‘Change to date and time’ Transformation and format = ‘yyyy-MM-dd HH:mm:ss’

Question: You have two ExampleSets and wish to combine them into one ExampleSet as shown below:

Which of the following RapidMiner Studio processes will produce this desired result? (Select one)

  • Option A
  • Option B
  • Option C
  • Option D

Question: What is the function of this group of operators in RapidMiner Studio? (Select one)

  • to read/write file objects in Amazon S3
  • to read/write ExampleSets in Amazon S3
  • to read/write processes in Amazon S3
  • to stream databases in Amazon S3

Question: The Sonar data set has one nominal, special attribute named “class” and 60 real, regular attributes. A small part of the data set is shown below:

To rename ALL of the regular attributes so that they no longer contain the “_” character, you could (Select ANY correct answer)

  • add the Rename by Replacing operator to your process, replacing the “_” character with nothing in the Parameters panel.
  • add the Rename operator to your process, entering each “old” and “new” attribute name in the Parameters panel.
  • go to Turbo Prep, rename the attributes, commit the transformations, and then add the result to your process.
  • add the Rename by Generic Names operator to your process, entering the “_” character as the generic name stem in the Parameters panel.

Question: Which of the following techniques in RapidMiner Studio can be used to organize your process and/or make it more understandable to others? (Select ALL correct answers)

  • grouping operators into subprocesses
  • attaching colored notes to operators
  • adding colored notes to a process
  • editing the default names of operators in a process
  • None of the above

Question: The Generate Sales Data operator creates a fictitious ExampleSet of sales transaction data:

You put this operator into a blank process and connect it to a Select Attributes operator with certain parameters, as shown below:

Which attribute(s) will be in the results? (Select one)

  • date
  • transaction_id, customer_id, product_id, amount
  • customer_id, product_id, amount
  • customer_id, product_id, date, amount

Question: In order to train a machine learning model to predict an attribute named “Churn”, you should set its role to (Select one)

  • churn
  • id
  • label
  • prediction

Question: To remove one or more attributes from an ExampleSet in a RapidMiner Studio process, you can (Select ALL correct answers)

  • use the Delete Attributes operator.
  • use the Select Attributes operator and check the ‘invert selection’ option in the Parameters panel.
  • use the Filter Attributes operator.
  • use the Select Attributes operator and choose the subset of desired attributes in the Parameters panel.

Question: You have an ExampleSet of movie reviews with a polynominal attribute named “text” as shown below:

To change the word “film” to “movie” in the attribute named “text”, you could (Select ANY correct answer)

  • Option A
  • Option B
  • Option C
  • Option D

Question: A sample of the Titanic data set with seven examples and five attributes is shown below:

A Filter Examples operator is now applied to the sample with these parameters:

How many examples will be in the resulting ExampleSet after this Filter Examples operator is applied to the sample? (Select one)

Hint: Notice  the ‘Match any’  radio box in the image above

  • 1
  • 2
  • 3
  • 4

Question: Which of the following allows you to create a new attribute containing the square root of an existing attribute? (Select ANY correct answer)

  • Generate Function Set
  • Auto Model
  • Generate Attributes
  • Turbo Prep → Generate

Question: You have a table with sales transactions over time.  Three of the columns are Product Category, Units Sold, and State (indicated by red boxes below):

  • Group by=’Product Category’, Column grouping=’Units Sold’, Aggregation=sum of ‘State’
  • Group by=’State’, Column grouping=’Product Category’, Aggregation=sum of ‘Units Sold’
  • Group by=’Product Category’, Column grouping=’Units Sold’, Aggregation=sum of ‘State’
  • Group by=’State’, Column grouping=’Units Sold’, Aggregation=sum of ‘Product Category’

Question: In order to always import the most recent entries from a database table into your RapidMiner Studio process, you should (Select one)

  • use the Manage Database Connections wizard each time you run the process.
  • use the Update Database operator to refresh the data, and then use the Read Database operator.
  • use the Read Database operator to retrieve data from the table each time you run the process.
  • always ensure that you have the most recent JDBC drivers installed in RapidMiner Studio.

Question: You have two example sets that both contain an attribute named “Age” and wish to use Union to create one ExampleSet as shown below:

Pick one of the following processes could produce this desired result? (Select ANY correct answer)

  • Option A
  • Option B
  • Option C
  • Option D

About Clear My Certification

Check Also

Contact Us

[contact-form-7 id=”dd026da” title=”Contact form 1″]

Leave a Reply

Your email address will not be published. Required fields are marked *