You can use multiple copies of your output step and connect the preceeding step by giving ‘Distribute rows’ while you connect the hop.
No, the default transformation architecture is to run in parallel while jobs run in sequence. Changing this requires an architecture change that might affect the performance
Yes, you can use the ‘Memory Group By’ step in the transformation which helps you to group by without doing any sorting.
Yes, we can use the ‘Dimensional Lookup’ step in the transformation which can perform the SCD type-2 logic directly. You can also ‘Merge Diff’ step which also does the same
It can be done when you enable ‘Execute for every input row’ option. You can see this option when you double click the transformation/job under Options tab under Execution section
Yes, you can use the ‘Get System Info’ step in a transformation to get the Pentaho version. In the ‘Type’ column choose ‘Kettle version’
You can calculate aggregate functions over the whole dataset by leaving this The Fields that Make up the Group Table blank in the ‘Group By’ step.
No, we cannot form a loop inside the same transformation. But we can form a loop between transformation/jobs inside a job
You can use bulk loading option which contains Vertica Bulk Loader, Oracle Bulk Loader, MySQL Bulk Loader, etc.
We can execute a prepared sql join statement directly in the database join step whereas we cannot do that in the regular join