Overview:
The Profile node provides statistical
summaries and visual insights for selected columns in a dataset. It helps users
understand distributions, missing values, and basic metrics before further
transformations.
Location:
Pipeline → Data Preparation → Profile
Predecessor Rules:
• Only a single predecessor is allowed.
• Predecessor must output tabular data.
• Works with all RDBMS datasets, template file readers, and template table
readers.
Feature Selection:
• Features can be selected only from the
property pane.
• Selected columns appear visually inside the Profile node.
• Works after renaming, filtering, or metadata refresh.
Execution:
On execution, the node calculates basic
statistics for each selected column.
Statistics and charts appear in Explore view → Result tab.

Statistics Generated:
• Missing Values
• Missing Value %
• Mean
• Std
• Min
• Max
• Freq
• Top
• Unique
Rules:
• For categorical/geographical/interval columns → Mean, Std, Min, Max = None
• For numerical columns → Freq, Top, Unique = None