- The Reader in Rubiscape Pipelines is a component used to ingest data from various sources into a pipeline.
- It acts as the entry point of the pipeline, enabling users to read structured, semi-structured from files, databases, APIs, cloud storage, or streaming sources.
- The Reader ensures smooth data flow into downstream pipeline steps for processing, transformation, analysis, or model execution.
The Reader node is responsible for:
• Connecting to a data source
• Fetching data in supported formats
• Converting incoming data to a pipeline-ready dataset
• Passing the data to the next pipeline component
Readers are essential because every pipeline starts with a data ingestion step.
Rubiscape supports multiple types of readers:
• File Reader
- Open Rubiscape → Pipelines
- Create or edit a pipeline
- Drag the Reader node from the Task panel onto the canvas (TaskPane >> Connect >> Reader )
- The properties pane on right hand side show below details for reader configuration:
- The labels in above screenshot are labelled as below:
1 : Task Name - To update the task name/Reader Name2 : Data Fields - Option to configure the columns of a Reader3 : Metadata Refresh - Option to refresh metadata (Column Name Changes/Column Configuration changes)4 : Filter Condition - Option to configure filtering condition based on logical expression