The purpose of the From DB process is to define the input database table, schema, and load into a big data environment. A database connection will need to be defined first on the Connections page and that Database Connection will need to be dragged onto the canvas and connected to the From DB process.
Process Configuration
Configuration of this process includes three screens.
Input
Table - name of the table within the source database
Incremental - toggle to load in all data or if data should be loaded incrementally
Schema
Columns - comma separated list of input table column headers
Time Source - column designating a timestamp, required incremental loading
Output
The Outputs tab provides the ability to name table and displayed name on the graph canvas, along with selecting whether to load to Big Query (BQ) if in the the Google Cloud Platform (GCP), load to Redshift or RDS if in Amazon Web Services (AWS), or simply write to HDFS if an using on-premise Hadoop.
Expected Output
Data from the source connection database and defined table in the Input tab of this process. This data gets loaded to Hive, Syntasa is aware of the State, and optionally to other environment specific query engines.