The purpose of the From DB process is to define the input database table, schema, and load into a big data environment. A database connection will need to be defined first on the Connections page and that Database Connection will need to be dragged onto the canvas and connected to the From DB process.
Configuration of this process includes three screens.
Table - name of the table within the source database
Incremental - toggle to load in all data or if data should be loaded incrementally
Columns - comma separated list of input table column headers
Time Source - column designating a timestamp, required incremental loading
The Outputs tab provides the ability to name table and displayed name on the graph canvas, along with selecting whether to load to Big Query (BQ) if in the the Google Cloud Platform (GCP), load to Redshift or RDS if in Amazon Web Services (AWS), or simply write to HDFS if an using on-premise Hadoop.
Data from the source connection database and defined table in the Input tab of this process. This data gets loaded to Hive, Syntasa is aware of the State, and optionally to other environment specific query engines.