From DB – SYNTASA™

The purpose of the From DB process is to define the input database table, schema, and load into a big data environment. A database connection will need to be defined first on the Connections page and that Database Connection will need to be dragged onto the canvas and connected to the From DB process.

Process Configuration

Configuration of this process includes three screens.

Input

Table - name of the table within the source database

Incremental - toggle to load in all data or if data should be loaded incrementally

Schema

Columns - comma separated list of input table column headers

Time Source - column designating a timestamp, required incremental loading

Output

The Outputs tab provides the ability to name table and displayed name on the graph canvas, along with selecting whether to load to Big Query (BQ) if in the the Google Cloud Platform (GCP), load to Redshift or RDS if in Amazon Web Services (AWS), or simply write to HDFS if an using on-premise Hadoop.

Expected Output

Data from the source connection database and defined table in the Input tab of this process. This data gets loaded to Hive, Syntasa is aware of the State, and optionally to other environment specific query engines.

{[{category.name}]}

Process Configuration

Input

Schema

Output

Expected Output