In modern data engineering and data science workflows, version control is essential for collaboration, reproducibility, governance, and operational reliability. Syntasa integrates with Git, the industry-standard distributed version control system, enabling teams to manage code, notebooks, and application definitions in a structured and auditable way.
This article explains what Git is in the context of Syntasa and how Git integration works across the platform—from development in notebook workspaces to application lifecycle management.
What is Git?
Git is a distributed version control system that tracks changes to files over time. It allows multiple users to work on the same project simultaneously, maintain a full history of changes, and safely experiment using branches without impacting production code.
Within Syntasa, Git serves as the system of record (Source of Truth) for:
Notebook code (Python, SQL, etc.)
Configuration files
Syntasa App (Dataflow) definitions
By externalizing version control to Git, Syntasa ensures that logic and configuration remain portable, secure, and independently recoverable from the platform itself.
How Git Works in Syntasa
Syntasa does not host its own Git server. Instead, it acts as a secure Git client that connects to your existing Source Control Management (SCM) provider. The integration is built around three core layers.
Git Repo Connection (The Integration Bridge)
The Git Repo Connection is the foundation of Git integration in Syntasa. It defines how the platform authenticates and communicates with an external repository.
Key Characteristics
Reusable Resource
A single Git Repo Connection can be reused across multiple notebooks, apps, and users.Secure Authentication
Supported authentication methods include:Personal Access Tokens (PAT)
SSH private keys
All credentials are encrypted at rest within the Syntasa business service.
Real-Time Validation
The Test Repo action verifies connectivity and fetches available branches directly from the remote repository.Branch Awareness
Once validated, the connection dynamically exposes all repository branches for selection.
This connection acts as the trusted bridge between Syntasa and your SCM provider.
Git for App Lifecycle Management (Deployment Layer)
Beyond notebooks, Git is also used to manage the lifecycle of Syntasa Apps (Dataflows).
Importing Apps from Git
Apps can be imported directly from a Git repository into Syntasa.
This supports promotion of apps across environments (Development → QA → Production).
App definitions remain consistent and versioned across deployments.
Exporting Apps to Git
Existing Syntasa apps can be exported back to a Git repository.
This ensures pipeline logic is version-controlled alongside notebooks and scripts.
Git becomes the authoritative record for app configuration changes.
This approach enables Git-driven deployment strategies and environment parity.
Supported SCM Providers
Syntasa is provider-agnostic and currently supports:
GitHub (Public and Enterprise)
GitLab (SaaS and Self-Managed)
Bitbucket (Cloud and Server)
Key Benefits of Git Integration in Syntasa
Collaboration
Multiple users can work on the same project using branches and standard Git workflows without overwriting each other’s work.
Auditability
Every change to code or app configuration is tracked, providing a complete history of who changed what and when.
Disaster Recovery
Because logic is stored in an external Git repository, work can be recovered even if a workspace or app is deleted.
Environment Portability
Git-backed apps and notebooks can be reliably moved across Syntasa environments.
CI/CD Readiness
Storing app definitions in Git enables integration with automated CI/CD pipelines for testing and controlled deployments.
End-to-End Workflow Summary
Connect
Create a Git Repo Connection in Syntasa using a Personal Access Token or SSH key.Develop
Link the connection to a Notebook Workspace to write, version, and collaborate on code.Deploy
Import or export Syntasa Apps using Git to promote solutions across environments.
Conclusion
By integrating Git at both the development and deployment layers, Syntasa ensures that data projects are collaborative, reproducible, auditable, and enterprise-ready. Git becomes the backbone of governance and lifecycle management, while Syntasa provides a seamless, secure interface to operationalize it across notebooks and applications.