fivetran / hybrid_deployment

Apache License 2.0
0 stars 0 forks source link

Fivetran Hybrid Deployment Agent

Hybrid Deployment from Fivetran enables you to sync data sources using Fivetran while ensuring the data never leaves the secure perimeter of your environment. It provides flexibility in deciding where to host data pipelines, with processing remaining within your network while Fivetran acts as a unified control plane. When you install a hybrid deployment agent within your environment, it communicates outbound with Fivetran. This agent manages the pipeline processing in your network, with configuration and monitoring still performed through the Fivetran dashboard or API.

For more information see the Hybrid Deployment documentation

Hybrid Deployment can be used with:

Note: You must have a valid agent TOKEN before you can start the agent. The TOKEN can be obtained when you create the agent in the Fivetran Dashboard.


Using Hybrid Deployment with containers

For detail instructions see the online documentation.

The following approach can be used to setup the environment.

Note: Docker or Podman must be installed and configured, and it’s recommended to run them in rootless mode.

Expand for instructions on using containers ### Step 1: Install and Start the agent Run the following as a non root user on a x86_64 Linux host with docker or podman configured. Use the command below with your TOKEN and selected RUNTIME (docker or podman) to install and start the agent. ``` TOKEN="YOUR_AGENT_TOKEN" RUNTIME=docker bash -c "$(curl -sL https://raw.githubusercontent.com/fivetran/hybrid_deployment/main/install.sh)" ``` The `install.sh` script will create the following directory structure under the user home followed by downloading the agent container image and starting the agent. Directory structure will be as follow: ``` $HOME/fivetran --> Agent home directory ├── hdagent.sh --> Helper script to start/stop the agent container ├── conf --> Config file location │   └── config.json --> Default config file ├── data --> Persistent storage used during data pipeline processing ├── logs --> Logs location └── tmp --> Local temporary storage used during data pipeline processing ``` A default configuration file `config.json` will be created in the `conf/` sub folder with the token specified. Only the agent TOKEN is a required parameter, [optional parameters](https://fivetran.com/docs/core-concepts/architecture/hybrid-deployment/setup-guide#agentconfigurationparameters) listed in the documentaiton. The agent container will be started at the end of the install script. To manage the agent container, you can use the supplied `hdagent.sh` script. ### Step 2: Manage agent container Use the `hdagent.sh` script to manage the agent container. The default runtime will be docker, if using podman use `-r podman`. Usage: ``` ./hdagent.sh [-r docker|podman] start|stop|status ```

Using Hybrid Deployment with Kubernetes

Note: Hybrid Deployment Kubernetes support is in Private Preview.

Review the requirements and detailed setup guide as outlined in the online documentation

Requirements:

Expand for instructions on installation of agent in Kubernetes Installation: ```bash helm upgrade --install hd-agent \ oci://us-docker.pkg.dev/prod-eng-fivetran-ldp/public-docker-us/helm/hybrid-deployment-agent \ --create-namespace \ --namespace default \ --set config.data_volume_pvc=YOUR_PERSISTENT_VOLUME_CLAIM \ --set config.token="YOUR_TOKEN_HERE" \ --version 0.1.0 ``` > Notes: > * Replace `YOUR_PERSISTENT_VOLUME_CLAIM` with your Persistent Volume Claim name. > * Replace `YOUR_TOKEN_HERE` with your agent token (obtained from Fivetran dashboard on agent creation) To confirm installation review: ``` helm list -a kubectl get deployments -n kubectl get pods -n kubectl logs ``` Uninstall: ``` helm uninstall hd-agent ```