Manage Databricks Data Sources
This section describes ALTR’s capabilities for managing Databricks data sources. ALTR must be connected to a data source in order to enforce data access governance and advanced data protection on sensitive data.
Create an ALTR Account
Section titled “Create an ALTR Account”To create an ALTR account, navigate to altr.com, click Login and follow the screen instructions to create an account.
Databricks Connection Overview
Section titled “Databricks Connection Overview”ALTR connects to Databricks using two methods: a service principal and an API.
The service principal is a login to the customer’s Databricks account that allows ALTR to create and manage objects for tag-based policy.
The API enables Databricks to interact with ALTR’s cloud-based access control engine to make access decisions for tag-based policy.
Unlike other data security platforms, ALTR is a true cloud-to-cloud SaaS solution. There’s no proxy to install or custom views to maintain. Instead, ALTR integrates directly with Databricks via API, simplifying deployment and reducing latency.
Data Source Connections
Section titled “Data Source Connections”When connecting a Databricks metastore to ALTR, there are a few requirements and limitations.
Requirements:
- Databricks Premium Tier
- Databricks on AWS
- Unity Catalog
Not Supported:
- Databricks on Azure
- Databricks on GCP
- Serverless Compute
- Hive Catalog
Connect Databricks Data Sources
Section titled “Connect Databricks Data Sources”When connecting your account to ALTR, you will need the following information:
- Workspace Hostname
- Service Principal ID
- Cluster ID
- OAuth Secret
For assistance locating these fields in your Databricks account, see Locate Required Information to Connect.
To connect a Databricks metastore:
- Select Data Configuration > Data Sources in the navigation menu.
- Click Add Data Source.
- On the Databricks card, click Select.
- Enter a user-friendly Display Name for the connection.
- Enter the Workspace Hostname.
- Enter the Service Principal ID.
- Enter the Cluster ID.
- Enter the OAuth Secret.
- Click Connect Data Source.
Check Databricks Connections
Section titled “Check Databricks Connections”Check the data source connection if you suspect there’s an issue with the connection, such as if you received errors or the data source has changed.
To check the data source connection:
- Select Data Configuration > Data Sources in the navigation menu.
- Click the data source that you want to test the connection.
- Click the Connection Details tab.
- Click Check Connection.
Remove Databricks Data Sources
Section titled “Remove Databricks Data Sources”Remove a Databricks data source from ALTR if your service user is having problems or some other issue has occurred with the data source.
To remove a data source:
- Select Data Configuration > Data Sources in the navigation menu.
- Select the data source you wish to disconnect.
- Click the Remove Data Source button. The process to remove a data source can take up to several minutes to complete.