Overview
Heap’s Databricks integration syncs your Heap data to Databricks so you can use Heap behavioral data in other tools.
Prerequisites
To connect this integration, you'll need the following:
- Admin or Architect privileges in Heap
- Access to an AWS-hosted Databricks account that uses the Unity Catalog
Setup
To get started, navigate to Integrations > Directory, search for Databricks, then select it from the results.
You’ll be prompted to provide the following information:
- Hostname: The ID of your Databricks account, which you can find in the account URL.
- Path: The path of the warehouse you are connecting via this integration.
- Catalog (optional): The catalog that this Heap data should sync to; if left blank, this integration will create a new catalog.
- Schema (optional): The schema that this Heap data should sync to; if left blank, this integration will create a new schema.
- Token: This is required to allow Heap to write to the schema. The token must be a Personal Access Token (PAT) rather than an OAuth Token.
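Before you submit the form, it can help to sanity-check the values you plan to enter. The sketch below is only an illustration and is not part of Heap or Databricks: the `DatabricksConnection` class and both helper functions are hypothetical names. It assumes that the Hostname is the host portion of your Databricks URL and that the warehouse Path begins with `/` (for example, `/sql/1.0/warehouses/...`). Adjust the checks if your account uses a different format.

```python
from dataclasses import dataclass
from typing import List, Optional
from urllib.parse import urlparse


@dataclass
class DatabricksConnection:
    """Hypothetical container mirroring the fields in the setup form."""
    hostname: str
    path: str
    token: str
    catalog: Optional[str] = None  # left blank: the integration creates one
    schema: Optional[str] = None   # left blank: the integration creates one


def normalize_hostname(value: str) -> str:
    """Return only the host part, dropping any scheme, path, or query."""
    value = value.strip()
    if "://" not in value:
        # urlparse only recognizes the host when a scheme is present.
        value = "https://" + value
    return urlparse(value).hostname or ""


def check_connection(conn: DatabricksConnection) -> List[str]:
    """Return a list of human-readable problems; empty means none found."""
    problems = []
    if not normalize_hostname(conn.hostname):
        problems.append("Hostname is empty or not a valid host.")
    # Assumption: warehouse paths are absolute and start with '/'.
    if not conn.path.strip().startswith("/"):
        problems.append("Path should start with '/'.")
    if not conn.token.strip():
        problems.append("Token is required (use a Personal Access Token).")
    return problems
```

These checks only catch obvious formatting mistakes. They do not confirm that the token is valid or that it has permission to write to the schema.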
Once all those fields are populated, click the Connect button.
That’s it! Once setup is complete, the first sync should appear within 48 hours and include the following built-in tables:
- Pageviews
- Sessions
- Users
- user_migrations
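After the first sync lands, one way to confirm that the tables listed above exist is to run a row count against each of them in Databricks. The sketch below only builds the query strings. It assumes the tables are addressed through Unity Catalog's three-level `catalog.schema.table` namespace and that the table names match the list above. The function names and the `main` and `heap` values in the usage example are hypothetical, and the exact casing of table names in your workspace may differ.

```python
from typing import List

# Table names as listed in this article; casing in your workspace may differ.
BUILT_IN_TABLES = ["Pageviews", "Sessions", "Users", "user_migrations"]


def quote_identifier(name: str) -> str:
    """Wrap an identifier in backticks, doubling any embedded backtick."""
    return "`" + name.replace("`", "``") + "`"


def row_count_queries(catalog: str, schema: str) -> List[str]:
    """Build one COUNT(*) query per built-in table."""
    prefix = f"{quote_identifier(catalog)}.{quote_identifier(schema)}"
    return [
        f"SELECT COUNT(*) AS row_count FROM {prefix}.{quote_identifier(table)}"
        for table in BUILT_IN_TABLES
    ]


if __name__ == "__main__":
    # "main" and "heap" are placeholder catalog and schema names.
    for query in row_count_queries("main", "heap"):
        print(query)
```

You can paste each printed statement into a Databricks SQL editor. A query that fails with a "table not found" error most likely means that the sync has not completed yet or that the catalog or schema name is different.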