Skip to main content

Azure Databricks Running Environment

Databricks provides the combination of data lakehouse storage, analytics processing, and artificial intelligence capabilities in a single unified platform. For job execution, the Databricks running environment can be hosted in the Azure or AWS ecosystems.

Note

This running environment is available only if you install Designer Cloud Powered by Trifacta Enterprise Edition on Azure.

Tip

In the Run Job page, select Spark (Databricks) to run the job on this running environment when the Trifacta Application has been integrated with it.

Additional configuration is required.

Note

Use of Azure Databricks is not supported on Marketplace installs.

Note

When executing a job on the Azure Databricks running environment using a relational source, the job fails if one or more columns has been dropped from the underlying source table. As a workaround, the recipe panel may show steps referencing the missing columns, which can be used to either fix the recipe or the source data.

For more information, see Configure for Azure Databricks.

For more information on Databricks, see https://databricks.com/.