Skip to main content

Google Data Catalog Connections

Note

This feature may not be available in all product editions. For more information on available features, see Compare Editions.

Google Data Catalog is a fully managed and highly scalable data discovery and metadata management service. For more information, see https://cloud.google.com/data-catalog.

Tip

This connection is in early preview. It is read-only and available only in SaaS product editions. For more information on early previews, see Early Preview Connection Types.

Limitations and Requirements

Note

During normal selection or import of an entire table, you may encounter an error indicating a problem with a specific column. Since some tables require filtering based on a particular column, data from them can only be ingested using custom SQL statements. In this case, the problematic column can be used as a filter in the WHERE clause of a custom SQL statement to ingest the table.

  • For more information, please consult the CData driver documentation for the specific table.

  • For more information on using custom SQL, see Create Dataset with SQL.

Note

For filtering date columns, this connection type supports a set of literal functions on dates. You can use these to reduce the volume of data extracted from the database using a custom SQL query. For more information, see the pg_dateliteralfunctions.htm page in the driver documentation for this connection type.

  • OAuth 2.0 authentication is required.

    • An OAuth 2.0 web client is created for you in the Trifacta Application.

    • You cannot create OAuth 2.0 connections via API.

  • Custom SQL queries must be provided to ingest data from tables (except for tables "Table" and "Schemas"). For example:

    SELECT * FROM TableColumns WHERE ResourceName =<resourcename>;

Create Connection

via Dataprep by Trifacta application

When you create the connection, please review the following properties and specify them accordingly:

Connection Property

Description

Project Id

The ID associated with the Google Cloud Platform project resource to which you would like to connect.

Tip

Navigate to the Google Cloud Console dashboard and select your project from the Select from drop-down list. The project ID is present in the Project info card.

Connect String Options

The following default value sets the connection timeout in seconds:

Timeout=0;

Setting this value to 0 disables timeouts.

OAuth2 Client

The client is displayed.

Note

When you create the connection in this window, you must click Authenticate, which authenticates to the app. This step is required.

Default Column Data Type Inference

Leave this value as Enabled.

For more information, see the driver documentation http://cdn.cdata.com/help/HGG/jdbc/default.htm.

Data Type Conversions

For more information, see the driver documentation http://cdn.cdata.com/help/HGG/jdbc/default.htm.