Page tree

Trifacta Dataprep


Contents:

 

Contents:


When you import data into  Cloud Dataprep by TRIFACTA® INC., you are creating a reference to a source of data; the source is never touched. When the data is required for use,  Cloud Dataprep by TRIFACTA INC. reads a sample of the source data into the application for your use. Data is read into the application through an object called a connection.

The following are the supported types of connection for the product:

  • Upload/Download: You can upload data directly from your local desktop. You can also save it locally on export.


  • Relational sources: You can read from database tables into the product.

  • Google Cloud Storage: Read from and write to files in cloud storage.

  • BigQuery: You can store relational content in BigQuery, from which Cloud Dataprep by TRIFACTA INC. can read.

Locate Connections

You already have a set of connections that you can use. Connections can be either read-only or read-write.

Read-only:

  1. In the Home page, click the Library icon in the left nav bar. 
  2. In the Library page, click Import Data.
  3. In the Import Data page, your list of available connections is displayed in the left nav bar. In this location, you can use the listed connections to read from data only.

See Import Data Page.

Read-write:

  1. In the Home page, click the Settings icon in the left nav bar.
  2. In the Settings page, click the Connections link.
  3. In the Connections page, you can review the list of connections that you can access.

See Connections Page.


Use Connections

Read-Only

  1. In the Import Data page, select one of the available connections. 
  2. Navigate through the connection to select the asset to import.
  3. Select the object and click Open.
  4. In the Import Data page, review the settings of the asset in the card in the right panel. Make updates as needed. 

For more information, see Import Data Page.

Write

NOTE: You cannot write or publish results until connections have been created for you.

Write results:

You write results through a connection by specifying a set of settings.

  1. In the Run Job page, click Add Publishing Action.
  2. In the left nav bar, select the connection.
  3. Specify the settings for the publishing action.
  4. Run the job.
  5. When it successfully completes, the specified results are published through the selected connection.

See Run Job Page.

Create Connection

NOTE: Connections may be created by your Trifacta administrator. Some connections require additional configuration outside of the application. See Connection Types.

When a new connection is created, it is initially available only to you.

Pre-requisites:

Before you create a new connection, please verify the following:

  • On the datastore, you have read and (optionally) write locations.
  • You have credentials to use to connect to this datastore. These credentials have permissions on your read/write locations.
  • Some datastores require a special connection string, which must be inserted as part of the connection object.

Read-only:

  1. In the Import Data page, click the New icon in the left nav bar.
  2. In the Create Connection window, specify the parameters of the connection.

For more information, see Create Connection Window.

Read-write:

  1. In the Connections page, click Create Connection.
  2. Click the connection category or search for a specific connection to create.
  3. If a connection is grayed out:
    1. It may already exist. Some connections types permit only one globally available connection.
    2. It may not be supported in your product.
    3. It may be read-only.
  4. Click the name of the connection.
  5. In the Create Connection window, specify the parameters of the connection. 

See Create Connection Window.

Share Connection

Through the Connections page, you can share your private connection with other users.

NOTE: When you share a connection, you can choose to share your credentials with the connection. Those credentials may provide access to specific areas of the datastore.


  1. In the Connections page, locate the connection you wish to share.
  2. In the context menu for the connection, select Share....
  3. Specify the type of sharing from the drop-down and, if applicable, the users with whom you wish to share.

See Share Connection Window.

Delete Connection

NOTE: You can delete a connection only if you are the connection owner and the connection is not used to import any datasets.

 

  1. In the Connections page, locate the connection to remove. 
  2. In the context menu, select Delete....
  3. The connection is deleted.

See Connections Page.

Automation

You can create, edit, or delete connections through the APIs. See API Reference.

This page has no comments.