This section provides information on how to enable connection to Teradata databases.
- Teradata provides data warehousing and analytics solutions and marketing applications. The Teradata database supports all of its data warehousing solutions. For more information, see http://www.teradata.com.
For more information on supported versions, see Connection Types.
This connection supports reading and writing. You can create multiple Teradata connections in the Trifacta application.
- By default, Teradata does not permit the publication of datasets containing duplicate rows. Workarounds:
The final step of any recipe that generates results for Teradata should be a Remove duplicate rows transformation.
NOTE: The above transformation removes exact, case-sensitive duplicate rows. Teradata may still prevent publication for case-insensitive duplicates.
- It's possible to change the default writing method to Teradata to enable duplicate rows. For more information, contact Trifacta Support.
- When creating custom datasets using SQL from Teradata sources, the ORDER BY clause in standard SQL does not work. This is a known issue.
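The difference between exact and case-insensitive duplicates described in the note above can be illustrated with a small sketch. This is not Trifacta or Teradata code: the sample rows and function names are hypothetical, and it assumes that the target table compares string values without regard to case.

```python
def dedupe_exact(rows):
    """Drop exact (case-sensitive) duplicate rows, keeping first occurrences."""
    seen = set()
    unique = []
    for row in rows:
        if row not in seen:
            seen.add(row)
            unique.append(row)
    return unique


def case_insensitive_duplicates(rows):
    """Return rows that match an earlier row once string values are case-folded."""
    seen = set()
    clashes = []
    for row in rows:
        # Assumption: only string columns are compared case-insensitively.
        key = tuple(v.casefold() if isinstance(v, str) else v for v in row)
        if key in seen:
            clashes.append(row)
        else:
            seen.add(key)
    return clashes


# Hypothetical sample data: two exact duplicates and one case variant.
rows = [("Alice", 1), ("Alice", 1), ("ALICE", 1), ("Bob", 2)]
deduped = dedupe_exact(rows)
remaining = case_insensitive_duplicates(deduped)
```

Here `deduped` still contains both `("Alice", 1)` and `("ALICE", 1)`, and `remaining` lists the second of these as a row that could still be rejected at publication time.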
Download and Install Teradata drivers
To enable connectivity, you must download and install the Teradata drivers into an accessible location on the Trifacta® node.
NOTE: Please download and install the Teradata driver that corresponds to your version of Teradata. For more information on supported versions, see Connection Types.
- If you don't have a Teradata developer account, create one here: https://downloads.teradata.com/user/register
- Log in to the account. Navigate to http://downloads.teradata.com/download/connectivity/jdbc-driver
- Download the JDBC driver in ZIP or TAR form.
- Copy the downloaded ZIP or TAR file to the Trifacta node.
- Extract and place the JAR file into a folder accessible to the Trifacta user.
- Verify that the Trifacta user is the owner of the JAR file and its parent folder.
- You can apply this change through the Admin Settings Page (recommended) or trifacta-conf.json. For more information, see Platform Configuration Methods.
- Locate the data-service.classpath setting. To the classpath value, add the folder where you installed the JAR file. For the new entry, remember to add the following:
- Add a prefix of
Add a suffix of
Whole classpath example:
- Add a prefix of
- Save your changes and restart the platform.
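The ownership requirement in the steps above can be checked before restarting the platform. The following is a minimal sketch for a POSIX system, not part of the product; the function name is hypothetical, and the numeric user ID of the Trifacta user must be supplied by the caller.

```python
from pathlib import Path


def owned_by(path, uid):
    """Return True if both the file and its parent folder are owned by uid."""
    p = Path(path)
    # st_uid is the numeric owner ID reported by the filesystem.
    return p.stat().st_uid == uid and p.parent.stat().st_uid == uid
```

For example, if the Trifacta user were named `trifacta` (an assumption), its ID could be looked up with `pwd.getpwnam("trifacta").pw_uid` and passed together with the path where the JAR file was extracted.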
Increase Read Timeout
Particularly when reading from large Teradata tables, you might experience read timeouts in the Trifacta application.
The default setting is 300 seconds (5 minutes). You should consider raising this limit if you are working with large tables.
For more information, see Configure Photon Running Environment.
Create Teradata Connection
For more information on creating a Teradata connection, see Create Connection Window.
The properties that you provide are inserted into the following URL, which connects Trifacta to the Teradata database:
The Connection URL is mostly built up automatically using cluster configuration for the platform.
Connect string options
The connect string options are optional. If you are passing additional properties and values to complete the connection, the connect string options must be structured in the following manner:
<prop>: the name of the property
<val>: the value for the property
- , : any set of connect string options must begin and end with a comma.
- A comma can be omitted from the end of the connect string options.
=: property names and values must be separated with an equal sign (=).
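The rules above can be sketched as a small helper that formats a set of properties. This is an illustrative sketch only: the function name is hypothetical, and it assumes that individual property/value pairs are separated by commas, with the optional trailing comma omitted.

```python
def build_connect_options(props):
    """Format properties as ',name1=value1,name2=value2'."""
    if not props:
        # No additional options: the connect string options are optional.
        return ""
    pairs = [f"{name}={value}" for name, value in props.items()]
    # Leading comma is required; the trailing comma is omitted.
    return "," + ",".join(pairs)


# Illustrative property names and values, not taken from this page.
options = build_connect_options({"DATABASE": "sales", "CHARSET": "UTF8"})
```

With these inputs, `options` is `,DATABASE=sales,CHARSET=UTF8`.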
When SSL is enabled for the connection, the following is automatically appended to the connect string options:
NOTE: When SSL is enabled, Trifacta assumes that the Teradata database has been configured to receive SSL connections. Some additional configuration may be required within Teradata. For more information, please see the documentation that was provided with your Teradata distribution.
You can connect to a specific database using the following connect string option:
If no database is specified, the connection attempts to connect to the
This connection uses the following driver:
- Driver version:
- Driver documentation: https://developer.teradata.com/connectivity/reference/jdbc-driver
Duplicate row error
This error occurs when duplicate rows are being inserted during publishing to Teradata.
Workaround: All inserted rows must be unique, or the Teradata tables must be
- After you create your connection, load a small dataset based on a table in the connected Teradata database. See Import Data Page.
- Perform a few simple transformations to the data. Run the job. See Transformer Page.
- Verify the results.
For more information, see Verify Operations.