Feature Availability: This feature may not be available in all product editions.

Amazon Athena is a service that enables data analysts to run interactive queries in the AWS public cloud on data stored in Amazon Simple Storage Service (S3). For more information, see https://aws.amazon.com/athena/.

Early Preview connection: This connection is in early preview. It is read-only and available only in SaaS product editions. For more information on early previews, see Early Preview Connection Types.

Limitations and Requirements

NOTE: During normal selection or import of an entire table, you may encounter an error indicating a problem with a specific column. Since some tables require filtering based on a particular column, data from them can only be ingested using custom SQL statements. In this case, the problematic column can be used as a filter in the WHERE clause of a custom SQL statement to ingest the table.

  • For more information, please consult the CData driver documentation for the specific table.
  • For more information on using custom SQL, see Create Dataset with SQL.
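The filtering approach in the note above can be sketched as a small helper that assembles a custom SQL statement around the problematic column. This is a minimal illustration, not part of the product: the function name `build_filtered_query` and the table, column, and value shown are hypothetical, and the exact SQL dialect accepted depends on the driver.

```python
def build_filtered_query(table, filter_column, value):
    """Build a custom SQL statement that filters on the column named in the import error.

    All names passed in are hypothetical examples, not real Athena objects.
    """
    for identifier in (table, filter_column):
        # Accept only plain identifiers (letters, digits, underscores)
        if not identifier.replace("_", "").isalnum():
            raise ValueError(f"unexpected identifier: {identifier!r}")
    # Double any single quotes so the value stays a valid SQL string literal
    escaped = str(value).replace("'", "''")
    return f"SELECT * FROM {table} WHERE {filter_column} = '{escaped}'"


# Hypothetical table "orders" that requires filtering on "region"
query = build_filtered_query("orders", "region", "EMEA")
print(query)
```

The resulting statement can then be pasted into the custom SQL dialog when creating the dataset.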

NOTE: For filtering date columns, this connection type supports a set of literal functions on dates. You can use these to reduce the volume of data extracted from the database using a custom SQL query. For more information, see the pg_dateliteralfunctions.htm page in the driver documentation for this connection type.

  • Supports authentication using an AWS access key and secret key.

Create Connection

via Trifacta application

When you create the connection, please review the following properties and specify them accordingly:

Connection Property | Description
AWS Region

The hosting region for your Amazon Web Services resources.

For more information, see https://cdn.cdata.com/help/FKG/jdbc/RSBAmazonAthena_p_AWSRegion.htm

S3 Staging Directory

Specifies the location in your S3 bucket where Amazon Athena can store the results of a query.

Database

The name of the Amazon Athena database.

Connect String Options

The following are the connect string options passed to the JDBC driver by default:

Timeout=0;CleanQueryResults=true;

Timeout - This value sets the connection timeout in seconds. Setting this value to 0 disables timeouts.

CleanQueryResults - The results of every query that runs on Amazon Athena are stored in the S3 staging directory. Setting CleanQueryResults to true clears the query results from the S3 staging directory after ingestion.

Access Key ID

AWS account access key. This value is accessible from your AWS security credentials page.

Secret Access Key

AWS account secret key. This value is accessible from your AWS security credentials page.

Default Column Data Type Inference

Leave this value as Enabled.
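To make the format of the Connect String Options value concrete, the sketch below splits the default string into its key/value pairs. The helper name `parse_connect_string` is hypothetical and exists only for illustration; it is not something the product or the driver exposes.

```python
def parse_connect_string(options):
    """Split a semicolon-delimited connect string into a dict of option names to values."""
    pairs = {}
    for item in options.split(";"):
        item = item.strip()
        if not item:
            # Skip the empty piece left by a trailing semicolon
            continue
        key, sep, value = item.partition("=")
        if not sep:
            raise ValueError(f"option without '=': {item!r}")
        pairs[key.strip()] = value.strip()
    return pairs


# The default options documented for this connection type
defaults = parse_connect_string("Timeout=0;CleanQueryResults=true;")
print(defaults["Timeout"])            # "0" disables timeouts
print(defaults["CleanQueryResults"])  # "true" clears staged results after ingestion
```

Each option is a `Name=value` pair terminated by a semicolon, so additional driver options can be appended in the same form.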

For more information, see the driver documentation: https://cdn.cdata.com/help/FKG/jdbc/.

via API

Depending on your product edition, you can also create connections of this type through the API. Key information:

"vendor": "amazon_athena",
"vendorName": "Amazon Athena",
"type": "jdbc"

For more information, see the Trifacta API Reference docs for your product edition: Enterprise | Professional | Premium
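As a rough sketch of how the key information above might fit into a request body, the example below assembles a JSON payload. Only `vendor`, `vendorName`, and `type` come from this page. The `name` and `params` fields, the parameter key names, and all values are assumptions made for illustration; consult the API reference for the actual request schema and endpoint.

```python
import json


def build_connection_payload(name, region, s3_staging_dir, database):
    """Assemble an illustrative request body for an Amazon Athena connection."""
    return {
        # Documented key information for this connection type
        "vendor": "amazon_athena",
        "vendorName": "Amazon Athena",
        "type": "jdbc",
        # Hypothetical fields: names and nesting are assumptions, not the confirmed schema
        "name": name,
        "params": {
            "awsRegion": region,
            "s3StagingDirectory": s3_staging_dir,
            "database": database,
            "connectStringOptions": "Timeout=0;CleanQueryResults=true;",
        },
    }


# All argument values below are placeholders
payload = build_connection_payload(
    name="athena-example",
    region="us-west-2",
    s3_staging_dir="s3://example-bucket/athena-results/",
    database="example_db",
)
print(json.dumps(payload, indent=2))
```

Credentials are deliberately left out of the sketch; the access key and secret key should be supplied in whatever form the API reference specifies.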

Data Type Conversions

For more information, see the driver documentation: https://cdn.cdata.com/help/FKG/jdbc/.
