To improve the performance of your Hive or relational connections, you can enable custom SQL queries, which push the initial filtering of table rows and columns back to the database, where this task is performed more efficiently. Instead of loading the entire table into the Trifacta® application and then filtering it through the Transformer page, you can insert basic SQL commands as part of your relational queries to collect only the rows and columns of interest from the source.
When enabled, custom SQL query is available for Hive, Redshift, and all relational sources.
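The difference between filtering in the application and filtering at the source can be sketched outside the product. The following is a minimal illustration only: it uses Python's built-in sqlite3 module as a stand-in for a relational source, and the `orders` table, its columns, and the query text are hypothetical. It does not show how the Trifacta application issues queries internally.

```python
import sqlite3

# Hypothetical in-memory table standing in for a relational source.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER, region TEXT, amount REAL, notes TEXT)"
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?, ?)",
    [
        (1, "EMEA", 120.0, "n/a"),
        (2, "APAC", 75.5, "n/a"),
        (3, "EMEA", 310.0, "n/a"),
        (4, "AMER", 42.0, "n/a"),
    ],
)

# Without pushdown: every row and column leaves the database,
# and the client discards what it does not need.
all_rows = conn.execute("SELECT * FROM orders").fetchall()
filtered_in_app = sorted((r[0], r[2]) for r in all_rows if r[1] == "EMEA")

# With pushdown: the database returns only the rows and columns of interest.
# CUSTOM_SQL is an illustrative statement, not one taken from the product.
CUSTOM_SQL = "SELECT id, amount FROM orders WHERE region = 'EMEA' ORDER BY id"
pushed_down = conn.execute(CUSTOM_SQL).fetchall()

print(len(all_rows), "rows transferred without pushdown")
print(len(pushed_down), "rows transferred with pushdown")
```

Both approaches yield the same filtered result, but the pushed-down query moves fewer rows and columns out of the database. The gap grows with the size of the source table.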
Locate the following settings:
Set to true to enable the SQL pushdown feature. By default, this feature is enabled.
When set to true, you can insert multi-line statements in your SQL pushdown queries. The default is
NOTE: Use of multi-line SQL has limitations. See Create Dataset with SQL.
- Save the file.
- As needed, you can configure the maximum permitted load time before timeout from the application. See Configure Application Limits.
- Restart the platform. See Start and Stop the Platform.
Use Custom SQL Queries
When custom SQL query is enabled, you can enter customized SQL statements in the imported dataset page as part of the import process. See Import Data Page.
For examples, see Create Dataset with SQL.
After a dataset has been imported using custom SQL, you can edit the SQL as needed. See Dataset Details Page.