To improve the performance of your Hive or relational connections, you can enable custom SQL queries, which push the initial filtering of table rows and columns back to the database, where this task is performed more efficiently. Instead of loading the entire table into the platform and then filtering it through the Transformer page, you can insert basic SQL commands as part of your relational queries to collect only the rows and columns of interest from the source.

When enabled, custom SQL queries are available for all relational sources.
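
The difference between filtering after load and filtering at the source can be sketched with Python's built-in sqlite3 module. This is only an illustration of the pushdown idea, not the platform's own mechanism: the orders table, its columns, and the sample rows are hypothetical.

```python
import sqlite3

# Hypothetical source table, held in an in-memory SQLite database.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER, region TEXT, amount REAL, notes TEXT)"
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?, ?)",
    [
        (1, "west", 120.0, "a"),
        (2, "east", 80.0, "b"),
        (3, "west", 45.5, "c"),
        (4, "north", 300.0, "d"),
    ],
)

# Without pushdown: every row and column is transferred, then filtered locally.
all_rows = conn.execute("SELECT * FROM orders ORDER BY id").fetchall()
filtered_locally = [(row[0], row[2]) for row in all_rows if row[1] == "west"]

# With pushdown: the database returns only the rows and columns of interest.
custom_sql = "SELECT id, amount FROM orders WHERE region = 'west' ORDER BY id"
pushed_down = conn.execute(custom_sql).fetchall()

print(len(all_rows), "rows transferred without pushdown")
print(len(pushed_down), "rows transferred with pushdown")
```

Both approaches produce the same result set, but the second transfers only the matching rows and the two requested columns.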

Limitations

See Create Dataset with SQL.

Enable

Steps:

  1. Locate the following setting:

    Enable custom SQL Query

    Setting: enabled
    Description: Set to true to enable the SQL pushdown feature. By default, this feature is enabled.
  2. Locate the following setting:

    "webapp.connectivity.customSQLQuery.enableMultiStatement": false,

    Setting: enableMultiStatement
    Description: When set to true, you can insert multi-line statements in your SQL pushdown queries. The default is false.

    NOTE: Use of multi-line SQL has limitations. See Create Dataset with SQL.

  3. Save the file. 
  4. As needed, you can configure the maximum permitted load time before timeout from the application. See Configure Application Limits.
  5. Restart the platform. See Start and Stop the Platform.
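
If you prefer to script the change from step 2 instead of editing the file by hand, the following is a minimal sketch. It assumes, hypothetically, that the configuration file is JSON and that the dotted setting name maps to nested objects. The file path you pass in and the helper names are illustrative only and are not part of the product.

```python
import json
from pathlib import Path

# Setting name taken from step 2 above.
MULTI_STATEMENT_KEY = "webapp.connectivity.customSQLQuery.enableMultiStatement"


def set_dotted(config: dict, dotted_key: str, value) -> None:
    """Set config[a][b][c] = value for the dotted key 'a.b.c'.

    Assumption: each dot in the key corresponds to one level of nesting.
    Missing intermediate objects are created.
    """
    *parents, leaf = dotted_key.split(".")
    node = config
    for part in parents:
        node = node.setdefault(part, {})
    node[leaf] = value


def enable_multi_statement(config_path: Path) -> dict:
    """Turn on multi-statement SQL in the (hypothetical) JSON config file."""
    config = json.loads(config_path.read_text())
    set_dotted(config, MULTI_STATEMENT_KEY, True)
    config_path.write_text(json.dumps(config, indent=2))
    return config
```

Back up the configuration file before running a script like this. The platform restart in step 5 is still required.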

Use Custom SQL Queries

When custom SQL query is enabled, you can enter customized SQL statements in the imported dataset page as part of the import process. See Import Data Page.

For examples, see Create Dataset with SQL.

After a dataset has been imported using custom SQL, you can edit the SQL as needed. See Dataset Details Page.