When publishing to Hive, please complete the following steps to configure the table and settings to apply to the publish action through the Run Job page.
NOTE: Some Trifacta data types may be exported to Hive using different data types. For more information on how types are exported to Hive, see Hive Data Type Conversions.
- Select location: Navigate the Hive browser to select the database and table to which to publish.
- To create a new table, click Create a new table.
- Select table options:
- Table name:
- New table: enter a name for it. You may use a pre-existing table name, and schema checks are performed against it.
- Existing table: you cannot modify the name.
Output database: To change the database to which you are publishing, click the Hive icon in the sidebar. Select a different database.
NOTE: You cannot publish to a Hive database that is empty. The database must contain at least one table.
Publish actions: Select one of the following.
NOTE: If you are writing to unmanaged tables in Hive, create and drop & load actions are not supported.
- Create new table every run: Each run generates a new table with a timestamp appended to the name.
Append to this table every run: Each run adds any new results to the end of the table.
Tip: Optionally, users can be permitted to publish to Hive staging schemas to which they do not have full create and drop permissions. This feature must be enabled. For more information, see Configure for Hive.
When enabled, the name of the staging DB must be inserted into your user profile. See User Profile Page.
- Truncate the table every run: With each run, all data in the table is truncated and replaced with any new results.
- Drop the table every run: With each run, the table is dropped (deleted), and all data is deleted. A new table with the same name is created, and any new results are added to it.
- Table name:
- To save the publishing action, click Add.
This page has no comments.