Trifacta® Wrangler Enterprise supports detailed monitoring of a job throughout each phase of its execution.
- Applies only to ingest and publishing jobs
- Applies only to JDBC datasets
These phases apply to ingest and publishing jobs. Information on them is surfaced in the application.
In the Connect phases, the Trifacta platform uses the specified connection for the flow to connect to the source of the job.
NOTE: Errors in this phase typically involve issues in the connection definition or in the network configuration or availability.
After the platform has been able to connect to the datastore, the Request phase entails the submission of the request to the datastore for the assets. For example, for JDBC-based datasets, this phase covers the SQL query of the database through the response that the query was successfully executed.
NOTE: Errors in this phase typically reflect errors in the SQL query, which can include renaming or moving of assets in the datastore.
NOTE: If assets are retrieved via custom SQL query, you may need to review the query and validate it through the Trifacta application. For more information, see Create Dataset with SQL.
This phase covers the transfer of assets from the datastore to the Trifacta platform.
NOTE: Errors in this phase typically indicate issued with permissions.
NOTE: This phase applies to publishing jobs only.
Depending on the destination, the Prepare phase includes the creation of temporary tables, generation of manifest files, and the fetching of extra connections for parallel data transfer.
After the data has been transferred to the platform, this phase covers the processing of cleanup after data transfer, including the dropping of temporary tables or copying data within the instance.
The base feature is enabled by default.
Optionally, you can enable the following capabilities in the Trifacta application. You can apply this change through the Admin Settings Page (recommended) or
trifacta-conf.json. For more information, see Platform Configuration Methods.
Enable phases in Data sources tab
To display separate columns in the Data sources tab of the Job Details page for each phase on an ingest job, set the following parameter to
Enable phases in Outputs tab
To display separate columns in the Outputs tab of the Job Details page for each phase for a publish job, set the following parameter to
Save your changes and restart the platform.
Monitoring Jobs in the Application
When the base feature is enabled, you can do the following:
- Track phases of progress by hovering over the job in progress in the Job Details page.
- Review new and better detail in the Job Details page. Click View Details for the job listing.
For more information, see Job Details Page.
Datasources tab - Phased ingest monitoring
If job monitoring phases have been enabled for the Datasources tab, the tab looks like the following:
If an ingest job succeeds or fails, you can click View details in the status column for additional information on each phase of the ingest job:
Output destinations tab - Phased publishing monitoring
If job monitoring phases have been enabled for the Output Destinations tab, the tab looks like the following:
If a publishing job succeeds or fails, you can click View details in the status column for additional information on each phase of the publishing job:
This page has no comments.