Page tree

 

Contents:


Trifacta® Wrangler Enterprise supports detailed monitoring of a job throughout each phase of its execution.  

Limitations:

  • Applies only to ingest and publishing jobs
  • Applies only to JDBC datasets

Monitoring Phases

These phases apply to ingest and publishing jobs. Information on them is surfaced in the application.

Connect

In the Connect phases, Trifacta Wrangler Enterprise uses the specified connection for the flow to connect to the source of the job. 

NOTE: Errors in this phase typically involve issues in the connection definition or in the network configuration or availability.


Request

After the platform has been able to connect to the datastore, the Request phase entails the submission of the request to the datastore for the assets. For example, for JDBC-based datasets, this phase covers the SQL query of the database through the response that the query was successfully executed.

NOTE: Errors in this phase typically reflect errors in the SQL query, which can include renaming or moving of assets in the datastore.

NOTE: If assets are retrieved via custom SQL query, you may need to review the query and validate it through the Trifacta application. For more information, see Create Dataset with SQL.


Transfer

This phase covers the transfer of assets from the datastore to Trifacta Wrangler Enterprise.

NOTE: Errors in this phase typically indicate issued with permissions.


Prepare

NOTE: This phase applies to publishing jobs only.


Depending on the destination, the Prepare phase includes the creation of temporary tables, generation of manifest files, and the fetching of extra connections for parallel data transfer.

Process

After the data has been transferred to the platform, this phase covers the processing of cleanup after data transfer, including the dropping of temporary tables or copying data within the instance.


Enable

The base feature is enabled by default.

"feature.enableJobMonitoring": true,

Configure

Optionally, you can enable the following capabilities in the Trifacta applicationYou can apply this change through the Admin Settings Page (recommended) or trifacta-conf.json. For more information, see Platform Configuration Methods.

Enable phases in Data sources tab

To display separate columns in the Data sources tab of the Job Details page for each phase on an ingest job, set the following parameter to true:

"jobMonitoring.enablePhasesInDatasourcesTable": true,

Enable phases in Outputs tab

To display separate columns in the Outputs tab of the Job Details page for each phase for a publish job, set the following parameter to true:

"jobMonitoring.enablePhasesInOutputsTable": true,

Save your changes and restart the platform.

Monitoring Jobs in the Application

When the base feature is enabled, you can do the following:

  • Track phases of progress by hovering over the job in progress in the Job Details page. 
  • Review new and better detail in the Job Details page. Click View Details for the job listing.

For more information, see Job Details Page.

Datasources tab - Phased ingest monitoring

If job monitoring phases have been enabled for the Datasources tab, the tab looks like the following:

Figure: Job monitoring in the Datasources tab

View details:

If an ingest job succeeds or fails, you can click View details in the status column for additional information on each phase of the ingest job:

Figure: View details on monitoring ingest jobs

Output destinations tab - Phased publishing monitoring

If job monitoring phases have been enabled for the Output Destinations tab, the tab looks like the following:

Figure: Job monitoring in the Datasources tab

View details:

If a publishing job succeeds or fails, you can click View details in the status column for additional information on each phase of the publishing job:

Figure: View details on monitoring ingest jobs

This page has no comments.