- Applies only to ingest and publishing jobs
- Applies only to JDBC datasets
These phases apply to ingest and publishing jobs. Information on them is surfaced in the application.
In the Connect phases, Trifacta Wrangler Enterprise uses the specified connection for the flow to connect to the source of the job.
NOTE: Errors in this phase typically involve issues in the connection definition or in the network configuration or availability.
After the platform has been able to connect to the datastore, the Request phase entails the submission of the request to the datastore for the assets. For example, for JDBC-based datasets, this phase covers the SQL query of the database through the response that the query was successfully executed.
NOTE: Errors in this phase typically reflect errors in the SQL query, which can include renaming or moving of assets in the datastore.
NOTE: If assets are retrieved via custom SQL query, you may need to review the query and validate it through the Trifacta application. For more information, see Create Dataset with SQL.
This phase covers the transfer of assets from the datastore to the platform.
NOTE: Errors in this phase typically indicate issued with permissions.
NOTE: This phase applies to publishing jobs only.
Depending on the destination, the Prepare phase includes the creation of temporary tables, generation of manifest files, and the fetching of extra connections for parallel data transfer.
After the data has been transferred to the platform, this phase covers the processing of cleanup after data transfer, including the dropping of temporary tables or copying data within the instance.
The base feature is enabled by default.
Optionally, you can enable the following capabilities in the Trifacta application. You can apply this change through the Admin Settings Page (recommended) or
trifacta-conf.json. For more information, see Platform Configuration Methods.
Enable phases in Data sources tab
To display separate columns in the Data sources tab of the Job Details page for each phase on an ingest job, set the following parameter to
Enable phases in Outputs tab
To display separate columns in the Outputs tab of the Job Details page for each phase for a publish job, set the following parameter to
Save your changes and restart the platform.
Monitoring Jobs in the Application
When the base feature is enabled, you can monitor jobs in the following locations.
- Track phases in the Jobs panel in Flow View. Hover the mouse over the link to the job.
- See Flow View Page.
NOTE: This feature may require enablement in your deployment. See Configure JDBC Ingestion.
For long-loading datasets, you can track the progress of the import through the Import Data page as you specify the import. See Import Data Page.
After specifying the import, if the data is continuing to be ingested, you can track progress through the Library page. See Library Page.
Dataset Details Page:
In the Dataset Details page, you can monitor the ingest progress. Hover over the Status link.
Job Details Page
- Track phases of progress by hovering over the job in progress in the Job Details page.
- Review new and better detail in the Job Details page. Click View Details for the job listing.
- For more information, see Job Details Page.
Datasources tab - Phased ingest monitoring
If job monitoring phases have been enabled for the Datasources tab, the tab looks like the following:
If an ingest job succeeds or fails, you can click View details in the status column for additional information on each phase of the ingest job:
Output destinations tab - Phased publishing monitoring
If job monitoring phases have been enabled for the Output Destinations tab, the tab looks like the following:
If a publishing job succeeds or fails, you can click View details in the status column for additional information on each phase of the publishing job:
This page has no comments.