Contents:
An exported flow can be imported into Dataprep by Trifacta.
Limitations
You cannot import flows that were exported before Release 6.8.
NOTE: You cannot import flows into a version of the product that is earlier than the one from which you exported it. For example, if you develop a flow on free Designer Cloud Educational, which is updated frequently, you may not be able to import it into other editions of the product, which are updated less frequently.
Imported flows do not contain the following objects:
NOTE: Depending on the import environment, some objects in the flow definition may be incompatible. For example, the connection type may not be valid, or a datasource may not be reachable. In these cases, the objects may be removed from the flow, or you may have to fix a reference in the object definition. After import, you should review the objects in the flow to verify.
- Reference datasets
- Samples
Imported datasets that are ingested into backend storage for Dataprep by Trifacta may be broken after the flow has been imported into another instance. These datasets must be reconnected to their source. You cannot use import mapping rules to reconnect these data sources. This issue applies to the following data sources:
- Microsoft Excel workbooks and worksheets. See Import Excel Data.
- Google Sheets. See Import Google Sheets Data.
Import into a new Google Cloud Platform project
When a flow is imported from one Google Cloud Platform (GCP) project into another GCP project, the underlying data must be accessible through the imported flow in the new GCP project. This means:
- Source data must be accessible to all users who have access to the imported flow. Please see Cross-Project Data Access for more details.
- If the flow is shared with other users of the project, they must have access to the underlying data.
- Samples must be re-created, since they are not included in the import.
NOTE: If the above requirements are not met for flows imported into a different project, users may experience 403 Access Forbidden errors when attempting to connect to the flow or its underlying assets.
NOTE: If you import a flow from Dataprep by Trifacta Premium into Dataprep by Trifacta Standard, you may encounter soft validation errors during job execution. If the imported flow uses custom VPC mode, then the job execution may fail, since network may be inaccessible. The workaround is 1) to set the VPC mode to Auto or 2) set accessible VPC options before you run your job. See Dataflow Execution Settings.
NOTE: If your imported flow contains an output object where dataflow execution overrides have been specified, then the overrides are applied to any jobs executed in the project where the flow was imported. Property values that do not appear in the imported output object are taken from your execution settings. See Dataflow Execution Settings.
Import
NOTE: When you import a flow, you must import a ZIP file containing the JSON definition.
Steps:
- Export the flow from the source system. See Export Flow.
- Login to the import system, if needed.
- Click Flows.
From the context menu in the Flow page, select Import Flow.
Tip: You can import multiple flows (ZIP files) through the file browser or through drag-and-drop. Press
CTRL
/COMMAND
+ click orSHIFT
+ click to select multiple files for import.- Select the ZIP file containing the exported flow. Click Open.
Tip: When you import the flow, click the Warnings link to review the list of objects that must be remapped.
The flow is imported and available for use in the Flows page. After import:
- You may need to reconnect your imported datasets to data sources that are available in the new workspace or project. See Reconnect Flow to Source Data.
- You may also need to reconnect your outputs. See Reconnect Flow to Outputs.
This page has no comments.