When you specify a job, you can pass a set of property values to the running environment to apply to the execution of that job. Overrides are applied to individual jobs.
These property values override any settings applied to the project.
Dataflow Execution Properties
Default execution settings:
By default, your job runs in the us-central1 region on an n1-standard-1 machine. As needed, you can change the geo location and the machine where your job is executed.
Tip: You can change the default values for the following in your project settings. See Project Settings Page.
Making changes to these settings can affect performance times for executing your job.
Tip: For more information on how the following settings affect your jobs, see Run Job on Cloud Dataflow.
Setting | Description |
---|---|
Regional Endpoint | A regional endpoint handles execution details for your |
Zone | A subsection of a region, a zone contains specific resources for that region. Select |
Machine Type | Choose the type of machine on which to run your job. The default is Note: not all machine types supported directly through |
For more information on these regional endpoints, see https://cloud.google.com/dataflow/docs/concepts/regional-endpoints.
For more information on machine types, see https://cloud.google.com/compute/docs/machine-types.
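The override behavior described on this page (job-level values take precedence over project settings, which in turn replace the built-in defaults) can be sketched as a simple layered merge. This is a minimal illustration only: the dictionary keys, the `resolve_settings` helper, and the sample project and job values are hypothetical and do not correspond to an actual API of the product.

```python
# Defaults quoted on this page. The key names are hypothetical.
PAGE_DEFAULTS = {"region": "us-central1", "machine_type": "n1-standard-1"}


def resolve_settings(project_settings, job_overrides):
    """Return effective settings: defaults < project settings < job overrides."""
    effective = dict(PAGE_DEFAULTS)
    effective.update(project_settings)
    # An override left unset (None) falls back to the project or default value.
    effective.update({k: v for k, v in job_overrides.items() if v is not None})
    return effective


# Hypothetical project-level setting and per-job override.
project = {"region": "europe-west1"}
job = {"machine_type": "n1-standard-4", "region": None}

effective = resolve_settings(project, job)
print(effective)
```

In this sketch the job keeps the project's region because no region override was supplied, while the machine type comes from the job-level override.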
Advanced settings:
Setting | Description |
---|---|
VPC Network mode | If the network mode is set to As needed, you can override the default settings configured for your project for this job. Set this value to
For more information: |
Network | To use a different VPC network, enter the name of the VPC network to use as an override for this job. Click Save to apply the override. |
Subnetwork | To specify a different subnetwork, enter the URL of the subnetwork. The URL should be in the following format:
where:
If you have access to another project within your organization, you can execute your
where:
|
For more information on these settings, see Project Settings Page.
Setting | Description |
---|---|
Worker IP address configuration | If the VPC Network mode is set to
| |
Autoscaling Algorithms | The type of algorithm to use to scale the number of Google Compute Engine instances to accommodate the size of your job. Possible values:
| |
Initial number of workers | Number of Google Compute Engine instances with which to launch the job. This number may be adjusted as part of job execution. This number must be an integer between 1 and 1000, inclusive. | |
Maximum number of workers | Maximum number of Google Compute Engine instances to use during execution. This value must be greater than the initial number of workers and must be an integer between | |
Service account | Email address of the service account under which to run the job. | |
Labels | Create or assign labels to apply to the billing for the
For more information, see https://cloud.google.com/resource-manager/docs/creating-managing-labels. |
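The worker-count constraints in the table above can be checked before a job is submitted. The following is a minimal sketch, assuming a hypothetical `validate_worker_counts` helper that is not part of the product. It enforces only the rules stated on this page: the initial number of workers is an integer between 1 and 1000, inclusive, and the maximum is an integer greater than the initial number. The upper bound on the maximum is not checked because it is not given here.

```python
def validate_worker_counts(initial_workers, max_workers):
    """Return a list of error messages; an empty list means the values pass."""
    errors = []
    for name, value in (("initial", initial_workers), ("maximum", max_workers)):
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, int):
            errors.append(f"{name} number of workers must be an integer")
    if errors:
        return errors
    if not 1 <= initial_workers <= 1000:
        errors.append("initial number of workers must be between 1 and 1000, inclusive")
    if max_workers <= initial_workers:
        errors.append("maximum number of workers must be greater than the initial number")
    return errors


# Hypothetical values: the first pair passes, the second violates both rules.
print(validate_worker_counts(2, 10))
print(validate_worker_counts(0, 0))
```

Returning a list of messages instead of raising on the first failure lets a caller report every problem with the submitted values at once.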
Notes on behavior: