When you specify a |
These property values override any settings applied to the project.
Dataflow Execution Properties |
Default execution settings:
By default, runs your job in the
us-central1
region on an n1-standard-1
machine. As needed, you can change the geo location and the machine where your job is executed.
Tip: You can change the default values for the following in your Execution Settings. See Execution Settings Page. |
Making changes to these settings can affect performance times for executing your job. |
Tip: For more information on how the following settings affect your jobs, see Run Job on Cloud Dataflow. |
Setting | Description |
---|---|
Regional Endpoint | A regional endpoint handles execution details for your |
Zone | A sub-section of region, a zone contains specific resources for a given region. Select |
Machine Type | Choose the type of machine on which to run your job. The default is Note: not all machine types supported directly through |
For more information on these regional endpoints, see https://cloud.google.com/dataflow/docs/concepts/regional-endpoints.
For more information on machine types, https://cloud.google.com/compute/docs/machine-types.
Advanced settings:
Setting | Description | ||
---|---|---|---|
VPC Network mode | If the network mode is set to As needed, you can override the default settings configured for your project for this job. Set this value to
For more information: | ||
Network | To use a different VPC network, enter the name of the VPC network to use as an override for this job. Click Save to apply the override. | ||
Subnetwork | To specify a different subnetwork, enter the URL of the subnetwork. The URL should be in the following format:
where:
If you have access to another project within your organization, you can execute your
where:
Click Save to apply the override. |
For more information on these settings, see Execution Settings Page.
Setting | Description | |
---|---|---|
Worker IP address configuration | If the VPC Network mode is set to
| |
Autoscaling Algorithms | The type of algorithm to use to scale the number of Google Compute Engine instances to accommodate the size of your job. Possible values:
| |
Initial number of workers | Number of Google Compute Engine instances with which to launch the job. This number may be adjusted as part of job execution. This number must be an integer between 1 and 1000 , inclusive. | |
Maximum number of workers | Maximum number of Google Compute Engine instances to use during execution. This value must be greater than the initial number of workers and must be an integer between | |
Service account | Every Optionally, you can specify a different service account under which to run your job.
For more information on service account usage and requirements, see Google Service Account Management | |
Labels | Create or assign labels to apply to the billing for the
For more information, see https://cloud.google.com/resource-manager/docs/creating-managing-labels. |
Notes on behavior: