...
Parameter | Description | Applicable CLI Commands | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
host | (Required) The server and port number of the
Replace this value with the host and port of the running
localhost:3005 is assumed.
| All | ||||||||||||||
conn_name | Internal name of the connection. This name is referenced in your CLI scripts. It should be a single value without spaces.
| load_data , publish , truncate_and_load | ||||||||||||||
conn_id | The internal identifier for the connection. When a connection is created, it is assigned an internal numeric identifier. This ID or the
| publish , load_data , truncate_and_load | ||||||||||||||
job_type | The execution environment in which to run the job:
For more information on these running environments, see Running Environment Options. | run_job | ||||||||||||||
job_id | The internal identifier for the job. This value can be retrieved from the output of a completed run_job command. | get_job_status , publish ,get_publications ,load_data | ||||||||||||||
profiler | When on , profiling of your job is enabled. Default is off . | run_job | ||||||||||||||
data | Full UNIX path to the source TSV file. This file contains a URL pointing to the actual Hive or HDFS source: one TSV file for each job run. Executing user must have access to this file. | run_job | ||||||||||||||
script | Full UNIX path from the
| run_job | ||||||||||||||
publish_action | (Optional) Defines the action taken on second and subsequent publish operations:
| run_job | ||||||||||||||
header | (Optional), The output for a CSV job with
| run_job | ||||||||||||||
single_file | (Optional) When true , CSV or JSON outputs are written to a single file. Default is false . | run_job | ||||||||||||||
output_path | (Required) Defines the fully qualified URI to where the job results are written, as in the following examples:
This parameter specifies the base filename. If you are publishing files, the Protocol is set in
| run_job | ||||||||||||||
output_format | Accepted values:
For | run_job | ||||||||||||||
database | Name of Redshift or Hive database to which you are publishing or loading. | publish ,load_data | ||||||||||||||
table | The table of the database to which you are publishing or loading. | publish ,load_data | ||||||||||||||
publish_format | The format of the output file from which to publish to Hive or Redshift tables. Accepted values: | publish , get_publications | ||||||||||||||
publish_opt_file | Path to file containing definitions for multiple file or table targets to which to write the job's results. For more information, see CLI Publishing Options File. | run_job | ||||||||||||||
skip_publish_validation | By default, the CLI automatically checks for schema validation when generating results to a pre-existing source. If this flag is set, schema validation is skipped on results output. | run_job |
...