Data Lake Tables
These job options are used when writing to data lake tables or Upsolver-managed tables.
Tables created within Upsolver using your metastore connection are considered Upsolver-managed tables. These tables can still be queried externally. For example, if you create an AWS Glue Data Catalog table within Upsolver, you can query it both within Upsolver and from your Athena console.
Data lake table options:
General job options:
AGGREGATION_PARALLELISM (editable)
Type: integer
Default: 1
(Optional) Only supported when the query contains aggregations. Formerly known as "output sharding."
ADD_MISSING_COLUMNS
Type: Boolean
Default: false
(Optional) When true, columns that don't exist in the target table are added automatically when encountered.
When false, you cannot use SELECT * within the SELECT statement of your transformation job.
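To make the idea of adding missing columns concrete, here is a minimal conceptual sketch in Python. It is not Upsolver's implementation: the function `write_row` and its arguments are hypothetical, and the behavior with the flag off (unknown fields are simply left out) is a simplifying assumption of the sketch, not something this page states.

```python
def write_row(table_columns, row, add_missing_columns=False):
    """Return (updated column list, row projected onto those columns).

    table_columns: ordered column names of a hypothetical target table.
    row: dict mapping column name to value for one output row.
    """
    columns = list(table_columns)
    if add_missing_columns:
        # Append any column the table has not seen yet, in arrival order.
        for name in row:
            if name not in columns:
                columns.append(name)
    # Project the row onto the known columns; absent values become None.
    # Assumption of this sketch: with the flag off, unknown fields are ignored.
    projected = {name: row.get(name) for name in columns}
    return columns, projected
```

In this model, a row carrying a new `city` field widens the column list only when `add_missing_columns` is true, which is why a wildcard selection needs the option enabled: the set of output columns is not known in advance.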
FLATTEN_PATHS
Type: Array<String>
Default: ()
(Optional) Specifies the arrays used to flatten the output rows. See the guide for details and examples.
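As a rough illustration of what flattening by an array means, the Python sketch below turns one row holding an array into one row per array element. The helper `flatten_rows` and the field names are hypothetical, and details such as how empty arrays or multiple paths are handled are assumptions of the sketch; the guide remains the authority on actual behavior.

```python
def flatten_rows(row, path):
    """Yield one output row per element of the array stored at row[path].

    row: dict for a single input row.
    path: name of the array field to flatten (a hypothetical top-level field).
    """
    items = row.get(path) or []
    for item in items:
        # Copy every other field unchanged, then replace the array
        # with the single element for this output row.
        flat = {key: value for key, value in row.items() if key != path}
        flat[path] = item
        yield flat
```

For example, `list(flatten_rows({"order_id": 7, "items": ["a", "b"]}, "items"))` yields two rows that share `order_id` and carry one item each. In this sketch an empty or missing array produces no rows.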