AWS Glue Data Catalog

This quickstart describes how to create an AWS Glue Data Catalog connection.

Create a connection to the AWS Glue Data Catalog

Before you begin copying your data to Amazon Athena, you must ingest it into the AWS Glue Data Catalog. Doing so preserves your existing schemas, partitions, and other relevant metadata.

Your connection is persistent, so you won't need to re-create it for every job. The connection is also shared with other users in your organization.

Here’s the code to create a connection to the Glue Data Catalog:

// Syntax
CREATE GLUE_CATALOG CONNECTION <connection_identifier>
    AWS_ROLE = '<role_arn>';

// Example
CREATE GLUE_CATALOG CONNECTION production_catalog
    AWS_ROLE = 'arn:aws:iam::111111111111:role/<upsolver-role-*>';

This code creates a Glue Data Catalog connection named production_catalog and uses the AWS_ROLE option to tell Upsolver which IAM role to use.

For more information on roles, see the Amazon guide to Roles terms and concepts.
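
Because the connection is persistent, later statements can refer to it by name instead of repeating the role. The following is a minimal sketch rather than a definitive example: it assumes that tables in the catalog are addressed as <connection>.<database>.<table>, and the database name my_database, the table name my_table, and the $event_date partition column are hypothetical. Check the SQL command reference for the exact syntax before using it.

```sql
// Sketch only: my_database and my_table are hypothetical names, and the
// <connection>.<database>.<table> naming is an assumption to verify
// against the SQL command reference.
CREATE TABLE production_catalog.my_database.my_table()
    PARTITIONED BY $event_date;
```

If this holds in your environment, any user in your organization can reference production_catalog in the same way without re-creating the connection.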

Learn More

See the SQL command reference for AWS Glue Data Catalog for the full list of connection options and examples.
