LogoLogo
OverviewQuickstartsHow To GuidesReferenceArticlesSupport
Quickstarts
Quickstarts
  • Quickstarts
  • DATA INGESTION WIZARD
    • Using the Wizard
      • Source Set-up
        • Amazon Kinesis
        • Amazon S3
        • Apache Kafka
        • Confluent Cloud
        • Microsoft SQL Server
        • MongoDB
        • MySQL
        • PostgreSQL
      • Target Set-up
        • Amazon Redshift
        • AWS Glue Data Catalog
        • ClickHouse
        • Polaris Catalog
        • Snowflake
      • Job Configuration
        • Job Configuration
        • Job Configuration for CDC
      • Review and Run Job
  • CONNECTORS
    • Connectors
      • Amazon Kinesis
      • Amazon Redshift
      • Amazon S3
      • Apache Kafka
      • AWS Glue Data Catalog
      • ClickHouse
      • Confluent Cloud
      • Elasticsearch
      • Microsoft SQL Server
      • MongoDB
      • MySQL
      • Polaris Catalog
      • PostgreSQL
      • Snowflake
  • JOBS
    • Ingestion
      • Job Basics
        • Ingest to a Staging Table
        • Output to a Target Table
      • Stream and File Sources
        • Amazon Kinesis
        • Amazon S3
        • Apache Kafka
        • Confluent Kafka
      • CDC Sources
        • Microsoft SQL Server
        • MongoDB
        • MySQL
        • PostgreSQL
    • Transformation
      • Updating Data
        • Upsert Data to a Target Table
        • Delete Data from a Target Table
        • Aggregate and Output Data
        • Join Two Data Streams
      • Data Targets
        • Output to Amazon Athena
        • Output to Amazon Redshift
        • Output to Amazon S3
        • Output to Elasticsearch
        • Output to Snowflake
  • APACHE ICEBERG
    • Optimize Your Iceberg Tables
    • Install the Iceberg Table Analyzer
Powered by GitBook
On this page
  1. CONNECTORS
  2. Connectors

AWS Glue Data Catalog

This quickstart describes how to create an AWS Glue Data Catalog connection.

Last updated 12 months ago

Create a connection to the AWS Glue Data Catalog

Before you begin copying your data to Amazon Athena, you must ingest it into the AWS Glue Data Catalog. This maintains your existing schemas, partitions, and other relevant metadata information.

Your connection is persistent, so you won't need to re-create it for every job. The connection is also shared with other users in your organization.

Here’s the code to create a connection to the Glue Data Catalog:

// Syntax
CREATE GLUE_CATALOG CONNECTION <connection_identifier> 
    AWS_ROLE = '<role_arn>'; 

// Example
CREATE GLUE_CATALOG CONNECTION production_catalog
    AWS_ROLE = 'arn:aws:iam::111111111111:role/<upsolver-role-*>';

This code creates a Glue Data Catalog connection named production_catalog, and instructs Upsolver on which AWS_ROLE to use.

For more information on roles, see the Amazon guide to .


Learn More

Please see the SQL command reference for for the full list of connection options, and examples.

Roles terms and concepts
AWS Glue Data Catalog