Hive Metastore

This article describes how to create a connection to your Hive Metastore using a SQL command.

Establishing a Hive Metastore connection allows Upsolver to integrate with the Hive ecosystem, enabling you to create, modify, and query tables within Upsolver. These tables can also be queried from other Hive-compatible services and the Upsolver UI.

This documentation assumes that you have already created an Amazon S3 connection named `s3` with write permissions to the specified storage location. Refer to Upsolver's documentation on creating an Amazon S3 connection for more details.

```sql
CREATE HIVE_METASTORE CONNECTION <connection_name>
    URI = 'thrift://<hostname>:<port>'
    [ COMMENT = '<comment>' ]
    [ DEFAULT_SCHEMA = '<default_schema>' ]
    [ USER_NAME = '<username>' ]
    [ PASSWORD = '<password>' ]
    [ CLIENT_PROPERTIES = ( PROPERTY = 'VALUE' [, ...] ) ]
    [ MAX_CONCURRENT_OPERATIONS = <integer> ]
    [ DEFAULT_STORAGE_CONNECTION = <storage_connection_identifier> ]
    [ DEFAULT_STORAGE_LOCATION = 's3://<bucket>/<prefix>/' ]
```

Connection Options

`URI` — Required

Type: `text`

The URI of your Hive Metastore service, usually in the `thrift://<hostname>:<port>` format.

`COMMENT` — Optional

Type: `text`

(Optional) A description or comment regarding this connection.

`DEFAULT_SCHEMA` — Optional

Type: `text`

(Optional) The default schema to use for tables under this connection.
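
As a sketch of how the two descriptive options above fit together, the statement below adds a comment and a default schema to a connection. The connection name, hostname, schema name, and bucket path are hypothetical placeholders, and the statement assumes an Amazon S3 connection named `s3` already exists, as elsewhere in this article.

```sql
-- Hypothetical names throughout; replace them with your own values.
CREATE HIVE_METASTORE CONNECTION my_documented_hive_connection
    URI = 'thrift://hive.example.com:9083'
    COMMENT = 'Hive Metastore for the analytics team'
    DEFAULT_SCHEMA = 'analytics'
    DEFAULT_STORAGE_CONNECTION = s3
    DEFAULT_STORAGE_LOCATION = 's3://example-bucket/hive/';
```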

`USER_NAME` — Optional

Type: `text`

(Optional) The username for authentication to Hive Metastore.

`PASSWORD` — Optional

Type: `text`

(Optional) The password for authentication to Hive Metastore.
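
If your metastore requires authentication, the credentials could be supplied along these lines. This is a minimal sketch: the connection name, hostname, username, password, and bucket path are all hypothetical, and it again assumes an existing Amazon S3 connection named `s3`.

```sql
-- Hypothetical credentials; substitute the values issued for your metastore.
CREATE HIVE_METASTORE CONNECTION my_authenticated_hive_connection
    URI = 'thrift://hive.example.com:9083'
    USER_NAME = 'hive_user'
    PASSWORD = 'replace-with-your-password'
    DEFAULT_STORAGE_CONNECTION = s3
    DEFAULT_STORAGE_LOCATION = 's3://example-bucket/hive/';
```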

`CLIENT_PROPERTIES` — Optional

Type: `text`

(Optional) Additional client properties for the Hive connection. Formatted as a list: `CLIENT_PROPERTIES = ( PROPERTY = 'VALUE' [, ...] )`.
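
The following sketch only illustrates the shape of the property list. The property names `example_property_one` and `example_property_two` are placeholders, not real Hive client settings; which properties are accepted, and how their names must be written, is not covered by this article. The other names are hypothetical as well.

```sql
-- Placeholder property names: replace them with properties your Hive client supports.
CREATE HIVE_METASTORE CONNECTION my_tuned_hive_connection
    URI = 'thrift://hive.example.com:9083'
    CLIENT_PROPERTIES = (
        example_property_one = 'value_one',
        example_property_two = 'value_two'
    )
    DEFAULT_STORAGE_CONNECTION = s3
    DEFAULT_STORAGE_LOCATION = 's3://example-bucket/hive/';
```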

`MAX_CONCURRENT_OPERATIONS` — Optional

Type: `integer`

(Optional) The maximum number of concurrent operations that can be used by Upsolver servers for this connection.
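
To cap the load that Upsolver places on the metastore, the limit might be set as in the sketch below. The value `10` is an arbitrary illustration rather than a recommended setting, and the connection name, hostname, and bucket path are hypothetical.

```sql
-- The limit of 10 is illustrative only; choose a value suited to your metastore's capacity.
CREATE HIVE_METASTORE CONNECTION my_throttled_hive_connection
    URI = 'thrift://hive.example.com:9083'
    MAX_CONCURRENT_OPERATIONS = 10
    DEFAULT_STORAGE_CONNECTION = s3
    DEFAULT_STORAGE_LOCATION = 's3://example-bucket/hive/';
```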

`DEFAULT_STORAGE_CONNECTION` — Optional

Type: `identifier`

(Optional) An Amazon S3 connection identifier with the appropriate credentials to write to the provided storage location.

`DEFAULT_STORAGE_LOCATION` — Optional

Type: `text`

(Optional) The Amazon S3 path that serves as the default storage location for the underlying files associated with tables created under this metastore connection.

Minimum Example

```sql
CREATE HIVE_METASTORE CONNECTION my_hive_connection
    URI = 'thrift://hive.upsolver.com:9083'
    DEFAULT_STORAGE_CONNECTION = s3
    DEFAULT_STORAGE_LOCATION = 's3://test-bucket/prefix/';
```

