Step 1: Connect with Hive Metastore through AWS PrivateLink
Ryft integrates with Hive Metastore using the default Thrift API. To set up the integration, obtain the Thrift URI of your Hive Metastore server and provide it to Ryft support. The URI should look similar to:
thrift://<metastore-host>:9083
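Before sending the URI, it can help to confirm that it is well formed. The following is a minimal sketch, not part of Ryft's tooling: the function name `parse_thrift_uri` and the example hostname are hypothetical, and falling back to port 9083 when the URI omits a port is an assumption based on the default mentioned in this guide.

```python
from typing import Tuple
from urllib.parse import urlsplit

DEFAULT_METASTORE_PORT = 9083  # default Hive Metastore Thrift port


def parse_thrift_uri(uri: str) -> Tuple[str, int]:
    """Split a thrift:// URI into (host, port); hypothetical helper."""
    parts = urlsplit(uri.strip())
    if parts.scheme != "thrift":
        raise ValueError(f"expected a thrift:// URI, got scheme {parts.scheme!r}")
    if not parts.hostname:
        raise ValueError("the URI has no host component")
    # parts.port is None when the URI omits the port (assumed fallback: 9083)
    port = parts.port if parts.port is not None else DEFAULT_METASTORE_PORT
    return parts.hostname, port
```

For example, `parse_thrift_uri("thrift://metastore.example.internal:9083")` yields the host and port as separate values, which is convenient for the connectivity checks that follow.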
Network connectivity
Ensure that the Ryft instance can connect to the Hive Metastore server over the network. The default port for Hive Metastore is 9083, but this may vary based on your configuration.
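A quick way to confirm that the Metastore port is listening is a plain TCP connection attempt. The sketch below uses only the Python standard library; the function name `can_reach` and the 5-second timeout are illustrative choices, not something this guide prescribes. Note that a successful check from your own machine only shows that the port is open from that network location; it does not prove that the Ryft instance has the same network path.

```python
import socket


def can_reach(host: str, port: int = 9083, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port opens within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and DNS resolution failures
        return False
```

Calling `can_reach("<metastore-host>", 9083)` from a host inside the same VPC or subnet as the Metastore is a reasonable first check. If your Metastore listens on a non-default port, pass that port instead.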
AWS PrivateLink
If your Hive Metastore requires PrivateLink connectivity, follow the PrivateLink setup guide to set it up. Note: The only port needed is 9083, which is the default port for Hive Metastore.
Once your PrivateLink endpoint is set up, provide the endpoint details to Ryft, including the Availability Zones (AZs) where the endpoint is available.
IP Access Lists
If you are restricting network access to specific IP addresses, please allow the following IP addresses:
Step 2: Create “Ryft-ControlPlaneRole” for metadata access
Ryft needs access to the metadata files stored in S3, in addition to catalog access. The easiest way to set up this role is with CloudFormation / Terraform, as described here. For manual setup, use the manual guide, and make sure to define the S3 policy.
Locate the ARN of the role you have just created (“Ryft-ControlPlaneRole”) and provide it to Ryft.
The ARN should look similar to:
arn:aws:iam::<account>:role/Ryft-ControlPlaneRole
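To catch copy-paste mistakes before sending the ARN, you could check it against the shape shown above. This is a rough sketch under stated assumptions: it assumes the standard `aws` partition, a 12-digit account ID, and a role created without an IAM path. The names `ROLE_ARN_PATTERN` and `account_id_from_role_arn` are hypothetical, and the pattern would need adjusting if your role lives in another partition or under a path.

```python
import re

# Assumed shape: "aws" partition, 12-digit account ID, no role path
ROLE_ARN_PATTERN = re.compile(r"arn:aws:iam::(\d{12}):role/Ryft-ControlPlaneRole")


def account_id_from_role_arn(arn: str) -> str:
    """Return the account ID if the ARN matches the expected shape."""
    match = ROLE_ARN_PATTERN.fullmatch(arn.strip())
    if match is None:
        raise ValueError(f"unexpected role ARN format: {arn!r}")
    return match.group(1)
```

Extracting the account ID also gives a quick way to confirm that the role was created in the AWS account you intended.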