Add a helper function for connecting to Redshift #879

atheriel · 2025-01-10T16:07:34Z

This commit introduces a new `odbc::redshift()` function intended to help set up connections to AWS Redshift, especially when using IAM credentials. It generally follows the pattern established with `odbc::databricks()` and `odbc::snowflake()`.

Note that finding IAM credentials is outsourced to `paws.common`. Some Redshift ODBC drivers (there are a few of them) can handle AWS profiles or IAM role assumption, and I did have an earlier version of this commit that did this work manually; unfortunately the supported parameters depend not only on the driver, but also the driver version, and it quickly became a mess.

So: using `paws.common` here allows us to be agnostic about the driver version, and ensures we support as wide a range of IAM setups as possible.
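To illustrate the idea, here is a minimal sketch of turning already-resolved AWS credentials into connection arguments. It is not the code in this PR: `iam_args()` is a hypothetical helper, the field names on `creds` are assumed to match what `paws.common::locate_credentials()` returns, and the keyword names (`IAM`, `ClusterID`, `AccessKeyID`, and so on) are assumptions based on the Amazon Redshift ODBC driver's documented options.

```r
# Hypothetical helper (not part of odbc): map resolved AWS credentials onto
# ODBC connection arguments. Keyword names are assumptions taken from the
# Amazon Redshift ODBC driver's options and may differ between drivers.
iam_args <- function(creds, cluster_id, region, database, db_user) {
  args <- list(
    IAM = 1,
    ClusterID = cluster_id,
    Region = region,
    Database = database,
    DbUser = db_user,
    AccessKeyID = creds$access_key_id,
    SecretAccessKey = creds$secret_access_key
  )
  # Temporary credentials (for example from an assumed role) also carry a
  # session token; long-lived access keys do not.
  if (!is.null(creds$session_token) && nzchar(creds$session_token)) {
    args$SessionToken <- creds$session_token
  }
  args
}

# Placeholder credentials for illustration only.
creds <- list(
  access_key_id = "AKIAEXAMPLE",
  secret_access_key = "example-secret",
  session_token = "example-token"
)
args <- iam_args(creds, "tn-demo-cluster", "us-east-2", "demo_db", "tnederlof")
names(args)
```

In practice `creds` would come from something like `paws.common::locate_credentials()` rather than a hand-written list, so profiles, environment variables, and role assumption are resolved in one place before the driver is involved.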

Note that I also refactored some of the automatic driver discovery for Databricks and Snowflake into its own helper utility so we could use the same logic for Redshift, and fixed some grammar issues in their respective error messages.
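A shared discovery helper of this kind might look roughly like the sketch below. This is an illustration under assumptions, not the helper added in this PR: `find_default_driver()`, its arguments, and the error wording are all hypothetical. It prefers a driver library found at a known path and otherwise falls back to a driver registered under a known name.

```r
# Hypothetical sketch of a shared driver-discovery helper (names and
# behaviour are assumptions, not the odbc implementation).
find_default_driver <- function(paths,
                                driver_names = character(),
                                installed = character(),
                                label = "the database") {
  # Prefer a driver library that exists at one of the known install paths.
  found <- paths[file.exists(paths)]
  if (length(found) > 0) {
    return(found[[1]])
  }
  # Otherwise fall back to a driver registered under one of the known names.
  registered <- intersect(driver_names, installed)
  if (length(registered) > 0) {
    return(registered[[1]])
  }
  stop(
    sprintf(
      "Failed to automatically find a driver for %s. Please set the `driver` argument explicitly.",
      label
    ),
    call. = FALSE
  )
}
```

Each backend-specific function would then only need to supply its own candidate paths and names, which keeps the lookup order and the error message consistent across Databricks, Snowflake, and Redshift.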

Unit tests are included.

Part of #878.

atheriel · 2025-01-10T16:49:36Z

Test failures seem unrelated.

simonpcouch · 2025-01-10T17:04:26Z

Yup, failures are unrelated and addressed in #876! No worries there.

tnederlof · 2025-01-13T18:37:35Z

This seems to work as expected with this code.

library(odbc)

con <- DBI::dbConnect(
  odbc::redshift(),
  region = "us-east-2",
  clusterId = "tn-demo-cluster",
  database = "demo_db",
  dbuser = "tnederlof"
)

dbListTables(con)

I do need to specify the region; otherwise, I get this error: `Error occurred while trying to connect: ClusterNotFound: Cluster tn-demo-cluster not found.`
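One possible workaround on the caller's side is to resolve the region before connecting. The sketch below is only an assumption about how that could be done, not behaviour of `odbc::redshift()`: `resolve_region()` is a hypothetical helper that prefers an explicit value and then falls back to the conventional `AWS_REGION` and `AWS_DEFAULT_REGION` environment variables.

```r
# Hypothetical helper (not part of odbc): pick a region from an explicit
# argument, then from the conventional AWS environment variables.
resolve_region <- function(region = NULL) {
  candidates <- c(
    region,
    Sys.getenv("AWS_REGION"),
    Sys.getenv("AWS_DEFAULT_REGION")
  )
  # Unset environment variables come back as "", so drop empty strings.
  candidates <- candidates[nzchar(candidates)]
  if (length(candidates) == 0) {
    stop("No AWS region found; pass `region` explicitly.", call. = FALSE)
  }
  candidates[[1]]
}
```

The result could then be passed as `region = resolve_region()` in the `DBI::dbConnect()` call above, so a missing region fails early with a clearer message than `ClusterNotFound`.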

Signed-off-by: Aaron Jacobs <aaron.jacobs@rstudio.com>

atheriel · 2025-01-13T19:03:08Z

Some checks are failing due to #882.

atheriel force-pushed the redshift-helper-v2 branch 2 times, most recently from 029888d to 2af0891 on January 13, 2025 17:54

atheriel force-pushed the redshift-helper-v2 branch from 2af0891 to 0b5ec69 on January 13, 2025 19:03
