Skip to content

Commit

Permalink
Add datasources for projects and global connections
Browse files Browse the repository at this point in the history
  • Loading branch information
b-per committed Sep 2, 2024
1 parent 5e91a94 commit 6903e1a
Show file tree
Hide file tree
Showing 24 changed files with 2,450 additions and 880 deletions.
11 changes: 10 additions & 1 deletion CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,16 @@

All notable changes to this project will be documented in this file.

## [Unreleased](https://github.com/dbt-labs/terraform-provider-dbtcloud/compare/v1.3.14...HEAD)
## [Unreleased](https://github.com/dbt-labs/terraform-provider-dbtcloud/compare/v1.3.15...HEAD)

# [0.3.15](https://github.com/dbt-labs/terraform-provider-dbtcloud/compare/v0.3.14...v0.3.15)

### Changes

- Add a `dbtcloud_projects` (with an "s") datasource to return all the projects along with some information about the warehouse connections and repositories connected to those projects. Loops through the API in case there are more than 100 projects
- Along with the `check` block, it can be used to check that there are no duplicate project names for example.
- Add a datasource for `dbtcloud_global_connection` with the same information as the corresponding resource
- Add a datasource for `dbtcloud_global_connections` (with an "s"), returning all the connections of an account along with details like the number of environments they are used in. This could be used to check that connections don't have the same names or that connections are all used by projects.

# [0.3.14](https://github.com/dbt-labs/terraform-provider-dbtcloud/compare/v0.3.13...v0.3.14)

Expand Down
224 changes: 224 additions & 0 deletions docs/data-sources/global_connection.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,224 @@
---
# generated by https://github.com/hashicorp/terraform-plugin-docs
page_title: "dbtcloud_global_connection Data Source - dbtcloud"
subcategory: ""
description: |-
---

# dbtcloud_global_connection (Data Source)



## Example Usage

```terraform
data dbtcloud_global_connection my_connection {
id = 1234
}
```

<!-- schema generated by tfplugindocs -->
## Schema

### Required

- `id` (Number) Connection Identifier

### Read-Only

- `adapter_version` (String) Version of the adapter
- `apache_spark` (Attributes) Apache Spark connection configuration. (see [below for nested schema](#nestedatt--apache_spark))
- `athena` (Attributes) Athena connection configuration. (see [below for nested schema](#nestedatt--athena))
- `bigquery` (Attributes) (see [below for nested schema](#nestedatt--bigquery))
- `databricks` (Attributes) Databricks connection configuration (see [below for nested schema](#nestedatt--databricks))
- `fabric` (Attributes) Microsoft Fabric connection configuration. (see [below for nested schema](#nestedatt--fabric))
- `is_ssh_tunnel_enabled` (Boolean) Whether the connection can use an SSH tunnel
- `name` (String) Connection name
- `oauth_configuration_id` (Number)
- `postgres` (Attributes) PostgreSQL connection configuration. (see [below for nested schema](#nestedatt--postgres))
- `private_link_endpoint_id` (String) Private Link Endpoint ID. This ID can be found using the `privatelink_endpoint` data source
- `redshift` (Attributes) Redshift connection configuration (see [below for nested schema](#nestedatt--redshift))
- `snowflake` (Attributes) Snowflake connection configuration (see [below for nested schema](#nestedatt--snowflake))
- `starburst` (Attributes) Starburst/Trino connection configuration. (see [below for nested schema](#nestedatt--starburst))
- `synapse` (Attributes) Azure Synapse Analytics connection configuration. (see [below for nested schema](#nestedatt--synapse))

<a id="nestedatt--apache_spark"></a>
### Nested Schema for `apache_spark`

Read-Only:

- `auth` (String) Auth
- `cluster` (String) Spark cluster for the connection
- `connect_retries` (Number) Connection retries. Default=0
- `connect_timeout` (Number) Connection time out in seconds. Default=10
- `host` (String) Hostname of the connection
- `method` (String) Authentication method for the connection (http or thrift).
- `organization` (String) Organization ID
- `port` (Number) Port for the connection. Default=443
- `user` (String) User


<a id="nestedatt--athena"></a>
### Nested Schema for `athena`

Read-Only:

- `database` (String) Specify the database (data catalog) to build models into (lowercase only).
- `num_boto3_retries` (Number) Number of times to retry boto3 requests (e.g. deleting S3 files for materialized tables).
- `num_iceberg_retries` (Number) Number of times to retry iceberg commit queries to fix ICEBERG_COMMIT_ERROR.
- `num_retries` (Number) Number of times to retry a failing query.
- `poll_interval` (Number) Interval in seconds to use for polling the status of query results in Athena.
- `region_name` (String) AWS region of your Athena instance.
- `s3_data_dir` (String) Prefix for storing tables, if different from the connection's S3 staging directory.
- `s3_data_naming` (String) How to generate table paths in the S3 data directory.
- `s3_staging_dir` (String) S3 location to store Athena query results and metadata.
- `s3_tmp_table_dir` (String) Prefix for storing temporary tables, if different from the connection's S3 data directory.
- `spark_work_group` (String) Identifier of Athena Spark workgroup for running Python models.
- `work_group` (String) Identifier of Athena workgroup.


<a id="nestedatt--bigquery"></a>
### Nested Schema for `bigquery`

Required:

- `gcp_project_id` (String) The GCP project ID to use for the connection

Read-Only:

- `application_id` (String, Sensitive) OAuth Client ID
- `application_secret` (String, Sensitive) OAuth Client Secret
- `auth_provider_x509_cert_url` (String) Auth Provider X509 Cert URL for the Service Account
- `auth_uri` (String) Auth URI for the Service Account
- `client_email` (String) Service Account email
- `client_id` (String) Client ID of the Service Account
- `client_x509_cert_url` (String) Client X509 Cert URL for the Service Account
- `dataproc_cluster_name` (String) Dataproc cluster name for PySpark workloads
- `dataproc_region` (String) Google Cloud region for PySpark workloads on Dataproc
- `execution_project` (String) Project to bill for query execution
- `gcs_bucket` (String) URI for a Google Cloud Storage bucket to host Python code executed via Datapro
- `impersonate_service_account` (String) Service Account to impersonate when running queries
- `job_creation_timeout_seconds` (Number) Maximum timeout for the job creation step
- `job_retry_deadline_seconds` (Number) Total number of seconds to wait while retrying the same query
- `location` (String) Location to create new Datasets in
- `maximum_bytes_billed` (Number) Max number of bytes that can be billed for a given BigQuery query
- `priority` (String) The priority with which to execute BigQuery queries (batch or interactive)
- `private_key` (String, Sensitive) Private Key for the Service Account
- `private_key_id` (String) Private Key ID for the Service Account
- `retries` (Number) Number of retries for queries
- `scopes` (Set of String) OAuth scopes for the BigQuery connection
- `timeout_seconds` (Number) Timeout in seconds for queries
- `token_uri` (String) Token URI for the Service Account


<a id="nestedatt--databricks"></a>
### Nested Schema for `databricks`

Read-Only:

- `catalog` (String) Catalog name if Unity Catalog is enabled in your Databricks workspace.
- `client_id` (String) Required to enable Databricks OAuth authentication for IDE developers.
- `client_secret` (String) Required to enable Databricks OAuth authentication for IDE developers.
- `host` (String) The hostname of the Databricks cluster or SQL warehouse.
- `http_path` (String) The HTTP path of the Databricks cluster or SQL warehouse.


<a id="nestedatt--fabric"></a>
### Nested Schema for `fabric`

Read-Only:

- `database` (String) The database to connect to for this connection.
- `login_timeout` (Number) The number of seconds used to establish a connection before failing. Defaults to 0, which means that the timeout is disabled or uses the default system settings.
- `port` (Number) The port to connect to for this connection. Default=1433
- `query_timeout` (Number) The number of seconds used to wait for a query before failing. Defaults to 0, which means that the timeout is disabled or uses the default system settings.
- `retries` (Number) The number of automatic times to retry a query before failing. Defaults to 1. Queries with syntax errors will not be retried. This setting can be used to overcome intermittent network issues.
- `server` (String) The server hostname.


<a id="nestedatt--postgres"></a>
### Nested Schema for `postgres`

Read-Only:

- `dbname` (String) The database name for this connection.
- `hostname` (String) The hostname of the database.
- `port` (Number) The port to connect to for this connection. Default=5432
- `ssh_tunnel` (Attributes) PostgreSQL SSH Tunnel configuration (see [below for nested schema](#nestedatt--postgres--ssh_tunnel))

<a id="nestedatt--postgres--ssh_tunnel"></a>
### Nested Schema for `postgres.ssh_tunnel`

Read-Only:

- `hostname` (String) The hostname for the SSH tunnel.
- `id` (Number) The ID of the SSH tunnel connection.
- `port` (Number) The HTTP port for the SSH tunnel.
- `public_key` (String) The SSH public key generated to allow connecting via SSH tunnel.
- `username` (String) The username to use for the SSH tunnel.



<a id="nestedatt--redshift"></a>
### Nested Schema for `redshift`

Required:

- `hostname` (String) The hostname of the data warehouse.

Read-Only:

- `dbname` (String) The database name for this connection.
- `port` (Number) The port to connect to for this connection. Default=5432
- `ssh_tunnel` (Attributes) Redshift SSH Tunnel configuration (see [below for nested schema](#nestedatt--redshift--ssh_tunnel))

<a id="nestedatt--redshift--ssh_tunnel"></a>
### Nested Schema for `redshift.ssh_tunnel`

Read-Only:

- `hostname` (String) The hostname for the SSH tunnel.
- `id` (Number) The ID of the SSH tunnel connection.
- `port` (Number) The HTTP port for the SSH tunnel.
- `public_key` (String) The SSH public key generated to allow connecting via SSH tunnel.
- `username` (String) The username to use for the SSH tunnel.



<a id="nestedatt--snowflake"></a>
### Nested Schema for `snowflake`

Read-Only:

- `account` (String) The Snowflake account name
- `allow_sso` (Boolean) Whether to allow Snowflake OAuth for the connection. If true, the `oauth_client_id` and `oauth_client_secret` fields must be set
- `client_session_keep_alive` (Boolean) If true, the snowflake client will keep connections for longer than the default 4 hours. This is helpful when particularly long-running queries are executing (> 4 hours)
- `database` (String) The default database for the connection
- `oauth_client_id` (String, Sensitive) OAuth Client ID. Required to allow OAuth between dbt Cloud and Snowflake
- `oauth_client_secret` (String, Sensitive) OAuth Client Secret. Required to allow OAuth between dbt Cloud and Snowflake
- `role` (String) The Snowflake role to use when running queries on the connection
- `warehouse` (String) The default Snowflake Warehouse to use for the connection


<a id="nestedatt--starburst"></a>
### Nested Schema for `starburst`

Read-Only:

- `host` (String) The hostname of the account to connect to.
- `method` (String) The authentication method. Only LDAP for now.
- `port` (Number) The port to connect to for this connection. Default=443


<a id="nestedatt--synapse"></a>
### Nested Schema for `synapse`

Read-Only:

- `database` (String) The database to connect to for this connection.
- `host` (String) The server hostname.
- `login_timeout` (Number) The number of seconds used to establish a connection before failing. Defaults to 0, which means that the timeout is disabled or uses the default system settings.
- `port` (Number) The port to connect to for this connection. Default=1433
- `query_timeout` (Number) The number of seconds used to wait for a query before failing. Defaults to 0, which means that the timeout is disabled or uses the default system settings.
- `retries` (Number) The number of automatic times to retry a query before failing. Defaults to 1. Queries with syntax errors will not be retried. This setting can be used to overcome intermittent network issues.
40 changes: 40 additions & 0 deletions docs/data-sources/global_connections.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,40 @@
---
# generated by https://github.com/hashicorp/terraform-plugin-docs
page_title: "dbtcloud_global_connections Data Source - dbtcloud"
subcategory: ""
description: |-
All the connections created on the account with some summary information, like their name, type, when they were created/updated and the number of environments using them.
---

# dbtcloud_global_connections (Data Source)

All the connections created on the account with some summary information, like their name, type, when they were created/updated and the number of environments using them.

## Example Usage

```terraform
data dbtcloud_global_connections my_connections {
}
```

<!-- schema generated by tfplugindocs -->
## Schema

### Read-Only

- `connections` (Attributes Set) A set of all the connections (see [below for nested schema](#nestedatt--connections))

<a id="nestedatt--connections"></a>
### Nested Schema for `connections`

Read-Only:

- `adapter_version` (String) Type of adapter used for the connection
- `created_at` (String) When the connection was created
- `environment__count` (Number) Number of environments using this connection
- `id` (Number) Connection Identifier
- `is_ssh_tunnel_enabled` (Boolean)
- `name` (String) Connection name
- `oauth_configuration_id` (Number)
- `private_link_endpoint_id` (Number) Private Link Endpoint ID.
- `updated_at` (String) When the connection was updated
88 changes: 88 additions & 0 deletions docs/data-sources/projects.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,88 @@
---
# generated by https://github.com/hashicorp/terraform-plugin-docs
page_title: "dbtcloud_projects Data Source - dbtcloud"
subcategory: ""
description: |-
Retrieve all the projects created in dbt Cloud with an optional filter on parts of the project name.
---

# dbtcloud_projects (Data Source)

Retrieve all the projects created in dbt Cloud with an optional filter on parts of the project name.

## Example Usage

```terraform
// can be filtered by parts of the project name
data dbtcloud_projects my_acme_projects {
name_contains = "acme"
}
// or can return all projects
data dbtcloud_projects my_projects {
name_contains = "acme"
}
// this can be used to make sure that there are no distinct projects with the same names for example
locals {
name_occurrences = {
for project in data.dbtcloud_projects.my_projects.projects : project.name => project.id ...
}
duplicates_with_id = [
for name, project_id in local.name_occurrences : "'${name}':${join(",", project_id)}" if length(project_id) > 1
]
}
check "no_different_projects_with_same_name" {
assert {
condition = length(local.duplicates_with_id) == 0
error_message = "There are duplicate project names: ${join(" ; ", local.duplicates_with_id)}"
}
}
```

<!-- schema generated by tfplugindocs -->
## Schema

### Optional

- `name_contains` (String) Used to filter projects by name, Optional

### Read-Only

- `projects` (Attributes Set) Set of projects with their details (see [below for nested schema](#nestedatt--projects))

<a id="nestedatt--projects"></a>
### Nested Schema for `projects`

Read-Only:

- `connection` (Attributes) Details for the connection linked to the project (see [below for nested schema](#nestedatt--projects--connection))
- `created_at` (String) When the project was created
- `dbt_project_subdirectory` (String) Subdirectory for the dbt project inside the git repo
- `description` (String) Project description
- `id` (Number) Project ID
- `name` (String) Project name
- `repository` (Attributes) Details for the repository linked to the project (see [below for nested schema](#nestedatt--projects--repository))
- `semantic_layer_config_id` (Number) Semantic layer config ID
- `updated_at` (String) When the project was last updated

<a id="nestedatt--projects--connection"></a>
### Nested Schema for `projects.connection`

Read-Only:

- `adapter_version` (String) Version of the adapter for the connection. Will tell what connection type it is
- `id` (Number) Connection ID
- `name` (String) Connection name


<a id="nestedatt--projects--repository"></a>
### Nested Schema for `projects.repository`

Read-Only:

- `id` (Number) Repository ID
- `pull_request_url_template` (String) URL template for PRs
- `remote_url` (String) URL of the git repo remote
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
data dbtcloud_global_connection my_connection {
id = 1234
}
Original file line number Diff line number Diff line change
@@ -0,0 +1,2 @@
data dbtcloud_global_connections my_connections {
}
Loading

0 comments on commit 6903e1a

Please sign in to comment.