DCM2BQ

The package name is short for DICOM to BigQuery. It offers a service (and CLI) that creates a JSON representation of a DICOM Part 10 file and stores it in BigQuery, with many options for input and formatting in between.

This open source package is an alternative to the Healthcare API DICOM store feature that streams metadata to BigQuery. It was created primarily to offer a similar option for DICOM data ingested into other storage platforms, starting with Google Cloud Storage.

Why store DICOM metadata in BigQuery? Because traditional imaging systems, such as PACS and VNA, expose only a limited view of the underlying metadata. Storing the full metadata in BigQuery makes it queryable with SQL for analytics across the whole data set.

Configuration

Please refer to the configuration options in the default config file (config.defaults.js).

These options can be set by providing JSON overrides via the DCM2BQ_CONFIG environment variable, or by putting the overrides into a file and passing the file path via the DCM2BQ_CONFIG_FILE environment variable.

The default value will be used for each config option when there's no override provided.
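
The override behavior described above could be sketched roughly as follows. This is an illustration under stated assumptions, not the package's actual loader: the helper names `deepMerge`, `isPlainObject`, and `loadConfig` are hypothetical, and so is the choice to merge nested objects and to let DCM2BQ_CONFIG take precedence over DCM2BQ_CONFIG_FILE when both are set.

```javascript
const fs = require("fs");

// Hypothetical helper: true for plain (non-array, non-null) objects.
function isPlainObject(value) {
  return value !== null && typeof value === "object" && !Array.isArray(value);
}

// Hypothetical helper: returns a new object with `overrides` layered on
// top of `defaults`. Nested objects are merged key by key; any other
// value (including arrays) replaces the default outright.
function deepMerge(defaults, overrides) {
  const result = { ...defaults };
  for (const [key, value] of Object.entries(overrides)) {
    const base = result[key];
    result[key] =
      isPlainObject(base) && isPlainObject(value)
        ? deepMerge(base, value)
        : value;
  }
  return result;
}

// Hypothetical loader. ASSUMPTION: the inline JSON variable wins over the
// file path variable; the real precedence is not stated in this README.
function loadConfig(defaults, env = process.env) {
  let overrides = {};
  if (env.DCM2BQ_CONFIG) {
    overrides = JSON.parse(env.DCM2BQ_CONFIG);
  } else if (env.DCM2BQ_CONFIG_FILE) {
    overrides = JSON.parse(fs.readFileSync(env.DCM2BQ_CONFIG_FILE, "utf8"));
  }
  return deepMerge(defaults, overrides);
}
```

For example, with made-up option names, `loadConfig({ output: { table: "a", pretty: false } }, { DCM2BQ_CONFIG: '{"output":{"pretty":true}}' })` keeps `output.table` at its default and changes only `output.pretty`. The real option names are the ones listed in the default config file.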

Deployment

The default deployment option for this service is shown in the architecture below:

The workflow for this deployment is as follows:

1. An object operation occurs in the GCS bucket where notifications are enabled: an object is written, updated, or deleted, or its metadata is updated.
2. A notification of the event is sent to the Pub/Sub topic, where a Pub/Sub subscription receives the message and pushes it to Cloud Run.
3. Cloud Run routes the message to the HTTP handler within one of the dcm2bq containers.
4. The dcm2bq container processes the message in the following way:
   - Validates the message using JSON Schema; primarily this checks that the message format meets expectations and that the object itself has a DICOM-like extension (\.(dcm|DCM|dicom)).
   - If the message type requires creating a new JSON representation (when new objects are written):
     - Reads the file from storage (GCS) into container memory, parses it, and inserts it into BigQuery. (Note: allocate enough memory for your container to handle your maximum DICOM object size.)
   - If the message type is a delete, saves that operation to the BigQuery table without any DICOM metadata.

If any errors occur within the container, the message is NACK'd and retried. If the message still fails after the maximum number of retries, it is pushed to the dead-letter topic, which then automatically pushes the message to a BigQuery table for further analysis.
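
The validation and dispatch steps in this workflow can be sketched as below. This is a minimal illustration, not the package's actual handler: the names `planAction` and `statusFor` and the returned action labels are hypothetical. It relies on the standard Pub/Sub push envelope, in which Cloud Storage notifications carry `eventType`, `bucketId`, and `objectId` as message attributes, and on a push subscription treating a success status as an ACK and any other status as a NACK. Anchoring the extension pattern with `$` is also an assumption.

```javascript
// Extension check from the workflow above; the trailing `$` is an
// assumption so that only names ending in the extension match.
const DICOM_EXTENSION = /\.(dcm|DCM|dicom)$/;

// Hypothetical planner: inspects a Pub/Sub push body and decides what the
// container would do next. It performs no I/O itself.
function planAction(pushBody) {
  const attrs = pushBody && pushBody.message && pushBody.message.attributes;
  if (!attrs || !attrs.eventType || !attrs.objectId) {
    return { action: "reject", reason: "unexpected message format" };
  }
  if (!DICOM_EXTENSION.test(attrs.objectId)) {
    return { action: "reject", reason: "no DICOM-like extension" };
  }
  const target = { bucket: attrs.bucketId, object: attrs.objectId };
  switch (attrs.eventType) {
    case "OBJECT_FINALIZE":
      // New object written: read it, parse it, insert the JSON into BigQuery.
      return { action: "parse-and-insert", ...target };
    case "OBJECT_DELETE":
      // Deletion: record the operation without any DICOM metadata.
      return { action: "record-delete", ...target };
    default:
      // Other event types (for example metadata updates) are left
      // unspecified in this sketch.
      return { action: "other", eventType: attrs.eventType, ...target };
  }
}

// Hypothetical mapping from processing outcome to HTTP status: 204 tells
// Pub/Sub to ACK, 500 makes it NACK and redeliver according to the
// subscription's retry and dead-letter settings.
function statusFor(succeeded) {
  return succeeded ? 204 : 500;
}
```

Keeping the decision separate from the side effects (downloading from GCS, writing to BigQuery) makes this step easy to unit test with plain objects.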

Note that the code is deployed as a container by default. You can find the latest release of the container image here.
