Overview:

A patient being admitted within a specific time period, 30 days or 90 days, following the previous hospital visit is called a readmission event. Number of readmission events are a metric for US healthcare system and avoidable readmission events cost $41.3B/yr.

For demoing the predictive modeling of readmission use case, synthetic patient data from Synthea is used. This repo has the code for end-to-end machine learing i.e., creating synthetic data from Synthea, storing data in object storage, creating ADW tables, engineering the features and training & deploying model using Oracle ADS.

Business use:

Health insurance companies spend ~ 80% of the cost on ~20% of the insured members. One of the main contributors to this cost is readmission events. Health insurance companies have specialized nurse support to outreach members with an in-hospital admission, to ensure that they are properly treated at the hospital and to send them to a right triage after discharge. Readmission predictive model would help health insurance companies to utilize nurse resources to target members with high risk of readmission and reduce the medical cost.

In US, Center for Medicare and Medicaid services (CMS) provides STARs rating to hospital quality summarizing a variety of measures across 5 areas (mortality, safety of care, readmission, patient experience and timely & effective care) of quality for each hospital. Readmission being one of the key measures, predictive model would help hospitals to identify the patients that are at high risk of readmission and improve their quality of care.

Data set:

Synthea is the best alternative to get medical data for creating demo assets without any PHI/PII issues. Process of generating the data is mentioned in Synthea's README. For the purpose of this ML model, synthetic data in csv format is created for 1M patients.

Meta data:

Synthea creates below csv files. Of these "patients.csv" is the key file with "ID" column as the primary key to link all other csvs

File	Description
patients.csv	Patient demographic data.
allergies.csv	Patient allergy data.
careplans.csv	Patient care plan data, including goals.
claims.csv	Patient claim data.
claims_transactions.csv	Transactions per line item per claim.
conditions.csv	Patient conditions or diagnoses.
devices.csv	Patient-affixed permanent and semi-permanent devices.
encounters.csv	Patient encounter data.
imaging_studies.csv	Patient imaging metadata.
immunizations.csv	Patient immunization data.
medications.csv	Patient medication data.
observations.csv	Patient observations including vital signs and lab reports.
organizations.csv	Provider organizations including hospitals.
payer_transitions.csv	Payer Transition data (i.e. changes in health insurance).
payers.csv	Payer organization data.
procedures.csv	Patient procedure data including surgeries.
providers.csv	Clinicians that provide patient care.
supplies.csv	Supplies used in the provision of care.

CSV file data dictionary is available here.

OCI Product/Services used in this demo:

Reference architecture diagram

Pre-requisites

On desktop machine, install OCI Command Line Interface.
Install 'generalml_p37_cpu_v1' conda environment in datascience notebook
Upgrade ads package using below:

pip uninstall  oracle-ads==2.5.9
pip Install oracle_ads==2.5.9

How to use this repo?

Follow README section of Synthea and create a dataset of atleast 100K patients
Create a bucket in OCI object storage and load all CSVs to object storage. (Reference)
Provision autonomous database as mentioned in this live lab
data_prep folder has data_prep/001_load_tables.sql script to load CSVs to tables in ADW
In ADW SQL worksheet, run feature_extract/002_create_features.sql. Refer to livelab
Run feature_extract/003_combine_features.ipynb to create ML ready dataframe. Refer to livelab for getting ADW wallets
Run model_build/004_build_model.ipynb to create , catalog and deploy a ML model using Oracle ADS

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
data_prep		data_prep
feature_extract		feature_extract
model_build		model_build
Architecture.png		Architecture.png
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview:

Business use:

Data set:

Meta data:

OCI Product/Services used in this demo:

Reference architecture diagram

Pre-requisites

How to use this repo?

About

Releases

Packages

Languages

Chandrak1907/Synthea_readmission

Folders and files

Latest commit

History

Repository files navigation

Overview:

Business use:

Data set:

Meta data:

OCI Product/Services used in this demo:

Reference architecture diagram

Pre-requisites

How to use this repo?

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages