GitHub - LiuPengJugx/ppo-controller: This project designs a paradigm of dynamic partitioning system which uses reinforcement learning algorithm, PPO, to solve the dynamic workload problem in vertical partitioning field.

Self-Adapting Vertical Partitioning for Dynamic Workload

This is the project code of dynamic partitioning algorithm PPO controller. The structure and function of each directory are listed below.

Project structure description:

baselines: Some classic dynamic partitioning algorithms, including AutoStore, Smopdc, Feedback, PPO Controller etc. It also includes a template class, AlgorithmTemplate, which all algorithms need to implement its defined methods.
data: This directory includes workload generator and generated workload files (i.e. tpc-h/ tpc-ds/ synthetic).
db: Database related module. It provides driver class, transaction class, data types related to table structure and load structure, cost model, etc.
environment: Environment module of RL. Where env.py and env5.py respectively is the initial and final version of PPO-Controller's environment file.
experiment: Drawing related files and is responsible for the visualization of experimental results.
log: Log files.
partitioner: Partitioner module. Some files related to SCVP algorithm.
pretrained: It saves temporarily generated model and data files.
selector: Workload selector module. A query selection algorithm for repartitioning.
visualization: It also includes some code files related to workload data visualization.
other single files:
- util.py: A tool class.
- tianshou_ppo.py: The file use tianshou RL library to implement ppo controller (temporary version).
- adapter_controller_pg.py: A partition generator module that conducts experiments on postgresql database.
- tpc-main.py/ syn-main.py: The main program entry file, which is used to conduct the comparative experiment. The dataset and dynamic partition algorithms can be flexibly specified.

Project dependency package:

Some modules required by the project can be seen in the requirement.txt file. It is recommended to run the following commands on the console:

cd ppo-controller

conda create -n dyppo python==3.6

source activate dyppo

pip install -r requirements.txt

DB Configuration (Non-required)

When conducting experiments about latency, users need to configure the PG database environment in advance and modify the user connection information in db\pg.py. All table structures can be imported through db\ppoc.sql. Then we can call adapter_controller_pg.py to deploy partitions and get experimental results.

Run partitioning algorithms on HDD cost model:

syn-main.py: Test the performance of baselines over synthetic datasets
tpc-main.py: Test the performance of baselines over TPC-H / TPC-DS datasets and conduct sensitivity analysis experiments.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Self-Adapting Vertical Partitioning for Dynamic Workload

Project structure description:

Project dependency package:

DB Configuration (Non-required)

Run partitioning algorithms on HDD cost model:

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.idea		.idea
baselines		baselines
data		data
db		db
environment		environment
experiment		experiment
partitioner		partitioner
pretrained		pretrained
selector		selector
stable_pretrained_model		stable_pretrained_model
visualization		visualization
README.md		README.md
Rebuttal-full.docx		Rebuttal-full.docx
adapter_controller_pg.py		adapter_controller_pg.py
ppo_controller.py		ppo_controller.py
requirements.txt		requirements.txt
syn-main.py		syn-main.py
test_HDD_confidence.py		test_HDD_confidence.py
tianshou_ppo.py		tianshou_ppo.py
tpc-main.py		tpc-main.py
util.py		util.py

LiuPengJugx/ppo-controller

Folders and files

Latest commit

History

Repository files navigation

Self-Adapting Vertical Partitioning for Dynamic Workload

Project structure description:

Project dependency package:

DB Configuration (Non-required)

Run partitioning algorithms on HDD cost model:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages