Dror Berel

Computational Biologist at Fred Hutch.

Consultant: currently accepting new projects!

Enthusiastic Statistician with expertise of implementing machine learning (resampling, ensemble, tuning, benchmarking) tools for high-dimensional multiplex data structure. Leveraging higher level scope tools for analysis of convoluted nested datasets, including multi-layer fusion data, n-table dimension reduction methods, and integration of multiple annotation domains. Expert at data engineering pipelines, utilizing scalable object-oriented tools. Designing analytical tools for complex nested experimental design, at the meta-analysis level. Over 15 years of experience in advanced R and Python.

Recent projects:

Multi-assay data analysis

poster: Prototype meta-analysis demonstration for ImmuneSpaceR, using designated S4 objects https://www.bioconductor.org/help/course-materials/2017/BioC2017/DDay/LightningTalk/SessionII/ImmuneSpaceR.pdf

Bioc2mlr

R package to bridge between Bioconductor’s S4 complex genomic data container, to mlr, a meta machine learning aggregator package.

Bioconductor's S4 data containers for genomic assays are popular, well established data structures. Their data architecture facilitates the application of common analytical procedures and well established statistical methodologies to large assay data. They are extensible to encompass new emerging technologies and analytical methods. However, the S4 system enforces strict constraints on the data and these constraints raise barriers for interoperability and integration with software and packages outside of Bioconductor's repository. mlr is a comprehensive package for machine learning. It aggregates hundreds of supervised and unsupervised models and facilitates analytics such as resampling, benchmarking, tuning, and ensemble. The mlrCPO package extends mlr's pre-processing and feature engineering functionality via composable Preprocessing Operators (CPO) 'pipelines'.

Bioc2mlr is a compact utility package designed to bridge between these approaches. It deploys transformations of SummarizedExperiment and MultiAssayExperiment S4 data structures into mlr's expected format. It also implements Bioconductor's popular feature selection (filtering) methods used by limma package and others, as a CPO. The vignettes present comparisons to the MLInterfaces package, which aims to achieve similar goals, and presents workflows for popular publicly available genomic datasets such as curatedTCGAData.

Website

https://drorberel.github.io/

Blog

https://medium.com/@drorberel

Linkedin profile

https://www.linkedin.com/in/dror-berel-1848496/

Publications

https://www.ncbi.nlm.nih.gov/pubmed?term=%22Berel%20D%22[Author]

Name		Name	Last commit message	Last commit date
Latest commit History 527 Commits
.github		.github
_data		_data
_includes		_includes
_layouts		_layouts
_posts		_posts
blog		blog
css		css
img		img
js		js
.Rhistory		.Rhistory
.gitattributes		.gitattributes
.gitignore		.gitignore
404.html		404.html
CHANGELOG.md		CHANGELOG.md
Dockerfile		Dockerfile
Gemfile		Gemfile
Gemfile.lock		Gemfile.lock
Heb.md		Heb.md
LICENSE		LICENSE
README.html		README.html
README.md		README.md
_config.yml		_config.yml
aboutme.md		aboutme.md
consulting.md		consulting.md
drorberel.github.io.Rproj		drorberel.github.io.Rproj
feed.xml		feed.xml
index backup.html		index backup.html
index.md		index.md
plug.jpg		plug.jpg
r-bloggers-feed.xml		r-bloggers-feed.xml
staticman.yml		staticman.yml
tags.html		tags.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dror Berel

Computational Biologist at Fred Hutch.

Consultant: currently accepting new projects!

Recent projects:

Multi-assay data analysis

Bioc2mlr

Website

Blog

Linkedin profile

Publications

About

Releases

Packages

Languages

License

drorberel/drorberel.github.io

Folders and files

Latest commit

History

Repository files navigation

Dror Berel

Computational Biologist at Fred Hutch.

Consultant: currently accepting new projects!

Recent projects:

Multi-assay data analysis

Bioc2mlr

Website

Blog

Linkedin profile

Publications

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages