From ffad27e854186b529cb01ad9c3f03e73dd0c78a6 Mon Sep 17 00:00:00 2001 From: "Documenter.jl" Date: Thu, 12 Sep 2024 05:31:57 +0000 Subject: [PATCH] build based on da3a75b --- dev/.documenter-siteinfo.json | 2 +- dev/definitions/index.html | 2 +- dev/index.html | 8 ++++---- 3 files changed, 6 insertions(+), 6 deletions(-) diff --git a/dev/.documenter-siteinfo.json b/dev/.documenter-siteinfo.json index 86ce920..9884333 100644 --- a/dev/.documenter-siteinfo.json +++ b/dev/.documenter-siteinfo.json @@ -1 +1 @@ -{"documenter":{"julia_version":"1.10.5","generation_timestamp":"2024-09-12T05:05:54","documenter_version":"1.7.0"}} \ No newline at end of file +{"documenter":{"julia_version":"1.10.5","generation_timestamp":"2024-09-12T05:31:54","documenter_version":"1.7.0"}} \ No newline at end of file diff --git a/dev/definitions/index.html b/dev/definitions/index.html index 251dad1..205ac20 100644 --- a/dev/definitions/index.html +++ b/dev/definitions/index.html @@ -1,2 +1,2 @@ -Definitions · PlotPWM.jl

Definition of Information Content in Position Weight Matrices (PWM)

In a position weight matrix (PWM), the "letter height", or more formally, the information content $IC(\cdot)$ of the $i$-th column $c_i$, quantifies how conserved the nucleotides are at that position compared to a background model. It is calculated using the formula:

\[IC(c_i) = \sum_{\alpha}f_{\alpha i}\log_2 (f_{\alpha i} / \beta_\alpha)\]

where $f_{\alpha i}$ is the frequency of nucleotide $\alpha\in\Set{A,C,G,T}$ at the $i$-th column of a PWM and $\beta_\alpha$ denotes the genomic background frequency of nucleotide $\alpha$.

Default genomic background

By default, the background model assumes a uniform distribution of nucleotides, with each nucleotide having a frequency of $\beta=(0.25, 0.25,0.25,0.25)$. In this case, the information content $IC(c_i)$ simplifies to:

\[IC(c_i)=2+\sum_{\alpha}f_{\alpha i}\log_2 f_{\alpha i}\]

This formula illustrates why the y-axis of the logo-plot ranges from $0$ to $2$.

+Definitions · PlotPWM.jl

Definition of Information Content in Position Weight Matrices (PWM)

In a position weight matrix (PWM), the "letter height", or more formally, the information content $IC(\cdot)$ of the $i$-th column $c_i$, quantifies how conserved the nucleotides are at that position compared to a background model. It is calculated using the formula:

\[IC(c_i) = \sum_{\alpha}f_{\alpha i}\log_2 (f_{\alpha i} / \beta_\alpha)\]

where $f_{\alpha i}$ is the frequency of nucleotide $\alpha\in\Set{A,C,G,T}$ at the $i$-th column of a PWM and $\beta_\alpha$ denotes the genomic background frequency of nucleotide $\alpha$.

Default genomic background

By default, the background model assumes a uniform distribution of nucleotides, with each nucleotide having a frequency of $\beta=(0.25, 0.25,0.25,0.25)$. In this case, the information content $IC(c_i)$ simplifies to:

\[IC(c_i)=2+\sum_{\alpha}f_{\alpha i}\log_2 f_{\alpha i}\]

This formula illustrates why the y-axis of the logo-plot ranges from $0$ to $2$.

diff --git a/dev/index.html b/dev/index.html index e705263..2f84264 100644 --- a/dev/index.html +++ b/dev/index.html @@ -1,6 +1,6 @@ -Home · PlotPWM.jl

PlotPWM

Documentation for PlotPWM.

PlotPWM.logoplot_with_arrow_gapsMethod

dsmat: Matrix of distances between the pfms e.g. 12 6 32 8 number of rows = number of "modes" of distances number of columns = number of distance in between the pfms weights: weights for each mode of distances optional parameters: givennumcols: the total number of columns that will be occupied for all the arrow-shapes arrowshapescaleratio: the ratio by which the width of the arrow-shapes will be scaled height_top: the maximum height of the arrow-shapes

source
PlotPWM.save_logoplotMethod
save_logoplot(pfm, background, save_name; dpi=65)

Arguments

  • pfm::Matrix{Real}: Position frequency matrix
  • background::Vector{Real}: Background probabilities of A, C, G, T
  • save_name::String: Name of the path/file to save the plot

Note that

  • pfm must be a probability matrix
    • sum of each column must be 1
  • background must be a vector of length 4
    • must be a vector of probabilities
    • sum of background must be 1

Example


+Home · PlotPWM.jl

PlotPWM

Documentation for PlotPWM.

PlotPWM.logoplot_with_arrow_gapsMethod

dsmat: Matrix of distances between the pfms e.g. 12 6 32 8 number of rows = number of "modes" of distances number of columns = number of distance in between the pfms weights: weights for each mode of distances optional parameters: givennumcols: the total number of columns that will be occupied for all the arrow-shapes arrowshapescaleratio: the ratio by which the width of the arrow-shapes will be scaled height_top: the maximum height of the arrow-shapes

source
PlotPWM.save_logoplotMethod
save_logoplot(pfm, background, save_name; dpi=65)

Arguments

  • pfm::Matrix{Real}: Position frequency matrix
  • background::Vector{Real}: Background probabilities of A, C, G, T
  • save_name::String: Name of the path/file to save the plot

Note that

  • pfm must be a probability matrix
    • sum of each column must be 1
  • background must be a vector of length 4
    • must be a vector of probabilities
    • sum of background must be 1

Example


 pfm =  [0.02  1.0  0.98  0.0   0.0   0.0   0.98  0.0   0.18  1.0
         0.98  0.0  0.02  0.19  0.0   0.96  0.01  0.89  0.03  0.0
         0.0   0.0  0.0   0.77  0.01  0.0   0.0   0.0   0.56  0.0
@@ -13,9 +13,9 @@
 
 #= save the logo plot in the current folder as logo.png with a dpi of 65 =#
 save_logoplot(pfm, background, "logo.png"; dpi=65)
-
source
PlotPWM.save_logoplotMethod
save_logoplot(pfm, save_name; dpi=65)
 
 This is the same as `save_logoplot(pfm, background, save_name; dpi=65)`
 where `background` is set to `[0.25, 0.25, 0.25, 0.25]`
 
-See `save_logoplot(pfm, background, save_name; dpi=65)` for more details.
source
+See `save_logoplot(pfm, background, save_name; dpi=65)` for more details.
source