## Technical section
Volatile adopts a Bayesian hierarchical model based on adjusted closing prices together with sector and industry information, estimating log-prices via polynomials in time.
Denote by $t = 1, \dots, T$ the times at which observations arrive. $T$ corresponds to the number of days in the training dataset, which is taken to be the last one year of data.
Furthermore, denote by $\sigma_j$, for $j = 0, \dots, D$, the prior scale parameters associated with the $j$-th order of a polynomial of degree $D$. The scales are currently set to decrease as $j$ increases, which penalises deviations from zero of higher-order parameters and thereby encourages simpler models.
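For illustration, here is a minimal numpy sketch of decaying prior scales; the harmonic decay below is a hypothetical choice, not necessarily the exact setting Volatile uses:

```python
import numpy as np

D = 7  # hypothetical polynomial degree
j = np.arange(D + 1)

# Harmonically decaying prior scales: sigma_j shrinks as the order j grows,
# penalising higher-order coefficients and encouraging simpler fits.
sigma = 1.0 / (1.0 + j)
print(sigma)  # [1.  0.5  0.333...  ...]
```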
We write:
- $k \to v(k)$ to indicate that an industry $k \in \{1, \dots, K\}$ belongs to a sector $v(k) \in \{1, \dots, V\}$, where $K$ is the number of industries and $V$ the number of sectors;
- $\ell \to k(\ell)$ to indicate that a stock $\ell \in \{1, \dots, N\}$ belongs to an industry $k(\ell)$, where $N$ is the number of stocks.
Then, we construct the hierarchical model

$$
\begin{aligned}
\phi^{\mathrm{m}}_j &\sim \mathcal{N}\big(0,\ \sigma_j^2\big), &\qquad \psi^{\mathrm{m}} &\sim \mathcal{N}\big(0,\ \sigma_\psi^2\big),\\
\phi^{\mathrm{s}}_{v,j} &\sim \mathcal{N}\big(\phi^{\mathrm{m}}_j,\ \sigma_j^2\big), &\qquad \psi^{\mathrm{s}}_v &\sim \mathcal{N}\big(\psi^{\mathrm{m}},\ \sigma_\psi^2\big),\\
\phi^{\mathrm{i}}_{k,j} &\sim \mathcal{N}\big(\phi^{\mathrm{s}}_{v(k),j},\ \sigma_j^2\big), &\qquad \psi^{\mathrm{i}}_k &\sim \mathcal{N}\big(\psi^{\mathrm{s}}_{v(k)},\ \sigma_\psi^2\big),\\
\phi_{\ell,j} &\sim \mathcal{N}\big(\phi^{\mathrm{i}}_{k(\ell),j},\ \sigma_j^2\big), &\qquad \psi_\ell &\sim \mathcal{N}\big(\psi^{\mathrm{i}}_{k(\ell)},\ \sigma_\psi^2\big),\\
y_{\ell,t} &\sim \mathcal{N}\Big(\textstyle\sum_{j=0}^{D} \phi_{\ell,j}\,(t/T)^j,\ \mathrm{softplus}(\psi_\ell)^2\Big),
\end{aligned}
$$

where $y_{\ell,t}$ is the log-price of stock $\ell$ at time $t$ and $\sigma_\psi$ is a fixed prior scale for the $\psi$ parameters. Parameters at market level, $\phi^{\mathrm{m}}$ and $\psi^{\mathrm{m}}$, are prior means for the sector-level parameters $\phi^{\mathrm{s}}$ and $\psi^{\mathrm{s}}$, which in turn are prior means for the industry-level parameters $\phi^{\mathrm{i}}$ and $\psi^{\mathrm{i}}$; finally, the latter are prior means for the stock-level parameters $\phi$ and $\psi$. Components of the parameters at each level are assumed to be conditionally independent given the parameters at the level above in the hierarchy. Whereas the $\phi$'s determine the coefficients of the polynomial model, the $\psi$'s determine the scales of the likelihood function.
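To make the generative structure concrete, here is a minimal numpy sketch that samples from a hierarchy of this shape; the toy sizes, the harmonic prior scales, the value of $\sigma_\psi$ and the softplus link are illustrative assumptions rather than Volatile's exact settings:

```python
import numpy as np

rng = np.random.default_rng(0)

D, T = 3, 252                    # hypothetical degree and one year of trading days
V, K, N = 2, 4, 8                # toy numbers of sectors, industries, stocks
v = rng.integers(0, V, size=K)   # industry -> sector membership
k = rng.integers(0, K, size=N)   # stock -> industry membership

sigma = 1.0 / (1.0 + np.arange(D + 1))   # decaying prior scales sigma_j (assumed)
sigma_psi = 1.0                          # fixed prior scale for psi (assumed)
softplus = lambda x: np.log1p(np.exp(x))

# Each level's parameters are centred on the parameters of the level above.
phi_m = rng.normal(0.0, sigma)                       # market,     (D+1,)
phi_s = rng.normal(phi_m, sigma, size=(V, D + 1))    # sectors,    (V, D+1)
phi_i = rng.normal(phi_s[v], sigma)                  # industries, (K, D+1)
phi = rng.normal(phi_i[k], sigma)                    # stocks,     (N, D+1)

psi_m = rng.normal(0.0, sigma_psi)
psi_s = rng.normal(psi_m, sigma_psi, size=V)
psi_i = rng.normal(psi_s[v], sigma_psi)
psi = rng.normal(psi_i[k], sigma_psi)

# Likelihood: log-prices are polynomials in normalised time plus noise.
tt = (np.arange(1, T + 1) / T)[:, None] ** np.arange(D + 1)   # (T, D+1)
y = tt @ phi.T + softplus(psi) * rng.standard_normal((T, N))  # (T, N)
```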
In order to estimate the parameters, we condition on the adjusted closing log-prices $y_{\ell,t}$, for all stocks $\ell$ and times $t$, and estimate the mode of the posterior distribution, also known as the Maximum-A-Posteriori (MAP) estimate. From a frequentist perspective, this corresponds to a polynomial regression task where we minimise a regularised mean-squared error loss. In practice, we train the model sequentially at the different levels: first we train a market-level model to find the market-level parameters; then we fix those and train a sector-level model to find the sector-level parameters; and so on. A plot showing the decay of the losses during training can be saved in the current directory as `losses_decay.png` by adding the flag `--plot-losses` in the command line.
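To illustrate what MAP estimation amounts to at a single level, the sketch below fits one stock's parameters by minimising the negative log-posterior; the optimiser, the initialisation and the softplus parametrisation are assumptions for the sake of the example, not Volatile's actual training code:

```python
import numpy as np
from scipy.optimize import minimize

def neg_log_posterior(params, y, tt, sigma, sigma_psi, prior_phi, prior_psi):
    """Negative log-posterior of one stock's (phi, psi) given log-prices y.

    prior_phi and prior_psi are the parameters estimated at the level above,
    which act as prior means for the current level."""
    phi, psi = params[:-1], params[-1]
    scale = np.log1p(np.exp(psi))            # softplus link (assumed)
    mu = tt @ phi                            # polynomial trend over time
    nll = np.sum(0.5 * ((y - mu) / scale) ** 2 + np.log(scale))
    nlp = np.sum(0.5 * ((phi - prior_phi) / sigma) ** 2)
    nlp += 0.5 * ((psi - prior_psi) / sigma_psi) ** 2
    return nll + nlp

# Toy setup: one stock, degree-3 polynomial, one year of daily log-prices.
D, T = 3, 252
tt = (np.arange(1, T + 1) / T)[:, None] ** np.arange(D + 1)   # (T, D+1)
rng = np.random.default_rng(1)
y = 4.0 + 0.3 * np.arange(1, T + 1) / T + 0.02 * rng.standard_normal(T)

sigma = 1.0 / (1.0 + np.arange(D + 1))      # decaying prior scales (assumed)
res = minimize(neg_log_posterior, x0=np.zeros(D + 2),
               args=(y, tt, sigma, 1.0, np.zeros(D + 1), 0.0))
phi_hat, psi_hat = res.x[:-1], res.x[-1]
```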
Having obtained the estimates $\hat\phi$ and $\hat\psi$, we can use the likelihood mean $\hat\mu_\ell(t) = \sum_{j=0}^{D} \hat\phi_{\ell,j}\,(t/T)^j$ as an estimator of the log-price at any time in the past, as well as a predictor at times in the near future. As a measure of uncertainty, we take the learned scale of the likelihood, that is $\hat\sigma_\ell = \mathrm{softplus}(\hat\psi_\ell)$.
Because we model log-prices as Gaussian, the distribution of prices is a log-Normal distribution, whose mean and standard deviation can be derived in closed form from the estimators $\hat\mu_\ell(t)$ and $\hat\sigma_\ell$. They are, respectively,

$$
\hat P_\ell(t) = e^{\hat\mu_\ell(t) + \hat\sigma_\ell^2 / 2}
\qquad\text{and}\qquad
\hat s_\ell(t) = \hat P_\ell(t)\,\sqrt{e^{\hat\sigma_\ell^2} - 1}.
$$
We use these log-Normal statistics at times $t = 1, \dots, T$ to produce the stock estimation plot, and at time $t = T + 5$ to fill the prediction table. In order to produce the market, sector and industry estimation plots, we proceed analogously but with the estimators at the respective levels, that is $\hat\mu^{\mathrm{m}}(t)$ and $\hat\sigma^{\mathrm{m}}$ for the market, $\hat\mu^{\mathrm{s}}_v(t)$ and $\hat\sigma^{\mathrm{s}}_v$ for sectors, and $\hat\mu^{\mathrm{i}}_k(t)$ and $\hat\sigma^{\mathrm{i}}_k$ for industries.
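The closed-form log-Normal statistics translate directly into code; a minimal sketch:

```python
import numpy as np

def lognormal_stats(mu, sigma):
    """Mean and standard deviation of exp(X) for X ~ N(mu, sigma^2)."""
    mean = np.exp(mu + 0.5 * sigma ** 2)
    std = mean * np.sqrt(np.expm1(sigma ** 2))  # expm1 for numerical accuracy
    return mean, std

# Example: estimated log-price 4.6 with likelihood scale 0.05.
price_mean, price_std = lognormal_stats(4.6, 0.05)
```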
Given the selected model complexity, Volatile trains the model and provides a rate for each stock by introducing the following score:

$$
\mathrm{score}_\ell = \frac{\hat\mu_\ell(T + 5) - y_{\ell,T}}{\hat\sigma_\ell},
$$

where $y_{\ell,T}$ is the last available log-price and $\hat\mu_\ell(T + 5)$ is its prediction in 5 trading days (usually, that corresponds to the log-price in one week). If the future prediction is larger than the current price, the score will be positive; the larger the difference and the more confident we are about the prediction (or, equivalently, the smaller the standard deviation), the more positive the score will be. We can reason similarly if the score is negative. In other words, a large positive score indicates that the current price is undervalued with respect to its stock trend, therefore an opportunity to buy; a large negative score indicates, vice versa, that the current price is overvalued with respect to its stock trend, therefore a moment to sell.

Then, stocks are rated according to the following criteria:
- HIGHLY BELOW TREND if the score is larger than 3;
- BELOW TREND if the score is between 2 and 3;
- ALONG TREND if the score is between -2 and 2;
- ABOVE TREND if the score is between -3 and -2;
- HIGHLY ABOVE TREND if the score is smaller than -3.
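A compact sketch of this scoring and rating logic (function names are illustrative; the thresholds mirror the criteria above):

```python
def score(last_log_price, predicted_log_price, scale):
    """Standardised gap between the 5-day-ahead prediction and the last log-price."""
    return (predicted_log_price - last_log_price) / scale

def rate(s):
    # Thresholds mirror the rating criteria listed above.
    if s > 3:
        return "HIGHLY BELOW TREND"
    if s > 2:
        return "BELOW TREND"
    if s >= -2:
        return "ALONG TREND"
    if s >= -3:
        return "ABOVE TREND"
    return "HIGHLY ABOVE TREND"

print(rate(score(4.60, 4.72, 0.05)))  # score ~ 2.4 -> "BELOW TREND"
```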
Given the price trend estimate $\hat P_\ell(t)$ as a function of time $t$, the percentage trend growth is defined as $100\,\hat P_\ell'(t) / \hat P_\ell(t)$. Because $\hat P_\ell(t) = e^{\hat\mu_\ell(t) + \hat\sigma_\ell^2 / 2}$ with $\hat\sigma_\ell$ constant in time, this equals $100\,\hat\mu_\ell'(t)$; evaluated at the current time $t = T$, it gives the growth number appearing in the table.
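For a polynomial trend the growth has a simple closed form; a minimal sketch, using the same normalisation of time by $T$ as in the model above:

```python
import numpy as np

def percentage_trend_growth(phi_hat, t, T):
    """100 * d/dt of the log-price trend mu(t) = sum_j phi_j (t / T)^j,
    i.e. 100 * sum_j j * phi_j * t^(j-1) / T^j."""
    j = np.arange(len(phi_hat), dtype=float)
    t = float(t)
    return 100.0 * np.sum(j * phi_hat * t ** (j - 1) / T ** j)

phi_hat = np.array([4.0, 0.3, -0.05])  # illustrative fitted coefficients
print(percentage_trend_growth(phi_hat, t=252, T=252))  # growth at current time
```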
The volatility is a measure of how noisy a stock is over time. In Volatile, it is measured as the current standard deviation estimate divided by the current price estimate, that is $\hat s_\ell(T) / \hat P_\ell(T)$.
For each stock, we define its match as the stock in the list that is most closely correlated to it according to some metric. This information is particularly useful in pair trading, a simple trading strategy based on the strong correlation between a pair of stocks. If the two stocks evolve similarly up to some point and then diverge, one could long the underperforming stock and short the overperforming one, with the intention of closing the positions when their evolutions match again. In order to discover correlated stocks, we first train a model analogous to the one described above, but with a very high polynomial degree; we arbitrarily take $D = 52$, that is the number of weeks in a year of training data. The rationale behind this choice is that while a predictive model should exploit a low-complexity polynomial in order to avoid fitting oscillations that may not be inherent to the prediction (i.e. overfitting), a model purposed to discover correlations should be complex enough to indeed capture most of them. Then, because the polynomial is a smooth curve in time, we can again compute the percentage trend growth function $\mathrm{growth}_\ell(t)$ and measure stock correlations via the simple distance metric $d(\ell, \ell') = \frac{1}{T} \sum_{t=1}^{T} \big(\mathrm{growth}_\ell(t) - \mathrm{growth}_{\ell'}(t)\big)^2$. For every stock $\ell$, the stock $\ell'$ that minimises such a distance is named its match. Notice that because the percentage trend growth is a normalised derivative, the nominal value of the trend (and therefore of the price) does not matter when measuring the distance; the metric only looks at its relative variation.
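A minimal sketch of the matching step, assuming the growth curves have already been computed on a common time grid (the mean squared distance is the metric stated above):

```python
import numpy as np

def find_matches(growth):
    """growth: array of shape (num_stocks, T) holding percentage trend
    growth curves. Returns, for each stock, the index of the stock that
    minimises the mean squared distance between growth curves (its match)."""
    d = ((growth[:, None, :] - growth[None, :, :]) ** 2).mean(axis=-1)
    np.fill_diagonal(d, np.inf)  # a stock cannot be its own match
    return d.argmin(axis=1)

growth = np.random.default_rng(2).standard_normal((5, 252))  # toy curves
print(find_matches(growth))
```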
If the symbols passed to Volatile have different price currencies, we first find the most common currency and set it as the default, then we download the last year of exchange-rate information and convert all prices to the default currency. Training and score computation are executed using the converted prices. Mathematically, if $P_t$ is the price of a certain stock in its own currency, we define the converted price $\tilde P_t = c_t\,P_t$, where $c_t$ is the exchange rate from the original currency to the default one. Then, the corresponding log-prices follow the relation $\log \tilde P_t = \log P_t + \log c_t$. Because we model $\log \tilde P_t$ as a Gaussian, $\log P_t$ is also a Gaussian, with the log-exchange rate subtracted from the mean and the same standard deviation. Therefore, after mean and standard deviation estimates of $\log \tilde P_t$ are computed, estimators for $\log P_t$ can be promptly obtained, from which log-Normal mean and standard deviation estimators of $P_t$ can in turn be produced.
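A small numpy sketch of the conversion and of the mapping back to the original currency; the series and the stand-in estimate are illustrative, not Volatile's actual data handling:

```python
import numpy as np

rng = np.random.default_rng(3)
prices = 100.0 + rng.standard_normal(252).cumsum()    # price in its own currency
fx = 0.9 + 0.001 * rng.standard_normal(252).cumsum()  # rate to the default currency

converted = prices * fx            # training runs on these
log_converted = np.log(converted)  # = log(prices) + log(fx)

# After fitting, an estimate mu_tilde of the converted log-price is mapped
# back to the original currency by subtracting the current log-rate; the
# standard deviation estimate is unchanged.
mu_tilde = log_converted.mean()    # stand-in for a fitted estimate
mu_original = mu_tilde - np.log(fx[-1])
```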
We compute a measure of risk for the portfolio as follows:

$$
\mathrm{risk} = \frac{1}{n}\,\mathrm{std}\!\Big(\sum_{\ell=1}^{n} u_\ell\,P_{\ell,T}\Big)
= \frac{1}{n}\,\sqrt{\sum_{\ell=1}^{n} \sum_{\ell'=1}^{n} u_\ell\,u_{\ell'}\,\mathrm{cov}\big(P_{\ell,T},\,P_{\ell',T}\big)},
$$

where $\mathrm{std}$ stands for standard deviation, $\mathrm{cov}$ for covariance, $n$ is the number of different stocks in the portfolio, and $P_{\ell,T}$ and $u_\ell$ are respectively the price and the number of owned units of stock $\ell$ at time $T$. We then make the practical assumption that

$$
\mathrm{cov}\big(P_{\ell,T},\,P_{\ell',T}\big) \approx
\delta_{\ell\ell'}\,\hat s_\ell(T)\,\hat s_{\ell'}(T)
+ \delta_{k(\ell)\,k(\ell')}\,\hat s^{\mathrm{i}}_{k(\ell)}(T)\,\hat s^{\mathrm{i}}_{k(\ell')}(T)
+ \delta_{v(k(\ell))\,v(k(\ell'))}\,\hat s^{\mathrm{s}}_{v(k(\ell))}(T)\,\hat s^{\mathrm{s}}_{v(k(\ell'))}(T)
+ \hat s^{\mathrm{m}}(T)^2,
$$

where $\delta$ denotes a Kronecker delta and $\hat s_\ell$, $\hat s^{\mathrm{i}}_k$, $\hat s^{\mathrm{s}}_v$, $\hat s^{\mathrm{m}}$ are price standard deviation estimators at stock, industry, sector and market levels. Although the covariance approximation above does not exactly correspond to the model in use, it is useful to associate higher risk with stocks of higher volatility, and to increase the risk when multiple stocks in the portfolio belong to the same category.
Notice that because standard deviations are multiplied by the numbers of owned units, and because higher prices usually tend to have higher standard deviations, a larger invested capital tends to be associated with a higher risk. Furthermore, we divide by the number of stocks in the portfolio to promote diversification as a way to lower risk.
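A sketch of the risk computation under this approximate covariance; the array layout and names are illustrative:

```python
import numpy as np

def portfolio_risk(units, s_stock, s_ind, s_sec, s_mkt, ind, sec):
    """Portfolio risk under the approximate covariance described above.

    units:   owned units per stock, shape (n,)
    s_stock: stock-level price std estimates, shape (n,)
    s_ind:   industry-level std estimate attached to each stock, shape (n,)
    s_sec:   sector-level std estimate attached to each stock, shape (n,)
    s_mkt:   market-level std estimate, scalar
    ind/sec: industry and sector memberships per stock, shape (n,)
    """
    n = len(units)
    same_ind = ind[:, None] == ind[None, :]
    same_sec = sec[:, None] == sec[None, :]
    cov = (np.diag(s_stock ** 2)               # stock-level term (diagonal only)
           + same_ind * np.outer(s_ind, s_ind)  # shared-industry term
           + same_sec * np.outer(s_sec, s_sec)  # shared-sector term
           + s_mkt ** 2)                        # market term, common to all pairs
    return np.sqrt(units @ cov @ units) / n

units = np.array([10.0, 5.0, 8.0])
risk = portfolio_risk(units,
                      s_stock=np.array([2.0, 1.5, 3.0]),
                      s_ind=np.array([1.0, 1.0, 1.2]),
                      s_sec=np.array([0.8, 0.8, 0.8]),
                      s_mkt=0.5,
                      ind=np.array([0, 0, 1]),
                      sec=np.array([0, 0, 0]))
```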