Model-Based Filtering via Finite Skew Normal Mixture for Stock Data

Solmaz Yaghoubi; Rahman Farnoosh

doi:10.2991/jsta.d.200827.001

<Previous Article In Issue

Download article (PDF)

Next Article In Issue>

Volume 19, Issue 3, September 2020, Pages 391 - 396

Model-Based Filtering via Finite Skew Normal Mixture for Stock Data

Authors

Solmaz Yaghoubi¹, Rahman Farnoosh²^{, *}

¹Science and Research Branch, Islamic Azad University, Tehran, Iran

²School of Mathematics, Iran University of Science and Technology, Tehran, Iran

^*Corresponding author. Email: rfarnoosh@iust.ac.ir

Corresponding Author

Rahman Farnoosh

Received 9 March 2019, Accepted 28 July 2020, Available Online 8 September 2020.

DOI: 10.2991/jsta.d.200827.001 How to use a DOI?
Keywords: Stock of banks and credit institutions; Mixture model; Clustering time series; Multivariate skew normal; GAS model
Abstract: This paper proposes a flexible finite mixture model framework using multivariate skew normal distribution for banking and credit institutions’ stock data in Iran. This method clusters time series stocks data of Iranian banks and credit institutions to filter those data into four groups. The proposed model estimates matrices of time-varying parameter for skew normal distribution mixture using EM algorithm, updating the estimated parameters via generalized autoregressive score (GAS) model. Empirical studies are conducted to examine the effect of the proposed model in clustering, estimating, and updating parameters for real data from 12 sets of stocks. Our stock data were filtered in four trade clusters with best performance.
Copyright: © 2020 The Authors. Published by Atlantis Press B.V.
Open Access: This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

1. INTRODUCTION

In recent years, clustering algorithms for time series data have found significance because of having a good quality in different kind of applications.

It is specifically useful in stock filtering performance where big databases gathered from market stocks. These data have regularities which can be clustered automatically.

Stock of banks and credit institutions are very diverse in terms of stock trading value, trading volume, growth rate, the first price, the first opening volume, last price, close prices, and the rate of difference between high and low price. Those features help us in clustering time series data representing from market stock. Thus understanding heterogeneous features is of interest and key important in clustering the groups reliably.

Clustering of stocks provides strategies which help the trader of market stocks identify type of banks and credit institution’s stock as the best candidates for buy, the best candidate for sell, as well as developing a watching list controlling for buy and sell.

In this study, we purpose a finite mixture model using Skew Normal Distribution for clustering high-dimensional time series data. The framework estimates a matrix of time-varying parameters and applies updates using score-driven approach proposed by Creal et al. [1], allowing us to robustly cluster the data into approximately homogenous groups.

The skew normal mixture distribution was considered in our proposed model. The main purpose of this model is to deal with data sets that may not be normal and our model is able to robustly cluster and with good performance the high-dimension data that may have an asymmetric- and/or heavy-tailed distribution.

Literature Review

Roengpitya et al. [2], Ayadi et al. [3], and Ayadi and Groen [4] illustrated cluster analysis to identify bank business models. Ahmadzadehgoli [5] proposed The LINEX Weighted k-Means Clustering and Andre Lucas(2017) introduced a finite mixture model for multivariate normal and t distribution which updated parameters using score-driven approach. Creal et al. [1] and Harvey [6] introduced generalized autoregressive score (GAS) model for updating time-varying parameters while Ayadi and Groen [4] explained static clustering methods with dynamic parameters. Catania [7] provided an example of dynamic clustering with dynamic parameters. He proposed a score-driven mixture model and used score-driven updates for all parameters that required a large number of observations.

Finally we applied our model to a multivariate panel of N = 12 stock data of banks and credit institutions for the period 2019/6- 2019/10, i.e., over T = 90 days in 18 week with P = 8 indicator variables for L groups of similar stock data of banks and credit institutions. We identified L = 4 trade model components and illustrated properties of each of group.

In addition, our study contributes to literature on statistic clustering of time series data for stocks (Roengpitya et al. [2], Ayadi et al. [3], and Ayadi and Groen [4]) by identifying stock trade model because we believe the properties of stock models are unlikely to switch their trade model over a short-term period (see, e.g., Ayadi and Groen [4]). This article is organized in 4 sections. In Section 2, we introduce the finite mixture model for skew normal distribution and estimate matrix of parameters using EM algorithm and updating parameters via GAS model through the score-driven approach. Section 3 explains an empirical study of stock data from banks and credit institutions in Iran, and a brief conclusion is presented in Section 4. Note that in this paper all of computations were run using R program.

2. INTRODUCING THE MODEL

2.1. Mixture Model

Let $yit∈ℝP×1$ be a multivariate panel data for the firms i = 1,2,…,N that contains p = 1,2,…,P characteristics for time t = 1,2,…,T. We show $yit$ by L-component mixture model as follows:

$yit=∑l=1Lzil⋅vilti=1,2,…,N,t=1,2,…,T.$ (1)

where

$zil$ are hidden variables of the firm I. If the firm I is in the mixture component L then

$zil=1$ otherwise

$zil=0$ and

$zi=(zi1,zi2,…,zil)′∈{0,1}$ and

$P(zil=1)=ω1$ , where

$ω1+ω2+⋯+ωl=1$ . we define

$vilt∼fl(.|αlt,βlt,γl)$ where

$αlt$ ,

$βlt$ and

$γl$ are mean, covariance matrices, and skewness parameters of skew normal distribution

It suffices to note that all observations were stacked into the matrix $Yit=(yi1,yi2,…,yit)′∈R(T×P)$ as parameters in each mixture component l with $αlt$ and $βlt$ for all times t. It is important to note that we have two type of parameters: static and dynamic. Here $Θ$ contains all static parameters, like $ωl=ωl(Θ)$ , $γl=γl(Θ)$ and $θl=θl(Θ)$ , for which we use the short-hand notion $ωl$ , $γl$ , and $θl$ for simplicity. We specifically explain $αlt$ and $βlt$ which are functions of past data only and updated using score-driven dynamics proposed by Creal et al. [1] while the values for $γl$ are chosen to form time-invariant identity matrices ranging on the interval (−0.99, 0.99) (See Azzalini [8]).

To compute likelihood function by a standard prediction error we have

$log(L(Θ))=∑i=1Nlog∑l=1Lωlfl(YiT;θl),$ (2)

where

$fl(YiT;θl)=∏t=1Tfl(Yit|Yi,t−1;θlt),$ (3)

and

$fl(Yit|Yi,t−1;θlt)$ is the conditional distribution for the multivariate skew normal

$Y=α+γτ+U$ where

$τ$ and

$U$ are independently distributed as

$HN(0,Ip)$ and

$Np(0,β)$ respectively (see Lin [9]).

As is common in our model, we do not estimate $Θ$ directly by numerically maximizing the log-likelihood function in (2). To overcome this problem we use EM algorithm to estimate the parameters (see Dempster et al. [10]). To formulate the EM algorithm for dynamic parameters we need to define the complete data $(YiT,τiT,zi)$ with the likelihood function

$log(Lc(Θ))=∑i=1N∑l=1Lzillogωl−log|βlt|−12(Yit−αlt−γlτit)Tβlt−1(Yit−αlt−γlτit)−12τitTτit,$ (4)

Since $zi$ is hidden indicator we cannot perform a direct maximization and instead we maximize its conditional expectation function over $zi$ given the observed data $YT=(Y1T,…,YNT)$ and some previously known values for the parameter $Θ(k−1)$ . We maximize with respect to $Θ$ (Lin [9]).

$Q(Θ,Θ(k−1))=E[logLc(Θ)|YT;Θ(k−1)]=∑i=1N∑l=1L∑t=1Tẑil(k−1){logωl−12log|βlt|−12(yit−αlt−γlη̂ilt(k−1))Tβlt−1(yit−αlt−γlη̂ilt(k−1))−12tr(βlt−1γl(Ψ̂ilt(k−1)−η̂ilt(k−1)η̂ilt(k−1)T)γlT)},$ (5)

where

$η̂ilt=E(τit),Ψ̂ilt=E(τitτitT).$

In the E-Step, the hidden indicator probabilities are updated using

$ϑ̂il(k):=P[zil=1|YT,Θ(k−1)]=ωl(k−1)fj(yiT;θl(k−1))∑h=1Lωh(k−1)fl(yiT;θh(k−1)),$ (6)

It is important to note that $ϑil(k)$ does not depend on time because the stock trade model is unlikely to vary in a limited time. After updating $ϑil(k)$ , we move to M-Step and maximize $Q(Θ,Θ(k−1))$ with respect to $ωl$ (See Lucas [11]).

$ω̂l(K)=1N∑i=1Nϑ̂il(k−1).$ (7)

2.2. Updating Dynamic Parameters

Now in this section we use the score-driven approach proposed by Creal [1] to formulate dynamic parameters $αlt$ and $βlt$ .

2.2.1. Mean

As explained above, we use the score-driven approach as discussed in Lucas and Zhang [12]:

$αlt+1=αlt+AUαlt.$ (8)

where

$Uαlt$ represents the first derivation of (5) with respect to

$αlt$ and

$A=A(Θ)$ is a diagonal matrix of unknown parameters. By a computation similar to the one found in Lucas [13] we compute

$Uαlt=∑i=1Nϑil(yit−αlt−γlηilt)∑i=1Nϑil.$ (9)

Then we formulate updating mechanism as follows:

$αlt+1=αlt+A∑i=1Nϑil(yit−αlt−γjηilt)∑i=1Nϑil.$ (10)

2.2.2. Covariance matrix

Using the same calculations and the score-driven approach, we have

$βlt+1=βlt+BUβlt,$ (11)

As before $Uβlt$ is the first derivation of (5) with respect to $βlt$ and $B=B(Θ)$ is a diagonal matrix of unknown parameters. Doing the same calculation as the one used above we have

$Uβlt=12∑i=1Nϑil(k)(yit−αlt−γlηilt)T(yit−αlt−γlηilt)−βlt∑i=1Nϑil(k),$ (12)

Then we formulate the updating mechanism as follows:

$βlt+1=βlt+B12∑i=1Nϑil(k)(yit−αlt−γlηilt)T(yit−αlt−γlηilt)−βlt∑i=1Nϑil(k).$ (13)

After updating parameters using equations (10) and (13), we compute $ϑil(k)$ by substitution in equation (6). Next, we maximize (5) with respect to A and B for computing those values. This step is iterated until a convergence is reached.

3. EMPIRICAL STUDY

3.1. Data

In this section we use an empirical example to examine the ability of our proposed model. The sample studied here contains N = 12 stocks of banks and credit institutions for the period 2019/6-2019/10. This covers T = 90 day. We accept that drivers in stocks trade model can be characterized by four dimensions as shown in Figure 1. The best candidate for buy, the best candidate for sell, the watching list controlling for buy and sell.

We select a set of P = 8 features from these four categories. We consider stocks’ trading value, trading volume, growth rate, the first price, the first opening volume, last price, close prices, and the rate of difference between high and low price.

3.2. Model Selection

In this section, the number of clusters for our empirical analysis was selected using some of well-known criteria, i.e., Akaike information criterion (AIC), Bayesian information criterion (BIC), Davies-Bouldin index (DBI), and silhouette index (SI). The purpose of this criterion is to evaluate the structure of clusters created by clustering algorithms. Many criteria have been introduced to evaluate the accuracy of the clustering results.

These indices try to measure the similarity of members within the cluster and the similarity between the clusters. Therefore, the appropriate method is the one that results in the highest level of similarity within a cluster or the greatest differentiation between clusters.

As a likelihood-based model was utilized here, we used standard-likelihood-based criteria, including AIC and BIC, to determine the number of clusters (Hurvich and Tsai [14] and Bai and Ng [15]). The smaller are the values obtained for these two criteria, the more accurate will be the number of clusters. The silhouette index (SI, see de Amorim and Hennig [16] and Davies–Bouldin index (DBI, see Davies and Bouldin [17] criteria express the greatest similarity within a cluster, and larger values found for these two criteria indicate a better choice in terms of selecting the number of clusters. The results are presented in Table 1.

Index	DBI	SI	AIC	BIC
L = 2	0.5769	0.5296	19.6562	1901.23
L = 3	0.5615	0.4552	13.6587	2851.70
L = 4	0.5831	0.5570	17.5505	1802.14
L = 5	0.6683	0.6292	21.4242	4752.62

Table 1 presents likelihood-based (AIC, BIC) and distance-based (DBI, SI ) information criteria indices for different values of L = 2,…,5. The minimum value (AIC, BIC) and maximum value (DBI, SI ) of components suggested L = 4.

Table 1

Information criteria.

3.2.1. Discussion of stock’ trade model

In this section, L = 4 different component densities are applied to different business models. We label a trade model on each cluster as illustrated in Figure 2 which plots the stock trade model for each feature characterization.

(C1) The best candidate for sell (8.33 of firms; e.g., Middle East Bank)

(C2) Watch list controlling for sell (41.66 of firms; e.g., Saderat Bank, Parsian Bank, Sina Bank, Karafarin Bank, Melal Credit Institution)

(C3) Watch list controlling for buy (16.67 of firms; e.g., Tejarat Bank, Pasargad Bank)

(C4) The best candidate for buy (33.34 of firms; e.g., Melat Bank, Eghtesad Novin Bank, Dey Bank, Post Bank)

The best candidate for sell (blue line): These stocks belong to banks and credit institutions that have the lowest trading volume, value of trade, and daily growth rate over a 90-day period. These stocks are the best choice for selling.

Watch list controlling for sell (red line): This cluster shows the stocks ranked as the second lowest in terms of volume, trading value, and daily growth rate over the same period. These stocks are best candidate on the watch list for sale.

Watch list controlling for buy (green line): These stocks belong to a category that ranks the second highest in terms of volume, trading value, and daily growth rate over the same period. These stocks are best placed on the watch list for purchase.

The best candidate for buy (Purple line): These banks and credit institutions have the highest trading volume, value of transactions, and daily growth rate over same time. These stocks are the best choice for buying.

4. CONCLUSION

We proposed a novel finite mixture model for studying stock data, constructing time-varying component parameters matrices, and providing a skew normal distribution mixture. The advantage of using this model over other models is its performance in robust clustering when dealing with any type of data. In an empirical example, we clustered 12 sets of stocks for Iranian banks and credit institutions into four trade model components. The result indicated clusters that recommend selling or buying and controlling for selling and buying.

ACKNOWLEDGMENTS

The authors acknowledge that this article is not in the “conflict of interest” and “author involvement” of others. There is also no “budget statement” for this article. We also appreciate from Referee and associate editor who led to a number of improvements.

REFERENCES

1.D. Creal, S. Koopman, and A. Lucas, J. Appl. Econom., Vol. 28, 2013, pp. 777-795.

2.R. Roengpitya, N. Tarashev, and K. Tsatsaronis, Bank Business Models, BIS Quarterly Review, 2014, pp. 55-65. The bank for International settlement

3.R. Ayadi, E. Arbak, and W.P. de Groen, Business Models in European Banking: A Preand Post-Crisis Screening, Centre for European Policy Studies, 2014, pp. 1-104. CEPS Discussion Paper

4.R. Ayadi and W.P.D. Groen, Bank Business Models Monitor Europe, The International Research Centre on Cooperative Finance, 2015, pp. 0-122. CEPS Working Paper

5.N. Ahmadzadehgoli, A. Mohammadpour, and M.H. Behzadi, J. Stat. Theory Appl., Vol. 18, 2019, pp. 147-154.

6.A.C. Harvey, Dynamic Models for Volatility and Heavy Tails, with Applications to Financial and Economic Time Series, Cambridge University Press, Cambridge, 2013. Econometric Society Monograph

7.L. Catania, Dynamic Adaptive Mixture Models, University of Rome Tor Vergata, 2016. Unpublished Working Paper, arXiv:1603.01308 [stat.ME]

8.A. Azzalini, Scand. J. Stat., Vol. 12, 1985, pp. 171-178.

9.T.I. Lin, J. Multivar. Anal., Vol. 100, 2009, pp. 257-265.

10.A.P. Dempster, N.M. Laird, and D.B. Rubin, J. R. Stat. Soc. B, Vol. 39, 1977, pp. 1-38.

11.A. Lucas, J. Schaumberg, and B. Schwaab, Bank business models at zero interest rates, Taylor and Francis, 2016. Tinbergen Institute Discussion Paper, TI 2016-066/IV

12.A. Lucas and X. Zhang, Int. J. Forecast., Vol. 32, 2016, pp. 293-302.

13.A. Lucas, J. Schaumberg, and B. Schwaab, Bank Business Model Satzero Interest Rates, 2017. Tinbergen Institute Discussion Paper, TI2016-066/IV

14.C.M. Hurvich and C.-L. Tsai, Biometrika, Vol. 76, 1989, pp. 297-307.

15.J. Bai and S. Ng, Econometrica, Vol. 70, 2002, pp. 191-221.

16.R.C. De Amorim and C. Hennig, Inf. Sci., Vol. 324, 2015, pp. 126-145.

17.D.L. Davies and D.W. Bouldin, A Cluster Separation Measure, IEEE Transactions on Pattern Analysis and Machine Intelligence, 1979.

<Previous Article In Issue

Download article (PDF)

Next Article In Issue>

Journal: Journal of Statistical Theory and Applications
Volume-Issue: 19 - 3
Pages: 391 - 396
Publication Date: 2020/09/08
ISSN (Online): 2214-1766
ISSN (Print): 1538-7887
DOI: 10.2991/jsta.d.200827.001 How to use a DOI?
Open Access: This is an open access article distributed under the CC BY-NC 4.0 license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

ris enw bib

TY  - JOUR
AU  - Solmaz Yaghoubi
AU  - Rahman Farnoosh
PY  - 2020
DA  - 2020/09/08
TI  - Model-Based Filtering via Finite Skew Normal Mixture for Stock Data
JO  - Journal of Statistical Theory and Applications
SP  - 391
EP  - 396
VL  - 19
IS  - 3
SN  - 2214-1766
UR  - https://doi.org/10.2991/jsta.d.200827.001
DO  - 10.2991/jsta.d.200827.001
ID  - Yaghoubi2020
ER  -

download .riscopy to clipboard