Statistical Arbitrage in the U.S. Equities Market
Marco Avellaneda∗† and Jeong-Hyun Lee∗
First draft: July 11, 2008
This version: June 15, 2009
Abstract

We study model-driven statistical arbitrage in U.S. equities. The trading signals are generated in two ways: using Principal Component Analysis and using sector ETFs. In both cases, we consider the residuals, or idiosyncratic components of stock returns, and model them as mean-reverting processes. This leads naturally to “contrarian” trading signals. The main contribution of the paper is the construction, back-testing and comparison of market-neutral PCA- and ETF-based strategies applied to the broad universe of U.S. stocks. Back-testing shows that, after accounting for transaction costs, PCA-based strategies have an average annual Sharpe ratio of 1.44 over the period 1997 to 2007, with much stronger performance prior to 2003. During 2003-2007, the average Sharpe ratio of PCA-based strategies was only 0.9. Strategies based on ETFs achieved a Sharpe ratio of 1.1 from 1997 to 2007, experiencing a similar degradation after 2002. We also introduce a method to account for daily trading volume information in the signals (which is akin to using “trading time” as opposed to calendar time), and observe significant improvement in performance in the case of ETF-based signals. ETF strategies which use volume information achieve a Sharpe ratio of 1.51 from 2003 to 2007. The paper also relates the performance of mean-reversion statistical arbitrage strategies to the stock market cycle. In particular, we study in detail the performance of the strategies during the liquidity crisis of the summer of 2007. We obtain results which are consistent with Khandani and Lo (2007) and validate their “unwinding” theory for the quant fund drawdown of August 2007.

∗ Courant Institute of Mathematical Sciences, 251 Mercer Street, New York, N.Y. 10012 USA
† Finance Concepts, 49-51 Avenue Victor-Hugo, 75116 Paris, France.
The term statistical arbitrage encompasses a variety of strategies and investment programs. Their common features are: (i) trading signals are systematic, or rules-based, as opposed to driven by fundamentals, (ii) the trading book is market-neutral, in the sense that it has zero beta with the market, and (iii) the mechanism for generating excess returns is statistical. The idea is to make many bets with positive expected returns, taking advantage of diversification across stocks, to produce a low-volatility investment strategy which is uncorrelated with the market. Holding periods range from a few seconds to days, weeks or even longer.

Pairs-trading is widely assumed to be the “ancestor” of statistical arbitrage. If stocks P and Q are in the same industry or have similar characteristics (e.g. Exxon Mobil and ConocoPhillips), one expects the returns of the two stocks to track each other after controlling for beta. Accordingly, if $P_t$ and $Q_t$ denote the corresponding price time series, then we can model the system as

$$\ln(P_t/P_{t_0}) = \alpha (t - t_0) + \beta \ln(Q_t/Q_{t_0}) + X_t \tag{1}$$

or, in its differential version,

$$\frac{dP_t}{P_t} = \alpha\, dt + \beta\, \frac{dQ_t}{Q_t} + dX_t, \tag{2}$$

where $X_t$ is a stationary, or mean-reverting, process. This process will be referred to as the cointegration residual, or residual, for short, in the rest of the paper. In many cases of interest, the drift α is small compared to the fluctuations of $X_t$ and can therefore be neglected. This means that, after controlling for beta, the long-short portfolio oscillates near some statistical equilibrium. The model suggests a contrarian investment strategy in which we go long 1 dollar of stock P and short β dollars of stock Q if $X_t$ is small and, conversely, go short P and long Q if $X_t$ is large. The portfolio is expected to produce a positive return as valuations converge (see Pole (2007) for a comprehensive review of statistical arbitrage and co-integration). The mean-reversion paradigm is typically associated with market...
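As a rough illustration of the pairs model in equation (1), the sketch below estimates β by ordinary least squares and turns the latest residual into a contrarian signal. It is a minimal sketch under stated assumptions, not the procedure used in the paper: the drift α is neglected (as the text allows when it is small), an intercept is added to absorb the mean of the residual, and the function names `fit_pair` and `contrarian_signal` and the entry threshold of 1.0 standard deviations are hypothetical choices made only for this example.

```python
import math
from statistics import fmean, pstdev


def fit_pair(p, q):
    """OLS fit of ln(P_t/P_t0) = c + beta * ln(Q_t/Q_t0) + X_t.

    Assumption for this sketch: the drift alpha is neglected and the
    intercept c absorbs the mean of the residual X_t.
    Returns (beta, residuals).
    """
    y = [math.log(v / p[0]) for v in p]  # log price relatives of P
    x = [math.log(v / q[0]) for v in q]  # log price relatives of Q
    my, mx = fmean(y), fmean(x)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    beta = sxy / sxx
    c = my - beta * mx
    resid = [yi - c - beta * xi for xi, yi in zip(x, y)]
    return beta, resid


def contrarian_signal(resid, threshold=1.0):
    """Map the latest residual to a position.

    +1: long 1 dollar of P, short beta dollars of Q (residual is small)
    -1: short P, long Q (residual is large)
     0: no position
    The threshold (in standard deviations) is an illustrative assumption.
    """
    sigma = pstdev(resid)
    if sigma == 0.0:
        return 0
    z = resid[-1] / sigma
    if z < -threshold:
        return 1
    if z > threshold:
        return -1
    return 0


if __name__ == "__main__":
    # Made-up prices, for illustration only.
    q = [100.0, 101.0, 99.0, 102.0, 103.0, 101.0, 104.0]
    p = [50.0, 50.9, 49.1, 51.8, 52.0, 50.6, 51.0]
    beta, resid = fit_pair(p, q)
    print(beta, contrarian_signal(resid))
```

Because the residuals are measured in-sample, the signal here only shows the direction of the trade implied by the model; a practical implementation would estimate β on one window and evaluate the residual on later data.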
References

Associated Press, Quant funds endure August turmoil. The Motley Fool, December 6, 2007.
Barr, A., Quant quake shakes hedge-fund giants: Goldman, Renaissance, AQR see losses, but also sense opportunity. Marketwatch, August 13, 2007.
Cont, R. and Da Fonseca, J., Dynamics of implied volatility surfaces. Quantitative Finance, 2002, Vol. 2, No. 1, 45-60.
Davis, G., Mallat, S. and Avellaneda, M., Adaptive greedy approximations. Constructive Approximations, 1997, Vol. 13, No. 1, 57-98.
Jolliffe, I. T., Principal Components Analysis, Springer Series in Statistics, Springer-Verlag, Heidelberg, 2002.
Khandani, A. E. and Lo, A. W., What happened to the quants in August 2007? SSRN, 2007.
Laloux, L., Cizeau, P., Potters, M. and Bouchaud, J. P., Random matrix theory and financial correlations. International Journal of Theoretical and Applied Finance, 2000, Vol. 3, No. 3, 391-397.
Lehmann, B., Fads, martingales, and market efficiency. Quarterly Journal of Economics, 1990, Vol. 105, No. 1, 1-28.
Litterman, R. and Scheinkman, J. A., Common factors affecting bond returns. Journal of Fixed Income, June 1991, 54-61.
Lo, A. W. and MacKinlay, A. C., When are contrarian profits due to stock market overreaction? The Review of Financial Studies, 1990, Vol. 3, No. 2, 175-205.
Plerou, V., Gopikrishnan, P., Rosenow, B., Amaral, L. N., Guhr, T. and Stanley, H. E., Random matrix approach to cross correlations in financial data. Phys. Rev. E, 2002, 65, 066126.
Pole, A., Statistical arbitrage: Algorithmic trading insights and techniques, Wiley Finance, 2007.
Poterba, J. M. and Summers, L. H., Mean reversion in stock prices: evidence and implications. Journal of Financial Economics, 1988, Vol. 22, 27-59.
Potters, M., Bouchaud, J. P. and Laloux, L., Financial application of random matrix theory: old laces and new pieces. Acta Physica Polonica B, 2005, Vol. 36, No. 9, 2767.
Rusli, E. M., Goldman Sachs Alpha to Fail?, Forbes.com, August 9, 2007.
Scherer, K. P. and Avellaneda, M., All for one and one for all? A principal component analysis of Latin American brady bond debt from 1994 to 2000. International Journal of Theoretical and Applied Finance, 2002, Vol. 5, No. 1, 79-106.