Fat Tail Events – Does My Tail Look Fat In This? Part 2 by Dr Ewan Kirk, Cantab Capital
Investors and managers are concerned with “fat tails”. In the second part of this post, we look at kurtosis in more detail.
An apology and a warning
This piece is more technical and longer than I had expected. The problem we’re looking at here is subtle and not easy to distill down to a short, punchy and maths-free post. Sometimes the world isn’t simple.
Fat Tail Events – Introduction
In Part 1 of Does My Tail Look Fat In This, we saw how simple volatility scaling rules can help to reduce the incidence of fat tails and make market processes look a lot more Gaussian. Whilst this was useful, there is still a fat tails problem and it is exemplified by the events of “Black Monday” on the 19th of October 1987. Dealing with this event is going to be much more difficult. To understand the problem better it’s time to formalize what we mean by “fat tails”. When we are measuring distributions, we use a measure called kurtosis to describe how fat or thin tailed a distribution is relative to a standard Gaussian. For a sample of nn returns riri from a market, we calculate the excess kurtosis using the following equation:
Most graphing or spreadsheet packages will calculate this quantity (and many other quantities of interest) so you don’t need to commit this formula to memory! In financial literature, authors sometimes play fast and loose with the terms excess kurtosis and kurtosis.(1) We’ve defined excess kurtosis in the equation above but in everything that follows, everything that we call kurtosis is in fact excess kurtosis.(2)
What do real markets look like?
The easiest way to get a feel for kurtosis is to calculate it (and other statistics) for various markets and distributions. In all cases we will be comparing to a 10% volatility Gaussian daily distribution.(3)
|Crude Oil Scaled||01Jan85||10.4%||2.6||19|
|S&P 500 Scaled||01Jan85||10.4%||21.8||19|
In this table, “Big Days” is defined as the number of days in the 30 year period where there is a return — either positive or negative — which is greater than four standard deviations.(4)
The obvious stand out thing in this table is just how kurtotic (or strictly “lepto-kurtotic”) both markets are and how the equities market has an extremely high kurtosis even after scaling to a 10% volatility process.
It is very common when modelling a market to use this empirical data to construct model distributions to describe the potential future path for each of the markets. Unfortunately these empirical measurements are very sensitive to the start date of one’s measurements. For example, if we decided to start the equities data in 1988 instead of 1985 then the equities data would look like this.
|S&P 500 Scaled||01Jan88||10.05%||5.43||14|
The comparison between this table and the previous one gives us some insight into just how difficult it is to quantify tails. Just removing three years of data, the volatility of the data has dropped a little but the kurtosis has dropped from 55 to 11. Even more oddly, the number of “Big Days” has gone up!(5) The curse of sampling error has struck again. Small sample sizes have large noise around the estimates of the statistical parameters of a distribution.
We showed in Part 1 that scaling the return distribution by recent volatility removes some of the “fat tailyness” but this doesn’t work for all markets: the S&P500 from 1985 has a higher kurtosis after scaling than the WTI market has before scaling. This is all very confusing: we need a better framework to think about returns in financial markets before we can make any progress.
Volatility scaling is the first step in a process. If you scale a distribution by recent volatility you can think of it as equivalent to saying that the market process is a “Gaussian Mixture” process. It is a —possibly unknowable— set of Gaussian distributions with different volatilities and, if you scale the returns by recent volatility, you remove a lot of this effect. So far so good. However, these large outlier events — which we would only expect to happen once every 50 years or so — happen way more often in real financial markets and scaling the returns doesn’t remove all of the outliers.
Let’s attack the problem in a typical scientific way which is to assume for the moment at least that the problem doesn’t exist. We are going to take the Big Days(6) out of the distibution entirely. We are going to look at the distribution of Small Days first and then look at the Big Days separately.
|S&P 500 Scaled Small Days||01Jan88||9.99%||1.15||0|
So, it appears that if we ignore the problem, it goes away. I suppose this might not be considered progress but it is a start. The excess kurtosis is small and not anything would really change our view of how to model risk.
Dealing with the big days
Whilst it is nice to know that if we ignore the problem then it goes away, it isn’t really an approach designed to encourage career longevity either in managers or investors. There are big days and they are going to generate large positive or negative returns. How can we hedge or reduce these risks?
The canonical Big Day example is the 19th of October 1987. How could we have hedged this event ex ante? Without postulating some psychic abilities to see the future, it isn’t clear that there was any information on the 18th of October 1987 which would have allowed you to predict that the following day was going to be so cataclysmic.(7) There may have been warning signs but it’s probably fair to say that in every market which suffers big moves, there are always warning signs which become apparent in hindsight.
Maybe one could have bought put options on the S&P 500? If one can’t see into the future(8) that would imply that you had in place an investment process which required you to buy puts on a regular basis. Unfortunately, as is well known, purchasing put options systematically is an almost guaranteed money loser. Implied volatilities are almost always higher than realised volatilities(9) and so although purchasing put options removes the nasty left tail, it moves the mean of the distribution to the left. Hedging one’s positions with options loses you money almost all the time and most investors will bail out of an underperforming manager before the put hedge kicks in. This investor preference is probably rational from a career perspective even if it isn’t rational from a long term return perspective.
The “put option” hedge becomes even more problematic when one is dealing with a complex dynamic highly diversified futures portfolio such as CTAs would hold. What exactly is it that you hedge? There are options markets in (most) futures contracts but hedging losses on every position would be over-hedging since the manager is only exposed to the final portfolio losses. This is also a non-starter since no options market maker is going to sell an option on a complex dynamic highly diversified futures portfolio. If the only choices are developing psychic abilities or paying away most of your returns as a result of the implied option bias then this is a rather bleak outlook. But, as always in finance, diversification can come to our aid.
In theory, diversification makes my tail look smaller…
Let’s assume that we have a typical CTA portfolio with approximately 100 assets. If we perform the scaling trick to attempt to remove the non-stationarity problem they all become 10% volatility assets. Finally, let’s assume that these scaled distributions all have an equal excess kurtosis and that each of them has an excess kurtosis of 5 which is reasonably close to the best you can do with scaling. There is a formula for the kurtosis of the portfolio of these assets:(10)
If we have 100 assets each with a kurtosis of 5 then a moment’s work with the trusty HP12C(11) shows that the kurtosis of the final distribution is 0.05. Wow! Our problem is solved…or is it?
Even though CTAs are some of the most highly diversified investment instruments in the world, the much vaunted statements about “100, 150, 250 assets traded” are in fact misleading. There aren’t 100 completely independent assets in the world. Oh, that there were! As we mentioned in a previous post there might be at most ten eigen-assets in a portfolio. Many assets are highly correlated due to their composition or structure.(12) Inserting these numbers into our formula, gives us a kurtosis of 0.5 for the portfolio. Not as good as 0.05 but it’s still a great result. In a world where there are diversified asset classes by diversifying into those asset classes we can theoretically reduce the kurtosis of the resultant portfolio to negligable amounts. In a sense, we reduce the kurtosis as a result of asset class specific idiosyncratic events.
…but it doesn’t in practice
There is a problem with this approach though. When we actually examine the returns of real CTAs or well-known trend models such as the Newedge Trend Indicator (13) we find that, despite being highly diversified, the kurtosis of their returns isn’t smaller than that of the constituent markets: In many cases it is larger! Why has the “free lunch” of diversification disappeared when it comes to kurtosis. What’s going on?
There is no simple answer to this. Why should diversification increase Sharpe ratio but not reduce kurtosis? Here at Cantab this is a very active area of research. We have developed an interesting theoretical “generative” model of idealised markets which has many of the same features as we see in the real world and may provide insight into this effect.
Kurtotic Correlated Gaussian Mixture Models
One way to generate a kurtotic distribution is to mix two Gaussian distributions with different volatilities. For example, we might say that (normalised(14)) markets draw from a 10% volatility distribution most of the time and then every so often they draw from a higher volatility distribution. This theoretical model of relatively constant volatility scaled returns with the occasional “event” matches our intuition about real markets quite well. To make this concrete, let’s create a theoretical distribution where 99 days out of 100 the asset has a 10% volatility and 1 day out of 100 it has a 30% volatility. So here we have three random distributions. The 10% volatility Gaussian with an annualised return of 10%, the 30% Gaussian with a zero mean and a random distribution which has a 1% probability of choosing the 30% Gaussian. Using 100,000 random samples, this distribution looks like this:(15)
We can see that there are quite a lot of tail events and indeed some as large as 9 standard deviations, but we are reaching the limit of being able to work out what is going on by eyeballing the graph. So we turn to our standard deviation and kurtosis statistics.(16) In line with the analysis above, let’s assume that we have more of these Sharpe ratio 1.0 assets and they’re uncorrelated to each other. What happens to the Sharpe and kurtosis as we add these assets together?
|Number of Assets||Return||Vol||Sharpe||Kurtosis|
So each single asset has a Sharpe ratio of about 1 and has a kurtosis of 1.7. This kurtosis is low for some markets but on average quite realistic. As this model gorges on the free lunch of truly uncorrelated assets, the volatility drops and the Sharpe ratio rises to greater than 2 for five of these simulated —and remember, theoretical — assets. The kurtosis drops as we would expect from the theoretical argument in the previous section. This is because the model(17) has a different event generation random number for each asset. They all have a draw from the 30% volatility distribution, but it happens on different days. If there’s panic in a market, there’s a low chance of a panic in another market. This doesn’t fit terribly well with our intuition or empirical observations. When we repeat the experiment assuming that Big Days happen on the same day but that the markets maintain the same correlation structure, then we find that the kurtosis is reasonably constant as you add assets. For five assets, the portfolio kurtosis is about the same as one asset.
However, it can get much worse than this. Let us assume that our assets are uncorrelated in the low volatility state, but one random day out of a hundred not only do they have a “big day” at the same time, they’re also correlated at 75% to each other on that day.
|Number of Assets||Return||Vol||Sharpe||Kurtosis|
It’s pretty clear that we are still getting nearly all of the diversification benefit. All the assets still appear nearly uncorrelated if you measure the correlation of the samples. But the kurtosis is enormous. We’ve ended up with something that has a high Sharpe, looks like a fantastic portfolio but occasionally has extreme positive and negative moves as this graph shows.
There is a minus 5% day (17) and plus 6% day (19) in that distribution. This is exceptionally punchy for a 5% volatility portfolio. Indeed, we are getting 4 days — which we would expect once every 127 years — about once every 2 years on average. This is starting to look very much like real markets, real asset managers, and real risks. It seems as if this simple generative model of assets which look uncorrelated — but suddenly aren’t — is generating something which looks quite realistic.
To summarise, the key feature is not that assets can have big days or even that assets can be correlated: it is that correlated Big Days can cause very high kurtosis.
Can we do anything about this?
A generative model is useful in that it allows you to play with a theoretical model which has characteristics of the real world that you are trying to risk manage but without all the dirty data issues and short timeframes(18) of real financial data. But does it help us manage these risks?
In this post and the previous one we have examined fat tail events and how to deal with them. Investors are rightly unhappy with fat tail events especially when they are fat tailed losses. Investors would like their managers to reduce this risk, but since the kurtosis in markets comes from a variety of sources the approaches have to be multi-layered.
Firstly, we have shown that by scaling positions by recent volatility, one can reduce the kurtosis of most financial securities considerably. Secondly, we presented a theoretical model which would have indicated that by adding together lots of uncorrelated assets with kurtosis, the portfolio kurtosis should have negligable kurtosis. However, in reality for real managers or widely available models, this doesn’t seem to be the case. Finally, we presented a more subtle model of multivariate kurtosis. We built a set of uncorrelated Gaussian Mixture models, where the high volatility component of the mixture happens infrequently, but when it does the assets are highly correlated. It was this model which seemed to have the features of diversified asset portfolios.
Clearly this generalised model for asset returns has consequences for risk management and allocation of capital to assets. Optimal solutions under the simpler asset returns models are no longer optimal and new optimal allocations must be calculated. Ideally you would be able to forecast when these random big events are going to happen and suitably adjust your risk profile when a big event is likely to happen tomorrow. Even if that was possible for the known unknowns like FOMC announcements or central bank governors speaking, it is going to be impossible to predict events like 9/11. The difficulty is compounded by the relative infrequency of these events. Events which happen once every 100 days will only have happened around 70 times in the past 30 years. That might seem like a reasonable amount of data, but once you factor in different classifications of events (geopolitical, economic, surprise announcement etc), the number of data points you have to build a forecasting model is very small. It is also critical not to be misled by a generative model which produces plausible returns. It’s possible that this correlated Gaussian Mixture model produces very realistic results, but in fact the underlying processes in the real world are completely different.(19) If a model like this does encapsulate some of the features of the real world, then it suggests that the free lunch of diversification is in fact a more complex dining experience. There are trade-offs to be made between increasing Sharpe ratio (something which investors are rightly rather keen on), but there may be an implied and hidden cost associated with this — a higher frequency of unexpectedly large returns.
This is one of those rare cases where the complexities of real world comes to our aid. In our generative model, we assumed that each of the underlying assets was a black box return generating system with a Sharpe ratio of 1.0. In the real world, the constituents of most investors portfolios are managers or, in our case, a diversified basket of models each of which may trade 100 or more assets. We have the ability to look inside these ‘black boxes’ and analyse the commonality of the positions within the models which can help to determine those times in which fatter tails are more likely and also to manage the trade-off between higher Sharpe and higher kurtosis. For us, this is an area of very active fundamental financial research and is something that we will cover in future posts.
Risk management is a topic which comes up very frequently in our discussions and meetings with investors.(20) It is a complex multifaceted problem involving understanding “risk” on a variety of related dimensions. Volatility, fat tails, skewness, liquidity, VAR, expected shortfall, drawdown…risk of permanent impairment of capital. As we have seen, none of these issues are entirely independent from each other, which makes analysing and understanding them subtle, challenging and ultimately satisfying.