/

Quantitative Return Forecasting

2. Traditional time series analysis

[this page | pdf | references | back links]

Return to Abstract and Contents

Next page

 

2.1          Consider first a situation where we only have one time series where we are attempting to forecast future values from observed past values. For example, the time series followed by a given variable might be governed by the following relationship, where the value at time t of the variable is denoted by  where  is constant.

 

2.2          This is a linear first order difference equation. A difference equation is an expression relating a variable  to its previous values. The above equation is first order because only the first lag () appears on the right hand side of the equation. It is linear because it expresses  as a linear function of  and the innovations .  are often treated as random variables, but we do not always need to do this.

 

2.3          Such a model of the world is also an autoregressive model, with a unit time lag and is therefore typically referred to as an  model. It is also time stationary, since  is constant. Nearly all linear time series analysis assumes time invariance. We could however introduce secular changes by assuming one of the variables on which the time series is based is a dummy variable linked to time. An example commonly referred to in the quantitative investment literature is a dummy variable set equal to 1 in January but 0 otherwise, to identify whether there is any ‘January’ effect.

 

2.4          If we know the value  at  then we find using recursive substitution that .                 We can also determine the effect of each individual   on, say, , the value of  that is  time periods further into the future value than . This is sometimes called the dynamic multiplier . If  then such a system is stable, in the sense that the consequences of a given change in  will eventually die out. It is unstable if . An interesting possibility is the borderline case where , when the output variable  is the sum of its initial starting value and historical inputs.

 

2.5          We can generalise the above dynamic system to be a linear p’th order difference equation by making it depend on the first  lags along with the current value of the innovation (input value) ,  i.e. . This can be rewritten in vector/matrix form as a first order difference equation, but relating to a vector, if we define the vector as follows:

 

 

2.6          These sorts of dynamic systems have richer structures than simple scalar difference equations. For a p’th order equation we have:  (if  is the element in the i’th row and k’th column of ). To analyse the characteristics of such a system in more detail, we first need to identify the eigenvalues of . These are the values of  for which  where  is the identity matrix. They are the roots to the following equation:

 

 

2.7          A p’th order equation such as this always has p roots, but some of these may be complex numbers rather than real ones, even if (as would be the case in practice for investment time series) all the  are real numbers. Complex roots correspond to cyclical (sinusoidal) behaviour. We can therefore have combinations of exponential decay, exponential growth and sinusoidal (perhaps damped or inflating) behaviour. For such a system to be stable we require all the eigenvalues  to satisfy , i.e. for their absolute values all to be less than unity.

 

2.8          Eigenvalues are closely associated with principal components analysis. All non-negative definite symmetric  matrices, , will have  non-negative eigenvalues  and associated eigenvectors  (the eigenvectors can sometimes be degenerate) that satisfy . The eigenvalues can be the same in which case the eigenvectors can be degenerate. The eigenvectors are orthogonal (or can be chosen to be orthogonal if they are degenerate), so that any n-vector  can be written as .

 

2.9          The principal components are the eigenvectors of the relevant covariance matrix corresponding to the largest eigenvalues, since they explain the greatest amount of variance when averaged over all possible positions. This is because . There is no fundamental reason why all stocks should be given equal weight in this averaging process. Different weighting schemas result in different vectors being deemed ‘principal’.

 


NAVIGATION LINKS
Contents | Prev | Next


Desktop view | Switch to Mobile