1. Introduction
Nonstationary time series are very frequently encountered in fields such as meteorology
Ivanova and Ausloos (
1999), seismology
Corral (
2004,
2006), physiology
Ashkenazy et al. (
2001);
Hausdorff et al. (
1995);
Peng et al. (
1995), and finance
Bassler et al. (
2007);
Galluccio et al. (
1997);
Liu et al. (
1997). The nonstationary mark limits the applicability of statistical methods in the quest for the process generating the series
Tsay (
2002) and may convey spurious effects in the detection of correlations
Kantelhardt et al. (
2002);
Peng et al. (
1994). Depending on the suspected cause of nonstationarity, different strategies may be applied to suppress it and to simplify the signals. For instance, various detrending procedures have been proposed
Kantelhardt et al. (
2002);
Peng et al. (
1994), although their success is not always guaranteed
Bryce and Sprague (
2012);
Chen et al. (
2002). Other approaches
Dacorogna et al. (
1996,
2001);
Corral (
2004);
Zumbach (
1997) are instead based on a reassessment of the time stamp in terms of which series is recorded. In particular, in the case of seismology, it has been advanced
Corral (
2004,
2006) in that the distribution of inter-occurrence times between earthquakes becomes invariant under rescaling of a properly redefined time: this redefinition renders inter-occurrences stationary on the basis of Omori’s aftershock law
Omori (
1894).
Finance is a field in which the very definition of time constitutes a long-standing problem, since a clock properly adjusted to financial activity is not easy to identify. Financial markets have to cope with daily human routines throughout the world and bursts and doldrums occur at various time scales. In addition, during nights, weekends, and festivities, transactions stop in most cases. These interruptions, at the end of which asset prices turn out to have changed anyhow, together with the recurrent patterns in the activity (seasonalities), immediately signal the inadequacy of “natural” physical time as the appropriate one in terms of which to describe the stochastic evolution of financial markets. Thus, various alternatives have been studied
Clark (
1973);
Dacorogna et al. (
2001);
Ané and Geman (
1999);
Gillemot et al. (
2006);
Jensen et al. (
2004);
Mantegna and Stanley (
2000), including trading time, volume time, and tick-by-tick time. A time definition (theta-time) has also been put forward with the specific intent of removing the seasonalities in financial time series
Dacorogna et al. (
1996,
2001);
Zumbach (
1997). Theta-time is designed to record the progress of market activity through the increase of the volatility as measured by the average absolute return. For the latter, a power-law behavior is assumed as a function of physical time. However, theta-time suffers some limitations. In particular, as in most theoretical studies, inactive market periods are cut from the analyzed data set, and the corresponding returns ignored
Dacorogna et al. (
2001), with the consequence that the correspondence between the sum of returns and the real asset price is destroyed. On the other hand, the alternative of assigning an arbitrary time interval to periods of market inactivity would preclude the possibility of establishing a meaningful link between returns and financial time. In addition, a discussion of the effectiveness of theta-time in enforcing the stationarity of the returns probability density function (PDF) has yet to be addressed.
In summary, the appropriate duration to be ascribed to overnight and similar returns remains an open issue, and in general, there is little focus on the requirement that increments over intervals of equal span should be identically distributed in order to make standard empirical sampling (e.g., on intervals sliding along the historical series of an asset) applicable. This, for instance, is a difficulty in studies of return predictability
Potí (
2018), which rely on regression analysis. More generally, one can say that establishing a consistent time scale for the financial series is a requisite for studies aimed at assessing the efficient market hypothesis or to model leverage effects in portfolio risk evaluation.
Within the context of time series affected by cyclic nonstationary behavior in finance (see, e.g.,
Gardner et al. (
2006)), we show here that one can promote the approximate fulfillment of a statistical scaling symmetry as an operative criterion for the definition of a novel time scale. Our basic results are that: (i) the returns distribution becomes stationary if recorded over the novel time scale; (ii) we identify a proper time duration to be attributed to the overnight returns; (iii) we are able to properly include overnight returns in the time series sequence. In terms of this time scale, empirical sampling on sliding time intervals bridging intraday and interday durations become possible.
2. Scaling and Hurst Exponent
Simple scaling is an approximate scale invariance revealed when cumulative distribution functions of returns of certain financial indexes over different time intervals are compared. Within specific ranges, this symmetry is reasonably well satisfied by the distributions of logarithmic interday returns of various indexes and assets. If we indicate the return
r over the time span
(here,
is an integer number of days) as the log-difference of the asset price value
s,
, this means that the empirical cumulative distribution functions (CDFs) of
r over different
’s can be approximately collapsed onto each other in force of the scaling law
where
H is called the Hurst exponent in view of his seminal work on the Nile floods
Hurst (
1951), and
G, whose corresponding PDF is not Gaussian, is a scaling function. The estimation of
H for various markets has been addressed, e.g., in Reference
Di Matteo et al. (
2005);
Galluccio et al. (
1997). The Hurst exponent is also known to be very close to
for indexes of mature markets, a value which we will assume in our analysis. Collapse means that the plots of the CDF’s of returns
and
for intervals
and
, respectively, coincide if, e.g., the first is plotted as a function of
and the latter as a function of
. The deceptive simplicity of Equation (
1) already suggests the strategy we are going to follow: Assuming the value
, we will argue the appropriate financial duration of a given interval as that which best allows extending the validity of Equation (
1). This procedure will be followed to quantify especially the financial duration of periods of market inactivity or of intraday intervals, which are affected by cyclic nonstationarity.
Equation (
1) is approximately valid with
and the
G function characterizing the market, for intervals of integer numbers of days in physical time. It thus establishes a simple bi-univocal correspondence between return CDF’s and interval durations. Stationarity of the CDF’s for such intervals qualifies it as a proper clock for financial activity. Indeed, the collapse protocol outlined above allows one to argue the ratios of durations for such intervals. However, Equation (
1) definitely does not hold at the intraday time scales if intervals are measured in physical time. In the intraday, nonstationarities affect the CDFs for returns defined over intervals with equal physical time duration
Allez and Bouchaud (
2011);
Bassler et al. (
2007);
Galluccio et al. (
1997), and
H can be very different from
in selected time windows
Baldovin et al. (
2015);
Bassler et al. (
2007). Due to nonstationarity, these intraday determinations of
H are based on sampling the return CDF from corresponding intervals selected from an ensemble of daily histories and not by letting the intervals slide along the time series. Furthermore, even in the interday domain, multiscaling effects
Borland et al. (
2005);
Calvet and Fisher (
2002);
Di Matteo (
2007) and a slow crossover to Gaussianity of the return PDF
Cont (
2001) prevent from a fully satisfactory realization of the collapses implied by Equation (
1), especially for
exceeding a few days. In spite of these limitations, we show below that the outlined program can be carried out to a satisfactory approximation on time scales spanning from a few minutes to a few days, thus allowing the definition of a financial scaling time (FST) within this range.
Some aspects of our method have interesting precursors in the history of financial markets analysis. Indeed, stimulated by the discovery of fat tails
Cont (
2001);
Mandelbrot and Taylor (
1967);
Mantegna and Stanley (
1995), previous studies
Ané and Geman (
2000);
Clark (
1973);
Mandelbrot (
1973) attempted to modify the returns PDFs through the introduction of a stochastic redefinition of time, in order to cast them into Gaussians. Here, no reference to probabilistic models is made, and our FST is not stochastic. However, similar to these earlier studies, our time redefinition is intended to alter some fundamental property of the returns PDFs: in our case, it enforces a simple scaling symmetry.
Our construction does not amount to the proposal of a specific parameter-dependent model of market dynamics. However, one should mention that the simple scaling expressed by Equation (
1) has been recently assumed as a main modeling ingredient of the stochastic dynamics of financial indexes
Baldovin and Stella (
2007);
Baldovin et al. (
2015);
Peirano and Challet (
2012);
Stella and Baldovin (
2010);
Zamparo et al. (
2013). Combining scaling with ideas of fine graining inspired by the renormalization group approach of statistical mechanics
Kadanoff (
2005), these models reproduce many stylized facts exposed by interday time series analysis, including some multiscaling effects.
3. FST Definition and Construction
Our basic ansatz is that, once expressed in terms of FST, Equation (
1) turns into a scaling law with
:
where
represents the FST duration of an interval, and
G is a suitable scaling function. For simplicity, we denote by the same symbols used in Equation (
1) the CDF of returns and the scaling function expressed as functions of
. As it will be shown below, we are able to extend the validity of this scaling law in both intra-and interday regimes.
The CDF in Equation (
2) may be used to express expectation values, which are empirically based on sampling determinations. Concerning the sampling, we will operate differently for interday and intraday intervals. For
’s corresponding to an integer number of days in physical time, we will sample from intervals of equal duration sliding along the historical series. In the intraday regime, the expectation values will be initially based on sampling over corresponding intervals in an ensemble of daily histories. Indeed, in spite of the manifest nonstationarities at the intraday level
Allez and Bouchaud (
2011);
Bassler et al. (
2007);
Dacorogna et al. (
2001);
Galluccio et al. (
1997), the process of return aggregation appears compatible with the assumption of one-day ciclostationarity
Gardner et al. (
2006). In other words, the evolution of the aggregate return from the opening during each market day can be regarded as a realization of the same stochastic process
Baldovin et al. (
2015);
Bassler et al. (
2007);
Dacorogna et al. (
2001), and empirical averages can be taken over correspondent time windows within different days.
Before describing our method, some remarks concerning the implications of Equation (
2) and the choice of intervals are necessary: Treating
r as a continuous variable, the
q-th moment of the absolute return in an interval of duration
is clearly given by
where
is an amplitude depending on
q and
g, the derivative of
G or the PDF of rescaled returns.
1 In particular, for
, we obtain
. If we consider an interval of duration
, the aggregated return
must satisfy
which implies
since
. Thus, the validity of Equation (
2) and time additivity impose a linear uncorrelation between returns in successive intervals of time. This linear uncorrelation, which does not imply independence
Cont (
2001), corresponds to the martingale property for the stochastic process defining the price formation
Bouchaud and Potters (
2003) and is generally valid if the market is efficient and the interval duration is above a few minutes
Bouchaud and Potters (
2003);
Cont (
2001). Therefore, in the choice of intervals for which we will define FST by means of collapses, we will not consider durations so short to imply sensible violations of the condition in Equation (
5). Another obvious limitation when considering markets with interruptions of activity is, for example, that the interval between closure and opening in the following day (overnight) can not be split into shorter intervals due to the absence of intermediate price determinations.
To best fit the collapses implied by Equation (
2), we apply the Kolmogorov–Smirnov (KS) two-sample test
DeGroot (
2010). Assuming the CDF of the daily return as that corresponding to a unit of FST (
), for any other chosen interval, the appropriate
will be determined as the value realizing the optimal collapse of its CDF with the reference unit one as implied by (
2). Indeed, the KS test allows to identify this value of
as that which minimizes the distance between the reference CDF and the one under consideration plotted as a function of
. This distance will be discussed in the next section.
The FST construction proceeds as follows: on the physical time axis, we first operate a partition into intervals within the intraday consistent with the one-day periodicity. To do so, the time between opening and closure in each trading day is split into a certain number of basic intervals with an identical physical time duration
long enough to guarantee approximate satisfaction of Equation (
5) for returns in contiguous intervals. In such a way, the whole daily time span open-to-open can be viewed as the union of these consecutive basic intervals plus the last one coinciding with the full overnight (or weekend) closure. Other time intervals, either within or exceeding the duration of one day, are similarly constructed as the appropriate union of consecutive intraday basic intervals and overnights. To associate an adequate FST-duration to each interval with extremes belonging to the above partition, we proceed by application of the KS test to each elementary interval chosen in the intraday and to the overnight interval. The values of
assigned to such intervals allow mapping the partition of the physical time axis onto the FST
axis.
The above partition of the physical time axis is conveniently specified in terms of a set of time marks
, where
is trading day and
is a multiple of
within the open market phase of each trading day. As shown in the next section, the application of the KS test identifies the duration in FST of the various
intervals occurring during the day,
, and overnight,
. Note that the CDF of returns in the intraday and overnight are sampled from the ensemble of daily histories. Specifically, the FST corresponding to this partition of the physical time axis is then defined in such a way that intervals on the
axis have duration
where
is the FST partition corresponding to
, and
. Notice that in Equation (
6), consistently with the following analysis, the FST duration of over-weekends and other periods of market closure is assumed to be equal to that of overnights.
The above partition of the
axis is thus obtained by imposing an optimal scaling collapse between the CDF of each individual interval chosen on the
t axis and that of the reference open-to-open interval. Such collapses are not full, since scaling is not obeyed exactly. At the same time, also in view of the fact that returns on contiguous intervals are not strictly uncorrelated, one can not expect additivity of
durations to be strictly satisfied as if Equation (
2) would be exactly obeyed. Therefore, for example, we can anticipate that adding the two
’s corresponding to a full daily opening and to overnight will not give exactly 1 as a result. Then, for a consistency assessment, the KS test is also applied to other intraday intervals belonging to the partition, as well as to multi-day intervals, i.e., intervals of two, three, and more days.
2 In the latter case, the
scale differs much less from the
t scale due to the fact that scaling is already reasonably well satisfied in physical time.
4. Methods and Results
Here, we describe in detail the implementation of the above ideas to the S&P500 index, recorded at 1-min frequency between 9:40 and 16:00 New York time from September 1985 to June 2013.
3 After excluding those days for which the records are not complete (e.g., holidays and market anticipated closures or delayed openings), the data set includes
trading days. For each single day
l (
), there is a total of 381 index values
(
, with
). Thus, a first partition of the physical-time axis is conveniently identified as
, and in terms of this partition, returns are defined as
, with
. In order to work with zero empirical averages, returns are detrended:
, where the average is calculated over all possible returns with the same
n,
and same
, with
. In this notation, overnight returns are indicated as
and they include single night returns, returns over weekends, returns over holidays and other market closures. Closures due to specific market reasons should, in principle, be treated differently from weekends and normal holidays (e.g., several days following the black Monday 1987 are missing in the dataset because of anticipated market closure). Because we verified that they introduce only minor effects, we do not make such distinctions in our analysis.
In the identification of the FST duration of the various intervals, a key step consists in checking whether two random variables
X and
Y are identically distributed. A basic tool to establish this is the KS two-sample test
DeGroot (
2010). Given two samples
,
, with empirical cumulative distribution functions
,
, respectively, the test is based on the rescaled supremum of the distributions difference:
In cases in which the samples refer to independent variables, the test can rely on the KS theorem to assign a significance level to any
, under the null hypothesis that
,
come from the same distribution. If, however, samples are extracted from the same time series and data display long-range dependence, such as in the case examined in the present study, the mapping between
and the associated significance level is expected to be strongly affected
Chicheportiche and Bouchaud (
2011), and only model-dependent statements can be produced. Still,
quantifies how close the distributions
and
are.
In
Table 1, we report
calculated for different
Y samples relative to various intervals in physical time, once the one-day open-to-open return distribution is taken as the reference
X sample. The table also displays the values of the FST duration,
, and of
for the chosen
Y-samples obtained as follows. Within a variational strategy, we rescale each
Y-sample as
according to Equation (
2), and obtain the FST duration of the interval as the value
of
for which
reaches its minimal value
. We notice that the
values are all rather large in comparison to the corresponding
. While entries for 10, 20, 38 min in
Table 1 refer to averages over the whole day of intervals of the respective duration, as explained below,
Figure 1 details the values of
for all individual 20-min intervals from day opening to closure.
Indeed, the KS two-sample test enables us to determine the duration in FST of every physical time interval occurring during a day. For instance, if we split the trading day in 19 contiguous intervals with a physical time duration of 20-min and we call
(
) their duration in terms of FST, the first 20-min interval of the day corresponds to
, whereas twenty minutes at half and at the end of the day amount to
and
, respectively. We can also establish the duration of an overnight interval in the FST:
. As a side note, we observe that, in terms of FST, overnights are much shorter than trading days. Notice also that, assuming Equation (
2), returns over different intervals with the same duration in terms of FST are by construction identically distributed.
A qualitative inspection of how the FST definition emphasizes the simple scaling properties of the empirical CDFs is offered in
Figure 2. The comparison of
Figure 2a with
Figure 2b makes it evident that only in terms of the FST the data-collapse implied by Equation (
2) can be assumed to hold to a reasonable approximation from 20 min up to a few days.
Table 1 allows to further clarify our approach: Equation (
2), if amounting to a fully satisfied scaling relation, would imply values of
smaller than the threshold set by the significance level chosen in the KS test for every considered
Y-sample. Such possibility is prevented by the fact that even in the interday range Equation (
2) can not be satisfied completely due to both, multiscaling effects and a slow crossover to Gaussianity for the pdf,
g, of the returns. The crossover to Gaussianity is due to the progressive loss of dependence of successive returns
Cont (
2001). However,
Table 1 suggests that Equation (
2) can be considered approximately valid in a time window ranging from about 20 min to a few days. To provide a comparison, if in place of the one-day distribution we take a zero-average Gaussian, the KS analysis would return
values between
and
, with the lower values corresponding to
y’s associated with longer time intervals, as one would expect in view of the slow crossover to Gaussianity.
Results (again, see
Table 1) suggest that for the 10-min returns, the approximate scaling relation of Equation (
2) is not well satisfied. Indeed, for this specific dataset (including entries of thirty years ago) a duration
over which contiguous returns can be considered linearly uncorrelated corresponds to 20 min (See
Appendix B for details). However, note that, due to the impact of information technology on trading practice, the more recent the data are, the lower the threshold can be
Bouchaud and Potters (
2003). Thus, the 10 min interval is excluded from the FST window, and a more convenient partition of time can be indicated as
, where index
l is run over the days
and the index
m over the 20-min intervals in which the trading day is split,
.
In
Figure 3a, we outline the FST pattern resulting from such physical time partition.
,
. The result is obtained by averaging over all possible days in the dataset.
Figure 3a highlights the non-linear character of the FST vs. the physical one. If, reversely, one plots equally-spaced contiguous
-intervals as a function of the physical time (
Figure 3b), it becomes evident that the FST runs faster at the beginning and at the end of the open market day and slower at noon, New York time.
We already anticipated that, since Equations (
2) and (
5) are not exactly satisfied, one should not expect the FST scale, such as that reported in
Figure 3, to be fully consistent as far as additivity properties are concerned. For our choice
, we verified that the difference between the
’s guaranteeing the best data collapse according to Equation (
2) and those obtained by simply assuming additivity, does in general not exceed 25% of the value of the scaling factor itself. For instance,
. Furthermore, consistently with the positive value of linear correlations, this difference is typically positive, with exceptions for some intervals, including overnights, possibly due to a negative correlation between overnights and the first 20 min returns. Notice that it is because of this discrepancy that in
Figure 3a the values for
are not exact integers.
5. Further Results and Applications
Volatility is a central quantity in financial practice, assessing the intensity of market fluctuations. In physical time it may be defined as
, and in FST as
, where
(
) is the return in an interval of duration
(
) starting at
t (
). Clear evidence of the nonstationarity of returns in the physical time scale is given by the characteristic “U” shape assumed by the intraday volatility defined over the day-by-day ensemble of returns
Admati and Pfleiderer (
2015);
Andersen and Bollerslev (
1997);
Baldovin et al. (
2015);
Bassler et al. (
2007) (see
Figure 4a). Once analyzed in terms of FST, volatility stationarity is instead sensibly recovered. Indeed, the plot in (
Figure 4b) is obtained after performing a partition into 19 intervals of equal FST duration (
) of the open market period. One clearly sees that the variation of this volatility is much less pronounced, as a sign of partial restoration of stationarity.
The availability of a time series with almost stationary increments, thus strongly reducing the seasonalities appearing in the physical time scale, conveys the methodological advantage that empirical analyses can be performed through ordinary, sliding-window techniques in the intraday regime, hence extending the sample size considerably in favor of higher statistical confidence. An example of how this extension can be performed is offered by the equal
intervals identified for the plot in
Figure 4b. For an interval of FST duration equal to
, the sampling of CDF can be based on a data set 19 times larger than that available for intraday
t-intervals.
Let us now recall that Equation (
2) straightforwardly descends
In the presence of a general Hurst exponent
, such as in Equation (
1), one gets instead
A consequence of Equation (
9) is that a log–log plot of the
qth-moment
vs.
should be a straight line with slope
. Therefore, the empirical analysis of the returns’ moment as a function of the time interval duration is particularly revealing about the meaning of the FST. As a matter of fact,
Figure 5 highlights that the recurrent patterns deviating from straight behavior in physical time (
Figure 5a) are wiped out in the FST (
Figure 5b).
As already mentioned in the introduction, one of the factors preventing a full satisfaction of scaling, i.e., an equation such as Equation (
2) either in physical time or in FST, is the presence of multiscaling. This means that the Hurst exponent is not strictly constant but might show a certain dependence on
q, especially when analyzing the behavior as a function of
t or of
of relatively high order moments of the absolute return. Therefore, multiscaling may be easily detected by analyzing as a function of
q the slopes of the log–log plots in
Figure 5. Quite remarkably, multiscaling features that manifest in physical time (
Figure 6a) are still present but become much less pronounced when the FST is adopted (
Figure 6b). Since our definition of FST emphasizes simple scaling, there should be no surprise in realizing this attenuation of multiscaling effects.
Thus far, results concern single-point statistics. In finance, a fundamental two-point indicator is the volatility autocorrelation function
Cont (
2001);
Bouchaud and Potters (
2003);
Dacorogna et al. (
2001);
Mantegna and Stanley (
2000), i.e., the correlation between absolute values of two returns at a given lag. At variance with the linear one, the volatility autocorrelation is known to decay very slowly with the time lag
Cont (
2001);
Bouchaud and Potters (
2003);
Dacorogna et al. (
2001);
Mantegna and Stanley (
2000). In
Figure 7a, the
volatility autocorrelation at lag
t,
,
4 is plotted; while
Figure 7b reports the analogous quantity in FST,
, for
. In both cases, the autocorrelation has been evaluated through a sliding-window procedure. The ciclostationary quality of
in physical time is a direct consequence of the “U” shape in
Figure 4a. It reveals that the lack of stationarity in the intraday prevents a successful application of sliding interval sampling. In contrast, in FST-lag, the periodic structure of
is almost completely removed, pointing out the methodological advantage in such an empirical estimation. It is interesting to notice that if we try to fit the decay of the volatility autocorrelation in FST with a power law of the form
,2 we obtain a power law exponent
, in agreement with what is expected from the study of stylized facts in finance
Cont (
2001).