This paper details a method for extreme value prediction on the basis of a sampled time series. The method is specifically designed to account for statistical dependence between the sampled data points in a precise manner. In fact, if properly used, the new method will provide statistical estimates of the exact extreme value distribution provided by the data in most cases of practical interest. It avoids the problem of having to decluster the data to ensure independence, which is a requisite component in the application of, for example, the standard peaks-over-threshold method. The proposed method also targets the use of subasymptotic data to improve prediction accuracy. The method will be demonstrated by application to both synthetic and real data. From a practical point of view, it seems to perform better than the POT and block extremes methods, and, with an appropriate modification, it is directly applicable to nonstationary time series. 1. Introduction Extreme value statistics, even in applications, are generally based on asymptotic results. This is done either by assuming that the epochal extremes, for example, yearly extreme wind speeds at a given location, are distributed according to the generalized (asymptotic) extreme value distribution with unknown parameters to be estimated on the basis of the observed data [1, 2]. Or it is assumed that the exceedances above high thresholds follow a generalized (asymptotic) Pareto distribution with parameters that are estimated from the data [1–4]. The major problem with both of these approaches is that the asymptotic extreme value theory itself cannot be used in practice to decide to what extent it is applicable for the observed data. And since the statistical tests to decide this issue are rarely precise enough to completely settle this problem, the assumption that a specific asymptotic extreme value distribution is the appropriate distribution for the observed data is based more or less on faith or convenience. On the other hand, one can reasonably assume that in most cases long time series obtained from practical measurements do contain values that are large enough to provide useful information about extreme events that are truly asymptotic. This cannot be strictly proved in general, of course, but the accumulated experience indicates that asymptotic extreme value distributions do provide reasonable, if not always very accurate, predictions when based on measured data. This is amply documented in the vast literature on the subject, and good references to this literature are [2, 5, 6]. In an effort to
References
[1]
S. Coles, An Introduction to Statistical Modeling of Extreme Values, Springer Series in Statistics, Springer, London, UK, 2001.
[2]
J. Beirlant, Y. Goegebeur, J. Teugels, and J. Segers, Statistics of Extremes, Wiley Series in Probability and Statistics, John Wiley & Sons, Chichester, UK, 2004.
[3]
A. C. Davison and R. L. Smith, “Models for exceedances over high thresholds,” Journal of the Royal Statistical Society. Series B. Methodological, vol. 52, no. 3, pp. 393–442, 1990.
[4]
R.-D. Reiss and M. Thomas, Statistical Analysis of Extreme Values, Birkh?user, Basel, Switzerland, 3rd edition, 1997.
[5]
P. Embrechts, C. Klüppelberg, and T. Mikosch, Modelling Extremal Events, vol. 33 of Applications of Mathematics (New York), Springer, Berlin, Germany, 1997.
[6]
M. Falk, J. Hüsler, and R.-D. Reiss, Laws of Small Numbers: Extremes and Rare Events, Birkh?user, Basel, Switzerland, 2 nd edition, 2004.
[7]
A. Naess and O. Gaidai, “Monte Carlo methods for estimating the extreme response of dynamical systems,” Journal of Engineering Mechanics, vol. 134, no. 8, pp. 628–636, 2008.
[8]
A. Naess, O. Gaidai, and S. Haver, “Efficient estimation of extreme response of drag-dominated offshore structures by Monte Carlo simulation,” Ocean Engineering, vol. 34, no. 16, pp. 2188–2197, 2007.
[9]
R. L. Smith, “The extremal index for a Markov chain,” Journal of Applied Probability, vol. 29, no. 1, pp. 37–45, 1992.
[10]
S. G. Coles, “A temporal study of extreme rainfall,” in Statistics for the Environment 2-Water Related Issues, V. Barnett and K. F. Turkman, Eds., chapter 4, pp. 61–78, John Wiley & Sons, Chichester, UK, 1994.
[11]
R. L. Smith, J. A. Tawn, and S. G. Coles, “Markov chain models for threshold exceedances,” Biometrika, vol. 84, no. 2, pp. 249–268, 1997.
[12]
S. Yun, “The extremal index of a higher-order stationary Markov chain,” The Annals of Applied Probability, vol. 8, no. 2, pp. 408–437, 1998.
[13]
S. Yun, “The distributions of cluster functionals of extreme events in a dth-order Markov chain,” Journal of Applied Probability, vol. 37, no. 1, pp. 29–44, 2000.
[14]
J. Segers, “Approximate distributions of clusters of extremes,” Statistics & Probability Letters, vol. 74, no. 4, pp. 330–336, 2005.
[15]
G. S. Watson, “Extreme values in samples from -dependent stationary stochastic processes,” Annals of Mathematical Statistics, vol. 25, pp. 798–800, 1954.
[16]
M. R. Leadbetter, G. Lindgren, and H. Rootzén, Extremes and Related Properties of Random Sequences and Processes, Springer Series in Statistics, Springer, New York, NY, USA, 1983.
[17]
D. E. Cartwright, “On estimating the mean energy of sea waves from the highest waves in a record,” Proceedings of the Royal Society of London. Series A, vol. 247, pp. 22–28, 1958.
[18]
A. Naess, “On the long-term statistics of extremes,” Applied Ocean Research, vol. 6, no. 4, pp. 227–228, 1984.
[19]
G. Schall, M. H. Faber, and R. Rackwitz, “Ergodicity assumption for sea states in the reliability estimation of offshore structures,” Journal of Offshore Mechanics and Arctic Engineering, vol. 113, no. 3, pp. 241–246, 1991.
[20]
E. H. Vanmarcke, “On the distribution of the first-passage time for normal stationary random processes,” Journal of Applied Mechanics, vol. 42, no. 1, pp. 215–220, 1975.
[21]
M. R. Leadbetter, “Extremes and local dependence in stationary sequences,” Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete, vol. 65, no. 2, pp. 291–306, 1983.
[22]
T. Hsing, “On the characterization of certain point processes,” Stochastic Processes and Their Applications, vol. 26, no. 2, pp. 297–316, 1987.
[23]
T. Hsing, “Estimating the parameters of rare events,” Stochastic Processes and Their Applications, vol. 37, no. 1, pp. 117–139, 1991.
[24]
M. R. Leadbetter, “On high level exceedance modeling and tail inference,” Journal of Statistical Planning and Inference, vol. 45, no. 1-2, pp. 247–260, 1995.
[25]
C. A. T. Ferro and J. Segers, “Inference for clusters of extreme values,” Journal of the Royal Statistical Society. Series B, vol. 65, no. 2, pp. 545–556, 2003.
[26]
C. Y. Robert, “Inference for the limiting cluster size distribution of extreme values,” The Annals of Statistics, vol. 37, no. 1, pp. 271–310, 2009.
[27]
A. Naess and O. Gaidai, “Estimation of extreme values from sampled time series,” Structural Safety, vol. 31, no. 4, pp. 325–334, 2009.
[28]
P. E. Gill, W. Murray, and M. H. Wright, Practical Optimization, Academic Press Inc, London, UK, 1981.
[29]
W. Forst and D. Hoffmann, Optimization—Theory and Practice, Springer Undergraduate Texts in Mathematics and Technology, Springer, New York, NY, USA, 2010.
[30]
N. R. Draper and H. Smith, Applied Regression Analysis, Wiley Series in Probability and Statistics: Texts and References Section, John Wiley & Sons, New York, NY, USA, 3rd edition, 1998.
[31]
D. C. Montgomery, E. A. Peck, and G. G. Vining, Introduction to Linear Regression Analysis, Wiley Series in Probability and Statistics: Texts, References, and Pocketbooks Section, Wiley-Interscience, New York, NY, USA, Third edition, 2001.
[32]
A. Naess, Estimation of Extreme Values of Time Series with Heavy Tails, Preprint Statistics No. 14/2010, Department of Mathematical Sciences, Norwegian University of Science and Technology, Trondheim, Norway, 2010.
[33]
E. F. Eastoe and J. A. Tawn, “Modelling the distribution of the cluster maxima of exceedances of subasymptotic thresholds,” Biometrika, vol. 99, no. 1, pp. 43–55, 2012.
[34]
J. A. Tawn, “Discussion of paper by A. C. Davison and R. L. Smith,” Journal of the Royal Statistical Society. Series B, vol. 52, no. 3, pp. 393–442, 1990.
[35]
A. W. Ledford and J. A. Tawn, “Statistics for near independence in multivariate extreme values,” Biometrika, vol. 83, no. 1, pp. 169–187, 1996.
[36]
J. E. Heffernan and J. A. Tawn, “A conditional approach for multivariate extreme values,” Journal of the Royal Statistical Society. Series B, vol. 66, no. 3, pp. 497–546, 2004.
[37]
Numerical Algorithms Group, NAG Toolbox for Matlab, NAG, Oxford, UK, 2010.
[38]
E. J. Gumbel, Statistics of Extremes, Columbia University Press, New York, NY, USA, 1958.
[39]
K. V. Bury, Statistical Models in Applied Science, Wiley Series in Probability and Mathematical Statistics, John Wiley & Sons, New York, NY, USA, 1975.
[40]
B. Efron and R. J. Tibshirani, An Introduction to the Bootstrap, vol. 57 of Monographs on Statistics and Applied Probability, Chapman and Hall, New York, NY, USA, 1993.
[41]
A. C. Davison and D. V. Hinkley, Bootstrap Methods and Their Application, vol. 1 of Cambridge Series in Statistical and Probabilistic Mathematics, Cambridge University Press, Cambridge, UK, 1997.
[42]
J. Pickands, III, “Statistical inference using extreme order statistics,” The Annals of Statistics, vol. 3, pp. 119–131, 1975.
[43]
A. Naess and P. H. Clausen, “Combination of the peaks-over-threshold and bootstrapping methods for extreme value prediction,” Structural Safety, vol. 23, no. 4, pp. 315–330, 2001.
[44]
N. J. Cook, The Designer’s Guide to Wind Loading of Building Structures, Butterworths, London, UK, 1985.
[45]
N. J. Cook, “Towards better estimation of extreme winds,” Journal of Wind Engineering and Industrial Aerodynamics, vol. 9, no. 3, pp. 295–323, 1982.
[46]
A. Naess, “Estimation of long return period design values for wind speeds,” Journal of Engineering Mechanics, vol. 124, no. 3, pp. 252–259, 1998.
[47]
J. P. Palutikof, B. B. Brabson, D. H. Lister, and S. T. Adcock, “A review of methods to calculate extreme wind speeds,” Meteorological Applications, vol. 6, no. 2, pp. 119–132, 1999.
[48]
O. Perrin, H. Rootzén, and R. Taesler, “A discussion of statistical methods used to estimate extreme wind speeds,” Theoretical and Applied Climatology, vol. 85, no. 3-4, pp. 203–215, 2006.
[49]
M. E. Robinson and J. A. Tawn, “Extremal analysis of processes sampled at different frequencies,” Journal of the Royal Statistical Society. Series B, vol. 62, no. 1, pp. 117–135, 2000.
[50]
A. Naess, A Note on the Bivariate ACER Method, Preprint Statistics No. 01/2011, Department of Mathematical Sciences, Norwegian University of Science and Technology, Trondheim, Norway, 2011.