REGION-OF-INFLUENCE APPROACH TO THE FREQUENCY ...

1 downloads 0 Views 250KB Size Report
Nov 26, 2002 - A regional frequency analysis of heavy k-day precipitation totals (k = 1 to 7 days) ..... cumulative distribution function, the so-called growth curve.
REGION-OF-INFLUENCE APPROACH TO THE FREQUENCY ANALYSIS OF HEAVY k-DAY PRECIPITATION AMOUNTS L. Gaál 1, J. Kyselý 2, P. Štěpánek 3 1

Czech Hydrometeorological Institute, Prague, [email protected] Institute of Atmospheric Physics AS CR, Prague, [email protected] 3 Czech Hydrometeorological Institute, Brno, [email protected] 2

Abstract A regional frequency analysis of heavy k-day precipitation totals (k = 1 to 7 days) has recently been carried out in the Czech Republic by means of the L-moment-based regionalization approach proposed by Hosking and Wallis. This method has generally been considered a kind of standard in the field of the frequency analysis since the early 1990s worldwide. However, there are other frequently referred regionalization techniques in the literature, and one of the most remarkable methods is the region-ofinfluence (ROI) approach. Its main idea is that there is no need to delineate fixed boundaries between the proposed regions but rather each site can have its own “region” – a group of sites that are sufficiently similar to the site of interest. The ROI method has been proposed in order to overcome possible inconsistencies that may occur on boundaries of fixed homogeneous regions. In such cases, a classical approach of the regional analysis may lead to undesirable step changes of variables and estimated quantiles; such problems are eliminated in the ROI method by defining a unique set of stations (a “region of influence”) for each site under study. The ROI method has up to now been used predominantly in connection with a flood frequency analysis; the current study is one of the first attempts to develop the ROI approach to the frequency analysis of precipitation extremes. A comparison of the results with findings of the “standard” regional frequency analysis methods in the area of the Czech Republic is performed. Key words: regional frequency analysis, region-of-influence approach, extreme precipitation events, Czech Republic 1

Introduction

Information on design values (quantiles) of extreme precipitation is inevitable at various fields of human activities: in water resources research, designing dam and sewer systems, flood protection, soil and vegetation loss protection in case of intensive convective storms etc. In a traditional “at-site” approach to a frequency analysis, the desired precipitation quantiles have long been estimated only using the data sample at the site of interest. There is, however, a fundamental drawback of this singlesite approach: in practice, one often needs design values corresponding to return periods that are much larger than lengths of available series of observations. In order to overcome such problems, the so called regional frequency analysis came along in the 1960s (Dalrymple, 1960) and has gained wider popularity since the 1980s (e.g. Wiltshire, 1986; Lettenmaier et al., 1987; Hosking and Wallis, 1993). The core idea of the regional approach is a substitution of time extent for space: based on a multi-site analysis one can obtain more reliable quantile estimates than using a classical, at-site approach. The ultimate question of the regional frequency analysis is therefore the following one: How to draw the regions? Recently, the majority of the regional precipitation (or flood) frequency analyses have been based on fixed regions drawn according to political (Gellens, 2002; Pilon et al., 1991; Adamowski et al., 1996), geographical (Sveinsson et al., 2002; Kohnová et al., 2005), hydrological and/or climatological considerations (Smithers and Schulze, 2001; Kjeldsen et al., 2002; Fowler and Kilsby, 2003). A generally accepted guideline to regional frequency analysis is the so-called regional L-moment algorithm (Hosking and Wallis, 1997), which is also based on the delineation of fixed regions.

The region-of-influence (abbr. ROI) method introduced by Burn (1990) is an alternative approach to the regional frequency analysis. The main idea of the technique is that there is no need to delineate fixed boundaries between the proposed regions but rather the regions are defined in a flexible way. It means that each site can have its own “region”, that is, a unique set of sufficiently similar stations, from which extreme precipitation information is transferred to the site of interest. The similarity of sites is evaluated by a properly chosen set of site attributes (site characteristics and/or site statistics; see Section 3). Therefore, even two neighboring stations may have very different sets of stations that represent the regions of influence for each site. The ROI method was proposed in a flood frequency analysis in order to overcome possible inconsistencies that may occur on boundaries of fixed homogeneous regions. In such cases, a classical approach of the regional analysis may lead to undesirable step changes of variables and estimated quantiles; such problems are eliminated in the ROI method by defining a unique set of stations for each site under study. Burn (1990), analyzing Canadian catchments, concluded that the ROI method is superior to the “standard” regional approaches: the novel ROI technique resulted in improved flood quantile estimates even at “difficult” individual stations. Zrinji and Burn (1994) revisited and extended the ROI approach for ungauged sites. Holmes et al. (2002) applied the ROI approach in regionalization of low flow characteristics within the United Kingdom, while Castellarin et al. (2001) used the ROI technique as a background for comparison of several hydrological similarity measures in northern Italy. Up-to-now, the ROI method has been used predominantly in connection with the flood frequency analysis. The present study focuses on development of the region-of-influence approach to the frequency analysis of precipitation extremes, and presents its comparison with the “standard” regional frequency analysis methods in climatological conditions of the Czech Republic. The paper is structured in a following manner: after a short description of selected stations and their data, a detailed overview of the mathematical background of the ROI method is presented. Due to its complexity, the analysis has not been finished at the time of the paper submission deadline, therefore results of the analysis are missing and will be presented at the conference. 2

Data

Daily precipitation totals measured at 145 stations operated by the Czech Hydrometeorological Institute (CHMI) were used as an input dataset (Fig. 1). The altitudes of stations range from 158 to 1322 m a.s.l., and the observations span the period 1961-2005. Three main criteria were applied when selecting the stations and forming the dataset:  the stations approximately evenly cover the area of the Czech Republic;  there were no significant station moves during 1961-2005 (all sites where location changes exceeded 50 metres in altitude were excluded from the analysis);  the daily series of precipitation records are uninterrupted. At about 10% of sites, gaps in daily records appeared (in total not exceeding 2 months over 45 years). These missing daily data were estimated using measurements at 3 to 5 nearest locations available in the climatological database of the CHMI, according to a methodology proposed by P. Štěpánek (note that the CHMI operates a dense network of gauging sites; mean distance to the nearest measuring site was only 16 km for the locations where missing data were estimated). All other potential station records with more than 2 months of missing values were excluded from the analysis at this stage; however, they might be used in future when the ROI method will be applied to construct maps of design value estimates of precipitation extremes. Samples of maximum annual 1-, 3-, 5- and 7-day precipitation amounts were drawn from each station records and are examined as extreme precipitation events. Summer and winter half-years are analyzed separately. Such an approach allows, at least in a rough approximation, to differentiate between extreme precipitation amounts of various origin: convective events that dominate in summer and longlasting stratiform events of frontal origin that are typical for autumn and winter.

Fig. 1 Location of stations used in the analysis.

The data underwent standard quality checking for gross errors as well as checking in terms of a discordancy measure based on L-moments (Hosking and Wallis, 1993). 3

Mathematical model of the region-of-influence method

The region of influence for a given site consists of a group of sites that are sufficiently similar to the site of interest. The similarity of sites is judged according to site characteristics and/or site statistics (Hosking and Wallis, 1997). Site characteristics are quantities that are known before any measurement or observation is obtained at a site and include, for example, the location, the elevation and other physical-geographical properties associated with a site. Some long-term characteristics of the precipitation regime of sites (such as mean annual precipitation or long-term averages of monthly/seasonal precipitation totals) are also frequently classified here: although they are set from the at-site measurements, their values can also be estimated at ungauged sites from climatological maps with a relatively smaller degree of uncertainty compared to extremes. On the other hand, site statistics are simply the measurements or results of statistical processing of the observed data at a given site. Hereafter, site characteristics and site statistics will be termed generally as site attributes. The distance metric The closeness of each site to every other site in the attribute space is evaluated by means of the weighted Euclidean distance metric that is known mainly from cluster analysis: 1

2 2 M Dij =  ∑ Wm ( X mi − X mj )  ,  m =1 

(1)

where Dij is the weighted Euclidean distance between sites i and j, Wm is the weight associated with the m-th site attribute, X mi is the value of the m-th attribute at site i, and M is the number of attributes. The distance metric matrix D is symmetrical ( Dij = D ji ) with zeros on its diagonal ( Dii = 0 ). The region of influence for a given station is constructed according to the following scheme. Firstly, the site with the lowest value of Dij , j = 1, ..., N is added to the ROI for site i. Particularly, in the very first step it is the site i itself, for which the distance metric Dii = 0 is always the one with the lowest value. Following this, a further site with the next lowest value of Dij is added into the ROI for site i. Sites are successively added into the ROI until a given option (see below: “Pooling a station’s ROI”) for forming of the ROI is fulfilled. As the site attributes X m may have substantially different magnitudes, a transformation of the initial site attributes before calculating (1) should be applied. A possible alternative of such operation is normalization of the variables:

X→

X −X

σX

,

(2)

where X is the mean and σ X is the standard deviation of the attribute X. As a result of normalization (2), each of the site attributes X m are of a comparable magnitude, i.e. they have zero mean and unit variance. Selection of site attributes has a key role in the ROI method: the success of the whole procedure lies in finding the right number and combination of proper site characteristics and/or statistics. In the first attempts with the ROI technique for regional precipitation analysis in the Czech Republic, three different alternatives of selection of site attributes are discussed:

a) Alternative #1: site statistics The whole set of site attributes consists exclusively of statistical characteristics that are related to the examined data sample at each of the sites. The following site statistics are considered: 1) coefficient of variation ( cv ) - a traditional characteristics of the scale of the data sample:

cv =

σ , µ

(3)

where µ is the mean and σ is the standard deviation of the data sample. 2) Pearson’s second skewness coefficient (PS) - a less traditional characteristics of the skewness of the data sample (Weisstein, 2006):

PS =

3( µ − m)

σ

,

(4)

where m is the median of the data sample. 3) 10-year design precipitation estimated using the generalized extreme value (GEV) distribution ( R10 y ) - a characteristics of extreme value magnitudes of the data sample. Selection of the aforementioned site statistics ensures that the ROI for a given site consists of sites for which characteristics of the probability distribution functions are similar enough to the one of the target site. The set of attributes in alternative #1 follows Burn’s concept, which has been applied in the introduction of the ROI method (Burn, 1990). There are, however, small differences between the original and the current analysis. Burn used a modification of the Pearson’s skewness coefficient, where the factor “3” in the numerator of eq. (4) has been neglected. Considering the fact that the attribute PS in the current analysis underlies the transformation (2), it can be concluded that there would be no difference between results based either on the “original” or the “modified” Pearson’s skewness coefficient. Another difference can be found in selection of the third site attribute. Burn used Q10 y , which is “a plotting position estimate of the 10-year flood event, interpolated from the available annual flow series”. Instead of it a GEV estimation of the 10-year value has been used herein, mainly for two main reasons: (1) “The normalized 10-year flood estimator has very little bias and approximately a normal distribution in small samples” (Lu and Stedinger, 1992). As the sites in the precipitation frequency analysis for the Czech Republic has 45 years of observations, the design precipitation with return period T = 10 years can reliably be estimated from each of the at-site data samples. (2) As shown in previous studies on regional approaches to the precipitation frequency analysis, the GEV is a generally acceptable distribution in each of the 4 homogeneous regions within the Czech Republic (Kyselý et al., 2006; Kyselý and Picek, 2006). As a consequence, we believe that the uncertainty of choosing the GEV-estimated 10-year design precipitation can be at most comparable with the uncertainty of the Burn’s method of the 10-year value estimation. b) Alternative #2: general climatological site characteristics The set of site attributes is made up by characteristics that describe the long-term precipitation regime of the site, regardless of the observed precipitation extremes. The following quantities are taken into consideration: 1) mean annual precipitation ( R ); 2) the ratio of the mean precipitation totals for the summer and winter half-year ( RS / RW );

3) a characteristics of inter-annual variability of precipitation amounts (exact definition of such a site characteristics is not readily available at the time of preparing this contribution; however, the basic idea is that this site attribute is similar to the Lapin’s index of the Mediterranean effect, defined for climatological conditions of the Slovak Republic; Lapin and Melo, 2005; Gaál, 2005). Using the site characteristics of alternative #2 in the distance metric would, in principle, result in groups of sites with similar climatological conditions that may be to some extent related also to mechanisms that generate heavy precipitation. In practice, however, there is no guarantee that proximity of sites in the M-dimensional space of climatological site characteristics implies similarity in terms of extreme precipitation regime. c) Alternative #3: geographical site characteristics This set of site attributes consists of the well-known and widely used geographical characteristics of the site location: 1) latitude ( ϕ ); 2) longitude ( λ ); 3) elevation above the sea level (H). Having alternative #3 selected, there emerge groups of sites which members are close to the site of interest in a geographical sense. Nevertheless, it cannot be strictly understood as the ordinary geographical proximity between two points. For example, a higher-elevated site is usually grouped together with other higher-elevated sites and is not necessarily joined with other sites, which are located really close to the target site in the traditional sense of latitude and longitude. Again, concerning alternative #3, the same drawback can be mentioned as in the previous case: pooling sites according to geographical characteristics does not necessarily imply similarity in terms of precipitation climate. Alternatives #1-#3 also take into consideration the question of the design value estimation at gauged / ungauged sites. When the regional analysis is aimed to obtain quantiles at ungauged sites as well, alternative #1 is not feasible as data sample to determine site statistics are not available. In such cases, much attention should be paid on finding the most appropriate climatological characteristics. Eq. (1) incorporates a further factor: a weighting coefficient associated with the attributes. In practice, weighting coefficients reflect the relative importance of the attributes within each of the alternatives. Weights can be computed, for example, by means of multivariate analysis, where independent variables can be the selected site attributes and the dependent variable could be a suitably chosen site characteristics. Burn (1990) used a simple correlation analysis between each of the selected attributes and the 100-year flood event, respectively. The resulting correlation coefficients were then set as the weights for the attributes. Such a multivariate or correlation analysis is also a useful tool in reducing the number of candidate attributes according to objective criteria. In the current analysis, equal (unit) weights Wm = 1, m = 1, ..., M have been selected for each of the alternatives. What is more, M = 3 for each of the three alternatives, and therefore no further analyses have been carried out in order to reduce the number of the candidate attributes. In the current study, there have been no attempts to evaluate the relative importance and/or the number of the site attributes because (1) the set of attributes in each of the alternatives reflects reasonable considerations; (2) we believe that M = 3 is the minimum number of attributes which is sufficient for calculation of the matrix of distance metric; (3) our attention was focused on other - possibly more important - aspects of the ROI approach. However, such a task is an excellent challenge for the upcoming investigations within the problems of a regional frequency analysis by the technique of ROI.

Pooling a station’s ROI As soon as the appropriate site attributes are selected and the distance metric matrix is calculated there remain two main tasks so that each site’s ROI can be found. The first one is to determine a cutoffpoint of the distance metric for the i-th site: all sites below the selected threshold value will be included into the site’s ROI, and, on the other hand, any station with a distance value in excess of the threshold will not:

I i = { j : Dij ≤ θ i } ,

(5)

where I i is the set of stations in the ROI for site i, and θi is the threshold distance value for site i. The next important feature of the ROI is a determination of the weighting parameters associated with each of the sites included in the ROI. These weighting coefficients should reflect the relative closeness of any site of the ROI to the site of interest. The substance of this step is evident: the closer a given site is to the site of interest in sense of the distance metric, the bigger amount of information it provides in the regional precipitation frequency analysis. In a mathematical form:

and

ηij = f ( Dij , Ψ ) ∀ j ∈ I i

(6)

ηij = 0 ∀ j ∉ I i ,

(7)

where ηij is the weight for site j in the ROI for site i; f ( ) is a functional relationship defining the weight, and Ψ is parameter vector for the function f ( ). Following Burn’s (1990) framework, the threshold distance θi and the weights ηij in the current analysis are determined according to 3 different options. These options reflect 3 different philosophies of pooling information from the sites of the ROI in order to get more accurate quantile estimates at the site of interest. A) Option #1: “Less sites with high values of weights” The basic idea of option #1 is that the ROI for a given site entails only a limited number of stations; however, all of the selected stations are given weights substantially different from zero while computing regional relationships. The threshold value θi (5) is defined as follows:

θi = θ L

if NSi ≥ NST ,

(8)

where θ L is a lower threshold value defining a desired proximity for stations to be included in the ROI for site i, NSi is the number of stations in the ROI for site i with the threshold at θ L , and NST is the target number of stations for a ROI. If the number of sites in the ROI for the threshold at θ L is lower than a desired minimum number NST then a less restrictive threshold θi should apply:

 NST − NSi    NST 

θi = θ L + (θU − θ L ) 

if NSi < NST ,

where θU is an upper threshold value for sites with fewer than NST stations in the ROI. The weighting function for this option is defined as:

(9)

n

D  ηij = 1 −  ij  ,  TP 

(10)

where Ψ = {TP, n} are the parameters of the weighting function. There are 5 parameters to be initialized for option #1: NST, θ L , θU , TP and n. B) Option #2: “More sites with different values of weights” In option #2, a relatively large number of sites are included in the ROI for a given site. Stations which are sufficiently similar to the site of interest have unit weights, while stations that are less similar have appropriately low values of weighting functions. The threshold value θi is defined as follows:

θi = θU ,

(11)

where θU is a constant threshold value. The weighting function for this option is defined as:

ηij = 1 and

 D −θ  ηij = 1 −  ij L   TN − θ L 

if Dij ≤ θ L ,

(12)

if θ L < Dij ≤ θU ,

(13)

n

where θ L is a lower threshold value for the distance matrix, and TN and n are the parameters of the weighting function. TN is defined in a specific way:

where

TN = max (TLi , TPP ) ,

(14)

TLi = max ( Dij ) .

(15)

{ j}

Ψ = {θ L , θU , TPP, n} are the four parameters of the weighting function for option #2 to be initialized. C) Option #3: “All sites with different values of weights” Option #3 is nearly the same as option #2 with the only difference that in option #3 all of the analyzed stations are included in the ROI for a given site with proper values of weighting functions. The threshold value θi is defined as follows:

θi = TLi ,

(16)

and the definition of the weighting function is the same as in case of option #2 – see eqs. (12)(15). As option #3 entails all of the sites there is no need to deal with selection of the upper threshold θU ; therefore the number of the parameters to be initialized is 3 ( Ψ = {θ L , TPP, n} ).

Parameter estimation for the ROI options In case of options #1-#3, values of several parameters have to be selected following subjective considerations, in accordance with the theoretical background of individual options. The distance metric matrix should serve as the basis for estimation of the threshold values θ L , θU , TP and TPP. As mentioned (see above “The distance metric”), the distance metric matrix is a symmetrical one with zeros on its diagonal. The non-zero values above (below) the diagonal form an upper (lower) triangular matrix that contains the distance metric values between each pair of stations. It is possible to sort the metric values within a triangular matrix in ascending order, to derive their empirical distribution, and then find certain statistical characteristics (maximum, minimum, median or given percentiles) of such distribution. Burn (1990) suggests choosing the values of the parameters θ L , θU , TP and TPP right according to the empirical distribution of distance metric values. The lower distance threshold θ L should be associated with the 25th percentile distance value (in both options #1 and #2); the upper distance threshold θU should be the 75th percentile (again, in both options #1 and #2), while the parameters TP and TPP should be the 85th percentile of the distance metric distribution. The target number of sites NST for option #1 was set to 15 (20). Considering the fact that the total number of sites appearing in the current analysis is 145, the choice NST = 15 (20) seems to be a reasonable compromise between having too many or too few sites, respectively, within a ROI. Small NST does not necessarily have to represent a remarkable improvement of the regional approach in comparison with a pure at-site analysis. On the other hand, too high value of NST may lead, in principle, to creating potentially inhomogeneous sets of stations within a ROI for a given site. In the Burn’s original framework, NST was set to 15, i.e. approximately to one third of the total number (46) of the available hydrometric stations. Exponent n [eqs. (10) and (13), respectively] also significantly influences the magnitudes of the weighting coefficients ηij . According to Burn (1990), the value of n in option #1 was set to 2.5. Such a choice corresponds with the conception of option #1: any ROI consists of a relatively small number of sites, with each site within the ROI having a high value of weighting coefficient that is not unduly different from 1.0. On the other hand, exponent n in case of options #2 and #3 was set to 0.1. This choice enables one to make a clear distinction between two types of sites in a ROI: it assigns relatively low values of weights to sites which are “further” from the site of interest (between the threshold values θ L and θU ), while the information from “closer” sites is pooled by weighting coefficients 1.0. Estimation of at-site precipitation quantiles using information from the ROI As soon as each station’s ROI and the appropriate weighting coefficients are known, it is possible to estimate the at-site precipitation quantiles using information from the sites of the ROI. The design values of extreme precipitation may be computed by the L-moment-based index storm procedure. The number of sites in the analysis is N with the j-th site having sample size nj. In the index storm procedure, at-site data X j ,k , j = 1, …, N, k = 1, …, nj are rescaled by the so-called index storm (usually the sample mean µ j , as well as in the current analysis) in order to get dimensionless data:

x j ,k =

X j ,k

µj

, k = 1, ..., n j .

(17)

These dimensionless data x j ,k at site j are then used to compute the sample L-moments l1( j ) , l2( j ) , … and L-moment ratios:

l2( j ) l1( j )

(18)

lr( j ) = ( j ) , r = 3, 4, ... , l2

(19)

t ( j) =

and

t

( j) r

where j denotes the index of the analyzed site, t ( j ) is the sample L-CV and tr( j ) , r = 3, 4, ... are sample L-moments ratios at site j (for a detailed definition and description of the L-moments refer to Hosking and Wallis, 1997). Regional L-moment ratios t ( i ) R and tr( i ) R , r = 3, 4, ... within the ROI for site i are derived from the atsite sample L-moment ratios as weighted regional averages, where two different weights are applied: the sample size nj (length of observation) and the weighting function based on the ROI distance metric ηij :

∑t t

=

(i ) R

( j)

j∈I i

∑n η j∈I i

∑t and

tr(i ) R =

j∈I i

( j) r

(20)

j ij

n jηij

∑n η j∈I i

n jηij

, r = 3, 4, ... ,

(21)

j ij

where I i is the set of stations included in the ROI for site i (5), for which the weighted regional Lmoment ratios are calculated. The regionally weighted values t (i ) R and tr(i ) R , r = 3, 4, ... are then used to derive the parameters of the selected distribution function in order to get the dimensionless cumulative distribution function, the so-called growth curve. Having the growth curve for the site of interest i, the precipitation quantiles with the desired return period T can be obtained by multiplying the dimensionless T-year growth curve value xiT and index storm µi :

X iT = µi xiT

(22)

In the current analysis, the generalized extreme value (GEV) distribution is used as the regional distribution function. It was selected mainly for two main reasons: (1) the analysis is focused on investigation of the results stemming from combinations of various alternatives of site attribute sets and options for defining weighting functions, respectively, and the possibility of choosing among several different distribution functions might have negative influence on clear explanation of the results; (2) the GEV distribution has been proved as a generally acceptable regional distribution function in previous studies on regional frequency distributions of precipitation extremes in the Czech Republic (Kyselý et al., 2006; Kyselý and Picek, 2006). For each of the sites (N = 145), precipitation durations (k = 1, 3, 5 and 7 days), alternative of set of attributes (a#1-a#3) and options of ROI procedures (o#1-o#3), 7 extreme precipitation quantiles X iT are determined, namely the design values with return periods of T = 2, 5, 10, 20, 50, 100 and 200 years.

Confidence intervals for the estimated quantiles and the evaluation of the ROI approaches Monte Carlo simulations are carried out in order to evaluate the uncertainty associated with the estimated quantiles, and at the same time, to evaluate the performance of various ROI approaches. The Monte Carlo experiment consists of NR = 1000 repetitions, with each of the realizations made up of the following steps: 1) Use a random number generator to generate artificial samples of precipitation data at each of the analyzed stations, having the same record lengths as their real-world counterparts. The simulated data samples at the i-th site of the region have kappa distribution with parameters corresponding to the at-site L-moments [1, t (i ) , t3(i ) , t4(i ) ]. The kappa distribution as a parent distribution is used in order to avoid too early a commitment to a particular distribution as a parent. In cases when fitting of the kappa distribution fails the GEV or generalized logistic (GLO) distribution is used instead. 2) Calculate the at-site statistics, determine the distance metric matrix, and define the region of influence and weighting function values for each station, each of the above described alternatives of attribute sets and each option of ROI procedures. It should be noted that in case of alternatives #2 and #3, there is no need to recalculate the values of Dij , I i and ηij in each repetition of the Monte Carlo simulation as the distance metric is determined from unchanging site characteristics. 3) Determine the at-site estimates of L-moments, the regional L-moments within each station’s ROI, and, finally, the “simulated” extreme precipitation quantiles for each station in accordance with the above described alternatives and options. From the Monte Carlo experiment, it is possible to draw confidence intervals for the estimated extreme precipitation quantiles. For each quantile X iT (for all combinations of the alternatives and options), the confidence interval is defined as the interval between the 5th and the 95th percentile of the empirical distribution of the simulated quantiles. The relative performance of various ROI options is also evaluated using the simulated extreme precipitation quantiles. The simulated design values are used to calculate the root mean square error (RMSE) and bias for each quantiles through:

 1 RMSE T =  N  and

1 BIAS = N T

1 NR  Xˆ − X ∑ ∑  X T i =1 NR m =1  i N

T i,m

T i

T T 1 NR  Xˆ i , m − X i ∑ ∑  X T i =1 NR m =1  i N

   

2

   

 .  

1 2

(23)

(24)

Eqs. (23) and (24) are summations through each repetition of the Monte Carlo experiment (m = 1 to NR) and each of the analyzed stations (i = 1 to N); RMSET and BIAST are the root mean square error and the relative bias for return period T, respectively; X iT is the “real” value for the T-year event at site i, and Xˆ iT,m is the simulated value for the T-year event at site i from the m-th sample of Monte Carlo simulation. Furthermore, the performance of different ROI options can be compared (1) with results of the former regional precipitation frequency analysis that has been carried out using the “traditional” regionalization approach of Hosking and Wallis (4 homogeneous regions within the Czech Republic); (2) with results of a regional frequency analysis where the whole country is treated as a one-and-only homogeneous region; and (3) with results of a traditional at-site frequency analysis without a regional approach.

4

Results

Results of the analysis will be presented at the conference. Acknowledgements The work described in this paper was supported by a young-scientists’ research grant from the Grant Agency of Academy of Sciences of the Czech Republic, under project ID B300420601. This support is gratefully acknowledged. References ADAMOWSKI, K., ALILA, Y., PILON, P.J., 1996. Regional rainfall distribution for Canada. Atmospheric Research 42(1-4): 75-88. BURN, D.H., 1990. Evaluation of regional flood frequency analysis with a region of influence approach. Water Resources Research 26(10): 2257-2265. CASTELLARIN, A., BURN, D.H., BRATH, A., 2001. Assessing the effectiveness of hydrological similarity measures for flood frequency analysis. Journal of Hydrology 241: 270-287. DALRYMPLE, T., 1960. Flood frequency analyses. Water Supply Paper 1543-A. U.S. Geological Survey, Reston. FOWLER, H.J., KILSBY, C.G., 2003. A regional frequency analysis of United Kingdom extreme rainfall from 1961 to 2000. International Journal of Climatology 23: 1313-1334. GAÁL, L., 2005. Introduction of Lapin's indices into the cluster analysis of maximum k-day precipitation totals in Slovakia. Meteorologický časopis - Meteorological Journal 8(2): 85-94. GELLENS, D., 2002. Combining regional approach and data extension procedure for assessing of extreme precipitation in Belgium. Journal of Hydrology 268: 113-126. HOLMES, M.G.R., YOUNG, A.R., GUSTARD, A., GREW, R., 2002. A region of influence approach to predicting flow duration curves within ungauged catchments. Hydrology and Earth System Sciences 6(4): 721-731. HOSKING, J.R.M., WALLIS, J.R., 1993. Some statistics useful in regional frequency analysis. Water Resources Research 29: 271-281. HOSKING, J.R.M., WALLIS, J.R., 1997. Regional frequency analysis: an approach based on L-moments. Cambridge University Press, Cambridge; New York; Oakleigh. 224 p. ISBN 0-52143045-3. KJELDSEN, T.R., SMITHERS, J.C., SCHULZE, R.E., 2002. Regional flood frequency analysis in the KwaZulu-Natal province, South Africa, using the index-flood method. Journal of Hydrology 255(1-4): 194-211. KOHNOVÁ, S., GAÁL, L., SZOLGAY, J., HLAVČOVÁ, K., 2005. Analýza maximálnych úhrnov zrážok v povodí horného Hrona. Stavebná fakulta, Slovenská technická univerzita v Bratislave. 1. vyd. 120 s. ISBN 80-227-2339-8. KYSELÝ J., PICEK J., HUTH R., 2006. Formation of homogeneous regions for regional frequency analysis of extreme precipitation events in the Czech Republic. Studia Geophysica et Geodaetica, accepted. KYSELÝ, J., PICEK, J., 2006. Regional design value estimates of extreme precipitation events in the Czech Republic. Proc. of Bioclimatological workshop, Strečno, Slovakia, this volume. LAPIN, M., MELO, M., 2005. Priestorová interpretácia výstupov klimatických scenárov v povodí Hrona a Váhu geoštatistickými metódami. In: Pekárová, P. a kol. (2005): Scenáre zmien vybraných zložiek

hydrosféry a biosféry v povodí Hrona a Váhu v dôsledku klimatickej zmeny. VEDA, Bratislava, 49-80. ISBN 80-224-0884-0. LETTENMAIER, D.P., WALLIS, J.R., WOOD, E.F., 1987. Effect of regional heterogeneity on flood frequency estimation. Water Resources Research 23(2): 313-323. LU, L.-H., STEDINGER, J.R., 1992. Sampling variance of normalized GEV/PWM quantile estimators and a regional homogeneity test. Journal of Hydrology 138: 223-245. PILON, P.J., ADAMOWSKI, K., ALILA, Y., 1991. Regional analysis of annual maxima precipitation using L-moments. Atmospheric Research 27: 81-92. SMITHERS, J.C., SCHULZE, R.E., 2001. A methodology for the estimation of short duration design storms in South Africa using a regional approach based on L-moments. Journal of Hydrology 241: 4252. SVEINSSON, O.G.B., SALAS, J.D., BOES, D.C., 2002. Regional frequency analysis of extreme precipitation in Northeastern Colorado and Fort Collins Flood of 1997. Journal of Hydrologic Engineering 7(1): 49-63. WEISSTEIN, E.W., 2006 [online]. Pearson's Skewness Coefficients. From MathWorld - A Wolfram Web Resource. http://mathworld.wolfram.com/PearsonsSkewnessCoefficients.html. Last modified 2002-11-26. [cit. 2006-06-20] WILTSHIRE, S.E., 1986. Identification of homogeneous regions for flood frequency analysis. Journal of Hydrology 84: 287-302. ZRINJI, Z., BURN, D.H., 1994. Flood frequency analysis for ungauged sites using a region of influence approach. Journal of Hydrology 153(1-4): 1-21.