HTML
-
Daily (20: 00-20: 00) precipitation data used are from 141 stations in Southeast China (Zhejiang Province, Fujian Province, Shanghai, and Taiwan) during 1960-2019 (Fig. 1). The data for the Chinese mainland are obtained from the National Meteorological Information Center (NMIC) of the China Meteorological Administration (CMA), while those for Taiwan Island are from the meteorological authority of Taiwan.
The historical best-track dataset during 1960-2019, including position, maximum surface wind speed near TC center, and pressure of the TCs at 6 hourly intervals, is obtained from the Shanghai Typhoon Institute (Ying et al. [36]). The operational NWP model forecast tracks were obtained from the NMIC for the period 2004-2019.
To compare the precipitation forecast results of the DSAEF_LTP model with that from NWP models, we also include corresponding rainfall forecast historical data from the European Centre for Medium-Range Weather Forecasts (ECMWF) model, the National Centers for Environmental Prediction Global Forecast System (GFS) model, the Global/ Regional Assimilation Prediction System (GRAPES) model of the China Meteorological Administration and the Shanghai Meteorological Service WRF ADAS Real-Time Modeling System (SMS-WARMS), with horizontal resolutions being 0.125° × 0.125°, 0.25° × 0.25°, 0.25° × 0.25°, and 0.09° × 0.09°, respectively. To process the data uniformly, we have converted the horizontal resolutions of the four models to 0.1° × 0.1° by bilinear interpolation.
-
Application of the DSAEF_LTP model (Ren et al. [37]; Ding et al. [32]; Jia et al. [33-34]) consists of four steps: (1) obtaining the TC track forecast from the NWP model, (2) constructing the GIV, (3) identifying analogs, and (4) finding the ensemble LTP of the analogs.
Table 1 lists the eight parameters (P1-P8) of the original DSAEF_LTP model. The third column gives the values of the parameters in the experiments. The first step is to combine the observed track of the target TC before the initial time and the forecast track after the initial time with its complete track. The observed track prior to the initial time (P1) and the operational NWP model forecast track after the initial time are combined as the target TC track.
Parameter Description Experimental values P1 Initial time The complete track of the target TC consists of the observed track before the initial time and the forecast track after the initial time. 1: 1200 UTC on day 1
2: 0000 UTC on day 1
3: 1200 UTC on day 2
(day 1, the day of TC precipitation on land; day 2, the day before day 1)P2 Similarity region A designated region within which the TSAI is calculated. It is a rectangle with diagonal points A and B. A is the location of the TC at 0, 12, 24, 36 or 48 h before the initial time and B is the location of the TC at 0, 12 or 24 h before the maximum lead time (i.e., at the time the predicted TC track ends). There are 15 experimental values through a combination of A and B. P3 Threshold of the segmentation ratio of a latitude extreme point A parameter of TSAI that represents the bending degree of TC tracks 1, 0.1; 2, 0.2; 3, 0.3 P4 Overlapping percentage threshold of two TC tracks A parameter of TSAI that represents the longitudinal (latitudinal) degree of overlap of TC tracks 1, 0.1; 2, 0.2; 3, 0.3; 4, 0.4; 5, 0.5; 6, 0.6 P5 Seasonal similarity A parameter indicating the TC landfall time 1, the whole year; 2, May-November; 3, JulySeptember; 4, the same landfall month as the target TC; 5, within 15 days of the landfall time of the target TC P6 Intensity similarity A parameter indicating the differences between the intensity of the target TC. There are four categories of TC intensity and the similarity of TC intensity is divided into five levels Four categories: 1, average intensity on the first rainy day; 2, maximum intensity on the first rainy day; 3, average intensity on all rainy days; 4, maximum intensity on all rainy days
Five levels: 1, all grades; 2, the target TC intensity is the same grade or above that of the target TC; 3, the same grade or below; 4, only the same grade; 5, the same grade or one grade differenceP7 Number (N) of analog TCs screened for the ensemble forecast N target TCs with the first N most similar GIVs to that of the target TC 1-10 for 1, 2 … and 10, respectively P8 Ensemble method Ensemble forecast method 1, mean; 2, maximum Total number of schemes 3×15×3×6×5×4×5×10×2 1, 620, 000 Table 1. Parameters of the DSAEF_LTP model.
The second step is to construct the GIV. GIV includes TC internal factors and environmental factors of TC. So far, there are three TC internal factors in the GIV: TC track, landfall season, and tintensity of the TC.
The third step is to determine the similarity of GIVs. P2, P3, and P4 (i.e., similarity region, threshold of the segmentation ratio of a latitude extreme point, and overlapping percentage threshold of two TC tracks) are used to determine historical TCs whose tracks to that of the target TC. TC track Similarity Area Index (TSAI) (Ren et al. [37]) is calculated in this process. It represents the degree of TC track similarity. The more similar the two TC tracks are, the closer the index is to 0. After determining the track similarity, the seasonal similarity and the intensity similarity are determined by P5 and P6, respectively. After that, N historical TCs (analogs) with GIV most similar to the target TCs are selected where N, the number of analogs, is specified by P7. In the fourth step, the TC precipitation of analogs are used to obtain the ensemble precipitation forecast of the target TC using an ensemble method as specified by P8.
As described, the analog-ensemble forecast technique is specified by a set of parameters, each having a range of values. The combinations of different parameter values are called schemes. In this paper, the schemes of each experiment are different, which are introduced in detail in Section 3.1.
Based on Table 1, Table 2 provides the five new values for the improved parameter P2 (Jia et al. [33]) and the five new values for the improved parameter P8 (Jia et al.[34]). The serial numbers for the five new values of P2 are 16th-20th, respectively. Meanwhile, these for the five new values of P8 are 3rd-7th, with the names being the optimal percentile (90th percentile in this study), fuse, probability matching mean, equal difference-weighted mean, and TSAI-weighted mean, respectively.
Parameter Experimental values P2 Similarity region The vertex corresponding to the starting times is used as the southeast vertex to make a square with a side length of 2000 km as the 16th similar region scheme. The midpoint of the southwest corner of this similar region and the first similar region is point A, and the midpoint of the northeast corner is point B. A and B are the two diagonal points of the 17th similar region. Move the 16th similar region as a whole until its southeast corner reaches point A as the 18th similar region. Move the 16th similar region as a whole until its northwest corner reaches point B as the 19th similar region. Make a north-south and east-west value line through points A and B respectively, and the intersection point is C. Make a square with a side length of 2, 000km at the northwest vertex of point C as the 20th similar region scheme. P8 Ensemble method 3rd, Optimal percentile (90th percentile in this study):
1). For each station, pre (i), i = 1, 2, …, m is sorted from small to large. Pre (r) is the precipitation ranked r.
2). d = 1 + (m - 1) × 0.9
3). The integer part of d is r, and the decimal part is f
4). Prep = pre (r) + [pre (r + 1)-pre (r)] × f
4th, Fuse:
Calculation rules of the forecast precipitation at each station:
1). If Max (pre (i)) ≥ 100 mm, Prep = Max (pre (i));
2). If the 90th percentile value of pre (i) ≥ 50 mm, the equals the 90th percentile value of Pre (i);
3). If the 75th percentile value of pre (i) ≥ 50 mm, the equals the 75th percentile value of Pre (i);
4). If the median value of pre (i) ≥ 10 mm, the Prep equals the median value of Pre (i);
5). If none of the above conditions can be met, the Prep equals the 10th percentile value.
5th, Probability matching mean (PM):
1). Arrange all the precipitation data for the m members of 141 stations in ascending order (containing 141 × m stations' rainfall data). Divide the 141 × m data into 141 equal parts in reverse order, retaining the median of each part and recording them as Prem (k), k = 1, 2, …, 141.
2). For a station, the average precipitation of m selected analogs at this station is $\text { Prea }=\frac{\sum\limits_{i=1}^m \operatorname{pre}(i)}{m} $; the Prea of 141 stations is ranked in reverse order; the ranking of each station's prea is recorded as k.
3). Corresponding to the prem (k) of each station based on the k of each station, and prem (k) is the predicted precipitation for this station, Prep = prem (k).
6th, Equal difference-weighted mean (ED-WM):
The weight of the precipitation for the selected similar TC whose similarity rank i is
$ W(i)=\frac{(2 \times m-i) \times 2}{(3 \times m-1) \times m}(i=1, 2, \ldots, m)$, the forecasted precipitation is $ \operatorname{Prep}=\sum\limits_{i=1}^m W(i) \times \operatorname{Pre}(i)$.
7th, TSAI-weighted mean (TSAI-WM):
$ A(i)=\frac{1}{\operatorname{TSAI}(i)}(i=1, 2, \ldots, m)$; the weight of the precipitation for the selected similar TC whose similarity rank i is $ W(i)=\frac{A(i)}{\sum\limits_{i=1}^m A(i)}$, and the forecast precipitation is $\text { Prep }=\sum\limits_{i=1}^m W(i) \times \operatorname{Prep}(i) $Table 2. New values for improved parameter P2 (similarity region) and P8 (ensemble method).
-
(a) OSAT
The observed TC precipitation are obtained from daily precipitation data over Southeast China by the Objective Synoptic Analysis Technique (OSAT) for partitioning TC precipitation (Ren et al. [38-39]; Wang et al. [40]). This method can identify the precipitation generated by TCs from daily observed precipitation data based on the distance between the stations and the precipitation centers.
(b) Threat score (TS)
The index to evaluate the forecast performance in this paper is the TS, which is widely used in meteorological operational forecasts and is calculated as follows:
$$ \mathrm{TS}=\frac{\text { hits }}{\text { hits }+\text { misses }+\text { false alarms }} $$ The TS ranges from 0 to 1 and the closer it is to 1, the higher the hit rate.
(c) BIAS score
BIAS is used to describe how close the forecasted precipitation area coincides with the observed precipitation area, calculated as follows:
$$ \text { BIAS }=\frac{\text { hits }+\text { false alarms }}{\text { hits }+\text { misses }} $$ Take value 1 as the critical point. Bias greater than 1 means it is likely to be false alarms, and less than 1 means it is likely to miss.
2.1. Data
2.2. Methodology
2.2.1. DSAEF_LTP MODEL
2.2.2. OTHER METHODS
-
Table 3 provides the best schemes of the four experiments. It shows that the values of the two improved parameters, P2 and P8, in the best schemes are all new values. P2 is the 17th value in DSAEF_LTP_2. P8 is the 4th value in DSAEF_LTP_3. In DSAEF_LTP 4, P2 is the 20th value and P8 is the 4th value. In the following paragraphs, we use the results of the best schemes to represent the performances of DSAEF_LTP_ 1, DSAEF_LTP_2, DSAEF_LTP_3, and DSAEF_LTP_4.
Parameter DSAEF_LTP_1 DSAEF_LTP_2 DSAEF_LTP_3 DSAEF_LTP_4 P1 Initial time 1 2 1 2 P2 Similarity region 7 17 2 20 P3 Segmentation ratio 2 3 1 2 P4 Longitudinal overlap 3 3 5 5 P5 Seasonal similarity 1/2 3 1 1 P6 Intensity similarity (1, 3) (all, 1) (3, 3) (1, 4) P7 Number of analogs 4 4 9 8 P8 Ensemble method 2 2 4 4 Table 3. Parameter values in the best schemes for the four experiments of DSAEF_LTP model over Southeast China.
Figure 3 shows the distribution of TS for the four experiments. For the simulation experiments (Fig. 3a), the TSsum of DSAEF_LTP_4 (0.6352) is the highest among the four experiments. TS250 and TS100 are 0.2647 and 0.3710 respectively, which implies that the DSAEF_LTP model can significantly improve the simulation ability of the model after improving both parameters, similarity region and ensemble scheme. Moreover, the TS of DSAEF_LTP_2 (0.5874) is slightly better than that of DSAEF_LTP_3 (0.5788). This means that compared with the introduction of the new ensemble method parameter values, the performance of the DSAEF_LTP model improves more by introducing new similarity regions.
Figure 3. Distribution of threat score TS for the four experiments. (a) simulation and (b) forecast experiment. (DSAEF_LTP_1: pink, DSAEF_LTP_2: green, DSAEF_LTP_3: red, DSAEF_LTP_4: blue)
Figure 3b shows the TS of the best scheme in the four forecast experiments. The performance of the DSAEF_LTP model in forecast experiment is slightly different from the simulation experiment. On the one hand, compared with DSAEF_LTP_1, the performance of DSAEF_LTP_2 has not improved. The TSsum of DSAEF_LTP_2 is 0.2639 higher than that of DSAEF_LTP_1 in the simulation experiment, while the TSsum is 0.0020 lower than that of DSAEF_LTP_1 in the forecast experiment. On the other hand, the forecast performance of DSAEF_LTP_3, not DSAEF_LTP_4, is the best among the four experiments, which indicates that the DSAEF_LTP model has the most obvious improvements in forecast performance after introducing just the new ensemble methods.
The tracks of training samples and independent samples are given in Fig. 2 in Section 3.2. If we pay attention to what we define as the typical TC track, i.e., TCs moving in a southeast towards northwest direction or making landfall over Southeast China, the numbers in the training samples and the independent samples are 28 (or 60% of tracks) and 4 (22%) respectively. The change in track characteristics between the training and independent samples may be the cause of the different model having optimum predictions from simulation to forecast experiments. This will be explored further in Section 4.3.
In conclusion, DSAEF_LTP_4 performs best in the simulation experiment and DSAEF_LTP_3 performs best in the forecast experiment. Thus, DSAEF_LTP_3 and DSAEF_LTP_4 are selected as representative experiments in subsequent analyses.
-
Figure 4 shows the TS of DSAEF_LTP_3, DSAEF_LTP_4 and the four dynamical models. The results show that the forecast performance of DSAEF_LTP_3 is best, whose TSsum, TS250, and TS100 scores are 0.4881, 0.1678, and 0.3203, respectively. DSAEF_LTP_4 ranks second, whose TSsum, TS250, and TS100 scores are 0.3504, 0.0950, and 0.2552, respectively. Among the dynamical models, the ECMWF model has the best forecast performance, and its TSsum (i. e., TS250 + TS100) is 0.2930 (i. e., 0.0583 + 0.2347). This is followed by the SMS-WARM, GFS, and GRAPES models. That is, DSAEF_LTP has an advantage in forecasting accumulated rainfall of ≥ 250 mm and ≥ 100 mm.
Figure 4. Comparison of the TS of the DSAEF_LTP model with four numerical models (ECMWF, GRAPES, GFS, SMS-WARMS) in independent forecast experiments.
Figure 5 compares the average BIAS of DSAEF_LTP_3 and DSAEF_LTP_4 with four dynamical models in different magnitudes. BS250 (BS100) indicates the BIAS of accumulated precipitation ≥ 250 mm (≥ 100 mm). The BS250 and BS100 of the two improved DSAEF_LTP model (DSAEF_LTP_3 and DSAEF_LTP_4) are more than 1, implying that the model tends to have false alarms in accumulated forecasting precipitation above 250 mm and 100 mm. SMS-WARM, ECWMF, GFS, and GRPAPES are four dynamic models with the BS250 and BS100 being all less than 1, which means that the area of misses is wider than that of false alarms.
-
TSsum of the training samples and independent samples with the best scheme of DSAEF_LTP_3 are ranked from high to low, respectively. Figs. 6a and b respectively compare the intensity and precipitation characteristics of the five TCs with high TSsum ranking and the five TCs with low TSsum ranking in simulation experiment and forecasting experiment. For both the simulation and the forecast experiments the average maximum daily precipitation and average maximum accumulated precipitation of the five TCs with high TSsum are greater than that of the five TCs with low TSsum ranking. In contrast, the intensities of high performing versus low performing examples are almost the same. As for the issue of tracks discussed earlier, Fig. 7 compares the tracks of the five TCs with high TSsum ranking and the five TCs with low TSsum ranking in both the simulation experiment (7a, b) and the forecasting experiment (7c, d). In the simulation experiments, the tracks of the five TCs with high TSsum ranking all move in the southeast-northwest direction, i.e., they are TCs with typical tracks. Meanwhile, in the forecast experiment, over half of the TCs with high TSsum ranking have TC tracks of landfalling over Southeast China or southeast-northwestward direction. By comparison, the tracks of the five TCs with low TSsum ranking are not the typical TC track. The characteristics of TCs with good forecasts in DSAEF_LTP_4 are similar to those in the DSAE_LTP_3 model (figure omitted). Thus, the TCs with high TSsum ranking of DSAEF_LTP model are characterized by heavy precipitation and typical TC tracks over Southeast China.