Comparative study from ground-based rain gauges vs. rainfall products at different time steps in the southeast of the Republic of Djibouti

. The Republic of Djibouti is a small country in the Horn of Africa and as for most regions of Africa, ground rainfall stations are sparse. This study aims to compare at different time steps (annual, monthly


Introduction
The Republic of Djibouti is a small country (23 200 km 2 ) located in the Horn of Africa and as for most regions of Africa, ground rainfall stations are sparse and unequally distributed.
To palliate the low density of the rainfall observation network in some areas, there are alternative or complementary products we can find on the Web such as rainfall estimate products (P-Datasets).Since the 1980s, these P-Datasets have blossomed and offer interesting opportunities because, usually the dataset is continuous, and spatially available and, rainfall data are assessed at daily or hourly time steps.
We can define three groups of rainfall estimation products according to the input data and rainfall data retrieval process: those based on spatial information from ground measurement points, those based on reanalysis data derived from physical and dynamic models, and those based on satellite information using passive microwave (PMW) and infrared (IR) information (Le Coz and van de Giesen, 2020; Satgé et al., 2020).Lots of combinations of these three groups exist on the WEB.
The objective of this study is to determine which of these rainfall products is the most reliable compared to groundbased rainfall stations for the south-east of Djibouti.The south-east zone concentrates most of the population and economic exchanges between Djibouti and its neighbouring countries.Therefore, our study focuses on this part of the country This study is the first component of a broader work that aims to improve the knowledge and understanding of the hydrological processes involved in the Ambouli basin.The Ambouli basin is one of the country's largest catchments Published by Copernicus Publications on behalf of the International Association of Hydrological Sciences.G. M. Omar et al.: Comparative study from ground-based rain gauges vs. rainfall products (794 km 2 ) and is of major importance.The Ambouli aquifer is the main source of drinking water for the city of Djibouti.The Ambouli basin. is of particular importance for flood risk of the city of Djibouti.

Study area
The whole territory of the Republic of Djibouti presents an arid tropical climate which is characterized by the irregularity and the weakness of precipitations, by high temperatures during all the year, by the absence of perennial water course and by a very intense evaporation.The climate of the country is characterized by two distinct seasons: a cool season from October to April and a hot season from May to September (Houmed-Gaba, 2009).Due to geology of volcanic origin, the Republic of Djibouti is characterized by a very steep relief which is a succession of massifs, plateaus and plains also represented in the southeastern part (study area).

Rain gauges
There are 14 rainfall stations in the south-east of the Republic of Djibouti managed by the "Agence Nationale de la Météorologie de Djibouti (ANM)".Out of 14 stations, only 5 have at least 10 months or more of data available per year.Thus, we use a network of 5 rain gauges: Aerodrome, Serpent, Ali-Sabieh, Hol-Hol and Loyada (Fig. 1).The rainfall data from the network cover the period 1951-1990.The rainfall are at monthly time step for 4 rain gauge stations (Serpent, Ali-Sabieh, Hol-Hol and Loyada) and the Djibouti Aerodrome station is at daily time step (over the period 1981-2020).These rainfall data are regulated by the quality control and quality assurance system of the World Meteorological Organization (WMO).

P-Datasets
Products selection for testing was motivated by their availability over the region of interest and as they are commonly used in operational or research studies focusing on hydrology or agronomy.Table 1 presents temporal and spatial characteristics of the P-Datasets.

Methodology
This study assesses the performance of 15 precipitation estimation products at daily, monthly, and annual time step against the ground rainfall stations of the south-eastern zone of the Republic of Djibouti.
The prerequisite for the choice of studied period is a shared time window between P-Datasets and rain gauge data.Since the P-Datasets are only available from the beginning of the 1980's, the common study period is 1980-1990.
The P-Datasets validation is carried out on a spatially aggregated level: point-to-pixel.The grid values of the precipitation estimation products containing each rain gauge station are extracted and pairwise compared.
The performances of P-Datasets were evaluated using the quantitative metrics (KGE and MBE) and categorical indices (POD, FAR, Accuracy, Error and HSS), at annual, seasonal, monthly and daily temporal scale 3.1 Pre-processing (pre-treatment) As the P-Datasets differ in spatial resolution -ranging from 0.0375°for TAMSAT v.3 to 1°for GPCC v.7 -and for easy comparison, the P-Datasets were resampled to a spatial resolution of 0.1°by 0.1°using bilinear interpolation (Nikulin et al., 2012;Akinsanola et al., 2016;Satgé et al., 2020).Moreover, to address the uniformity in the temporal span, the P-Datasets available at a sub-daily time step were aggregated to a daily time step records over the period of study.

Metrics
The comparative study includes quantitative and categorical metrics.Although wide range of metrics are available to assess data performance (Wang et al., 2003;Segele et al., 2008;Diro et al., 2009;Ayehu et al., 2018), there is no single one that encapsulates all aspects of interest.For this reason, it is useful to consider several metrics and to understand the type of information or insight they might provide (Akinsanola et al., 2016).

Quantitative metrics
The metrics used for the ground to rainfall products statistical comparison are: -Kling Gupta Efficiency (KGE) is the Euclidean distance computed using the coordinates of bias (β = y x ), standard deviation (γ (Gupta et al., 2009;Clark et al., 2021): -Mean Bias Eror (MBE) is used to estimate the average bias in the rainfall product, and it provides a good indication of the mean overestimate of predictions.A positive value of MBE means an overestimation: where x corresponds to the ground reference rainfall and y corresponds to the P-Datasets.-Accuracy: -Error (7)

Results and discussion
4.1 Annual Comparisons (Fig. 2) Strong agreement with the rain gauge data is observed for some of the P-datasets.Five P-Datasets have a high KGE -EWEMBI, GPCC, JRA_55_Adj, MSWEP and WFDEI_GPCC.WFDEI_GPCC is the best P-Datasets whereas MERRA-2 PTC and ERA 5 had the worst performance.
Loyada rain gauge has the lowest KGE except for the rainfall estimation products TAMSAT and WFDEI_CRU.This can be explained either by the location of the station or the digitalization of the data (because it is a manual rain gauge station, when data are collected and then they are digitalized by a technician which can generate an uncertainty.

Monthly comparisons (Table 3)
Regarding the KGE, all P-Datasets systematically present values within −0.16 and 0.70 which is much more efficient than the annual time step.The top five P-Datasets at monthly time step are the same as those at annual time step: EWEMBI, GPCC, JRA-55_Adj, MSWEP and WFDEI_GPCC.WFDEI_GPCC is always the best among these fives.As for the annual time step, MERRA-2 PTC corresponds to the least reliable P-Dataset.

Daily comparisons (Table 4)
For the daily time step analyses, we use data from Djibouti Aerodrome station for the period 1981-2005 which is the common period between the station and all the P-Datasets.The GPCC product is not present in the analysis of daily time step data because it has a monthly temporal resolution.
The ability of the P-Datasets to quantify the amount of daily precipitation is relatively low, with most products having negative KGE values (Table 4).Only the best performing products at the annual and monthly time step EWEMBI, JRA-55_Adj, MSWEP v.2.2, and WFDEI_GPCC have positive KGE values.Concerning MBE, values are relatively homogeneous for all products from −0.13 to 0.42 mm.The low value of MBE is due to the low number of rainy days.
Table 4 shows the results of the POD, FAR, Accuracy, Error and HSS indices.Regarding POD, the P-Datasets MERRA-2 (PT and PTC) and ERA5 (the worst performing products at annual and monthly time step) almost do not detect non-rainy days, which explain the almost perfect POD values for these products.EWEMBI, JRA-55 Adj, and WFDEI-GPCC (the best performing products at the annual and monthly time step) show POD values < 0.7.Regarding FAR, we observe relatively low values for the products (FAR > 0.9 except for CHIRPS which presents a FAR equal to 0.75).For the Accuracy and Error indices, the results are moderately satisfactory for most of the products; however, as for the FAR values, the CHIRPS product presents the best Accuracy and Error values, respectively 0.9 and 0.1.Satgé et al. (2020), defined a non-rainy day using several threshold values ranging from 0 to 25 mm were applied for the P-Datasets.They found that maximum value of HSS was obtained with a threshold value of 1 mm.We consider the same threshold of 1 mm and observe that "HSS > 1 mm" are higher than "HSS" for all the P-Datasets.The best performing P-Datasets in terms of "HSS > 1 mm" are the same as at the annual and monthly time step (EWEMBI, JRA-55 Adj, MSWEP and WFDEI-GPCC) with the addition of CHIRPS.

Conclusion
Quantitative and categorical metrics allow us to determine which of the P-Datasets is the most reliable to reproduce the rainfall amounts measured by a 5 rain gauge network regulated by the quality control and quality assurance system of the World Meteorological Organization (WMO) over the south-east of Republic of Djibouti.For the annual and monthly time step, the KGE is used for the product rankings.For the daily time step, we add to KGE a categorical index, the HSS performance index, which allows us to assess the ability of these products to detect rainfall events.
For our study area, five P-Datasets present high KGE values at annual and monthly time step: EWEMBI, GPCC, JRA-55 Adj, MSWEP and WFDEI-GPCC.Therefore, the best performing products at the daily time step are CHIRPS, EWEMBI, JRA-55 Adj, MSWEP and WFDEI-GPCC.
To complete this first study, and to deepen the knowledge concerning the Ambouli basin, it would be necessary to carry out a comparative analysis at the basin scale and use another temporal window to validate the reliability of these products.data analysis, writing/reviewing and editing.CS did the methodology, data analysis, writing and editing.GM did the methodology, data analysis, writing and editing.MJ did the data analysis, review writing and editing.FS did the methodology and data collection.MIN did the data collection.AHH did the data collection.
Competing interests.The contact author has declared that none of the authors has any competing interests.
Disclaimer.Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.Special issue statement.This article is part of the special issue "IAHS2022 -Hydrological sciences in the Anthropocene: Variability and change across space, time, extremes, and interfaces".It is a result of the XIth Scientific Assembly of the International Association of Hydrological Sciences (IAHS 2022), Montpellier, France, 29 May-3 June 2022.
Review statement.This paper was edited by Christophe Cudennec and reviewed by two anonymous referees.

Figure 1 .
Figure 1.Location of the study area and rain gauge stations.

Figure 2 .
Figure 2. Statistical validation of rainfall products for rain gauge stations in the south-eastern part of the Republic of Djibouti, at annual time step for the period 1980-1990.

Table 2 .
Contingency table used for the categorical statistical analysis of rainfall.

Table 4 .
Quantitative (KGE to MBE column)and categorical (POD to HSS column) statistical indicators at daily time step at Djibouti aerodrome station.