Letter to the Editor: Caution needed when using gridded meteorological data products for analyses in Africa

The recent paper by Ng et al., Association between temperature, humidity and ebolavirus disease outbreaks in Africa, 1976 to 2014 [1], addresses an important issue: whether there are meteorological conditions associated with the onset of human Ebola virus disease outbreaks. However, the findings presented in this study are undermined by the use of a gridded climate dataset built, for the region in question, on little-to-no actual meteorological data.

Ng et al. perform their analysis using University of East Anglia Climate Research Unit (CRU) gridded estimates of surface meteorological conditions [2]. This 0.5° resolution global dataset merges monthly meteorological station observations with a global land surface climatology. The derived time series of CRU surface variables have been validated through comparison with other gridded climate products; however, these comparisons were made using hemispheric and other large regional spatial averages of the gridded data. Validation was not reported at the local 0.5° resolution of the gridded data. Furthermore, Harris et al. found discrepancies from other gridded datasets 'mostly in regions and/or time periods with sparser observational data' [2].
Ng et al. used the CRU dataset to identify temperature and humidity conditions associated with the onset of 28 Ebola virus disease outbreaks in Africa during 1976 to 2014. All the Ebola virus disease outbreaks included in this analysis, with the exception of the current West African Ebola virus disease outbreak, occurred in central Africa between 8-34°E and 7°S-6°N (Figure 1). Within this region of more than 3 million km², during 1976 to 2012, fewer than 40 stations per month provided temperature data for construction of the CRU dataset and no stations provided humidity (i.e. vapour pressure) data (Figure 2). Indeed, since 1992, on average fewer than seven stations per month provided temperature data for construction of the CRU dataset.
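The scale of this sparsity can be checked with a little arithmetic. The following is a minimal sketch, assuming a spherical Earth of radius 6371 km; the function names are illustrative and not part of any CRU tooling. It computes the area of the full latitude-longitude box, which comes out at roughly 4 million km² and likely includes some ocean, so it is consistent with the land area of more than 3 million km² quoted above. It then converts the quoted station counts into an average area per station.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; spherical Earth is an assumption


def box_area_km2(lat_south, lat_north, lon_west, lon_east, radius_km=EARTH_RADIUS_KM):
    """Area (km^2) of a latitude-longitude box on a sphere."""
    dlon = math.radians(lon_east - lon_west)
    dsin = math.sin(math.radians(lat_north)) - math.sin(math.radians(lat_south))
    return radius_km ** 2 * dlon * dsin


def km2_per_station(area_km2, n_stations):
    """Average area represented by each reporting station."""
    return area_km2 / n_stations


# Region quoted in the letter: 7°S-6°N, 8-34°E.
box = box_area_km2(-7.0, 6.0, 8.0, 34.0)
print(f"bounding box: {box / 1e6:.2f} million km^2")

# Station counts quoted in the letter ("fewer than 40", "fewer than seven"),
# combined with the letter's lower bound of 3 million km^2 for the region.
for n in (40, 7):
    print(f"{n} stations -> at least {km2_per_station(3.0e6, n):,.0f} km^2 per station")
```

Because the region exceeds 3 million km² and the station counts are upper bounds, the printed per-station areas are lower bounds on how much territory each temperature record has to represent.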

Figure 1
Locations of Ebola virus disease outbreaks included in the analysis of Ng et al. [1]. The dashed box outlines the region (7°S-6°N, 8-34°E) in which the majority of Ebola virus disease outbreaks have occurred.

The CRU dataset uses a land surface climatology together with a correlation decay distance (CDD), within which station records are spatially interpolated, to inform estimates in all grid cells, including those without stations [2]. Even after accounting for this CDD, there are regions of central Africa with no station temperature record signal, and most of Africa has no vapour pressure signal (Figure 3). These issues are not minor. The CRU dataset is designed and validated for large-scale climate analyses; however, the analysis performed by Ng et al. explored local conditions associated with the onset of specific Ebola virus disease outbreaks. The CRU dataset is likely not appropriate for this analysis; consequently, the Ng et al. study conclusions must be qualified. In other instances, for example, in analyses of meteorological conditions associated with malaria in the Kenyan highlands, the findings made using the CRU dataset were significantly different from those made using a richer complement of station observations, accessed through collaboration with the Kenyan Meteorological Department [3]. Unfortunately, local meteorological station records are often non-existent or difficult to access in much of Africa.
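The coverage question raised above, namely which grid cells have any station within the CDD, can be illustrated with a short sketch. This is not the CRU interpolation procedure itself. It only flags cell centres that lie farther than a chosen distance from every station, using great-circle distances on an assumed spherical Earth. The station location, cell centres, the `cdd_km` value and the function names are all hypothetical placeholders, not values from the CRU dataset or from Ng et al.

```python
import math


def haversine_km(lat1, lon1, lat2, lon2, radius_km=6371.0):
    """Great-circle distance in km between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * radius_km * math.asin(math.sqrt(a))


def cells_without_signal(cell_centres, stations, cdd_km):
    """Return the cell centres that have no station within cdd_km."""
    return [
        (clat, clon)
        for clat, clon in cell_centres
        if all(haversine_km(clat, clon, slat, slon) > cdd_km for slat, slon in stations)
    ]


# Hypothetical inputs for illustration only; none of these values come from the letter.
stations = [(0.0, 10.0)]                # one made-up station (lat, lon)
cells = [(0.25, 10.25), (0.25, 30.25)]  # two 0.5-degree cell centres
cdd_km = 1200.0                         # placeholder CDD; the real value depends on the variable

uncovered = cells_without_signal(cells, stations, cdd_km)
print(uncovered)
```

With an empty station list, as for vapour pressure in the region discussed here, every cell is returned as uncovered. In this simplified picture, such cells carry no station signal.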
Study of the relationships between meteorological conditions and infectious disease outbreaks is important research. It is vital that the best meteorological records be identified, made available and used for these analyses [4], and that these analyses be postponed or qualified in the absence of good records.

Figure 2
Time series of total station records for the region 7°S-6°N, 8-34°E used in construction of the University of East Anglia Climate Research Unit dataset.

Figure 3
Plots of the distribution of station records included in construction of the University of East Anglia Climate Research Unit dataset for January 1996.