Spatial modeling, prediction and seasonal variation of malaria in northwest Ethiopia

Objectives The aim of this study was to determine the spatial modeling, seasonal variation of malaria and making prediction map of malaria in northwest Ethiopia. Results The overall average cumulative annual malaria incidence during the study period was 30 per 100 populations at risk. The highest proportion (29.2%) was observed from June 2015 to October 2016. In temporal analysis of clusters, the epidemic was observed from 2015/7/1 to 2016/12/31 throughout the study period in all districts. Hotspot areas with high clusters (p < 0.001) were observed in Metema district it accounts 18.6% of the total malaria cases. An area of high median predicted incidence proportion (> 50%) was seen in the southwest part of the region. Most of the northern part of the study area was predicted to have a low median incidence proportion (< 10%). Electronic supplementary material The online version of this article (10.1186/s13104-019-4305-1) contains supplementary material, which is available to authorized users.


Introduction
Malaria is a mosquito-borne infectious disease of humans caused by the genus Plasmodium, which are introduced into the circulatory system by the bite of an infected female anopheles mosquito [1]. It is one of the major public health challenges undermining development in the poorest countries [1][2][3][4].
Malaria is largely seasonal in Ethiopia, with the major incident occurring during the rainy.
Season from April to October [5]. In Ethiopia 75% of the country is malarious with about 60% of the total population living in areas at risk of malaria. That is, 50.6 million people are at risk from malaria, and four to five million people are affected by malaria annually [1,6].
The study area is one of the lowland malarious regions in Ethiopia. The dominant species of malaria in the region are both P. falciparum and P. vivax [7]. It accounts 19% of the national malaria burden [7,8]. Currently, the study area accounts 31% (1.3 million cases) of Ethiopia's malaria burden [9].
Malaria mosquitos are aggregated over large areas and time periods. There are few studies examining the extent and drivers of local variation in malaria exposure.
In geographical location, close proximity share common exposures which influence the disease outcome. Ignoring the potential spatial correlation in neighboring areas due to common exposure could result in incorrect model estimates. A little research was conducted by considering spatial correlation. But, using a geostatistical modeling with Bayesian framework takes into account spatial clustering by introducing location-specific random effect parameters in the covariance matrix of a function of distance between locations.
The findings of this study can be used to increase the evidence for targeting control measures and will contribute to the development of models capable of predicting future malaria scenarios. has had 28.7 °C; 14.1 °C and 0% to 87.5% mean maximum and minimum temperature and relative humidity respectively. The study area also had and two rainy seasons: the main one is from June to September, followed by a shorter one from March to May. The dry season ranges from October to February [9].
Malaria data were obtained from monthly reported surveillance forms of north Gondar zone health offices between early 2014 and late 2017. The data were collected from the study population, that is patients who visit health institutions in the study area and were aggregated at woreda levels. The spatial coordinates (the latitudes and longitudes) for each woreda were obtained from the Ethiopian demographic and health survey (EDHS) GPS data reference.
For clustering and spatial pattern was detected by using ArcGIS and Sat Scan ™ software, version 9.1 [10] using the Kulldorf method.
For computing hypothesis (Ho) test and the presence spatial autocorrelation we used global and local Moran's I test statistic [11]. Significant clustering, variable or dissimilar patterns and random patterns were declared when the mean Moran's I values are positive, negative and zero respectively. Z score values were used to see local clustering of malaria cases. Z score values above 1.96 and below negative 1.96 were used to show hotspot and coldspot areas respectively and any value between the two shows random distribution of cases. Interpolation was done using Empirical Bayesian kriging for predictions and smoothing.

Result
About 916,204 malaria cases which were reported from the study period, 69.8% were plasmodium falciparum, 26.8% plasmodium vivax and 2.2% of the case were mixed infected. 71.78% of total malaria cases were observed in adults aged above 15 years old and 8.5% were childrens under 15 years old.
We have identified two major malaria transmission periods, the major transmission time starts from mid-April with highest occurrence in June and July this falls to its lowest in December. The second high transmission was from October to November (Fig. 1).
Even though the cumulative annual malaria incidence was 30 per 100 populations at risk, it has showed a significant variation across districts and months. The highest incidence (44.8%) occurred in Metema district from July to November 2016. Whereas, lowest incidence (5%) occurred in Debark Town administration district from late November to the end of December 2014. From the total malaria cases reported the largest percentage  Hot spot areas with high cluster of malaria transmission and cold spot areas with low level clusters were identified. Hotspot areas with high clusters (p < 0.001) were seen in Metema district it covers 18.6% of the total malaria cases (Fig. 2).
Districts that had lower malaria incidence proportion in the region are indicated by high bright green color on the map and they were clustered around the North and east part of the study area, while the high malaria incidence proportion are indicated by high dark and less dark red color which are located in the Western part of the study area.
The maximum peak, where spatial clustering highly pronounced is at a distance of 135.778 kms with corresponding Z score of 9.81 (p-value < 0.00). This distance band is used for the analysis of hot spot and cold spot clusters (Additional file 4). The global Moran's index statistic for the malaria incidence per 100,000 population was 0.31 (p-value = 0.007), indicating the presence of significant positive spatial autocorrelation over the whole study area.
In this analysis the z-score with corresponding p-values showed that there is statistically significant autocorrelation at 0.1, 0.05 and 0.01 level of significance (Additional file 5).
An area of high median predicted incidence proportion (> 50%) was seen in the southwest part of the region. Most of the northern part of the study area was predicted to have a low median incidence proportion (< 10%). The area of highest predicted incidence proportion within the study area corresponds with the majority of high incident districts. The black ramp color indicates the predicted malaria high risk areas and red ramp color indicates less risk areas of malaria (Additional file 6).
The dominated species plasmodium falciparum were covered 69.8% of the total malaria cases. Other studies also agreed that plasmodium falciparum were the dominant species in the northwest parts of Ethiopia [15,22] and both Plasmodium falciparum and Plasmodium vivax is common in Amhara region [7]. The reason may relate to temperature; temperatures more than 18 °C for p. Falciparum and more than 15 °C for vivax are suitable for the growth of these two species. Obviously, in the most part of the country, especially in northwest Ethiopia, temperature is greater than the specified minimum scale [23]. The above idea was supported by researchers in Tanzania [24].
The 71.8% of malaria victims were adults aged above 50 years. This is in agreement with other studies [12,14]. However, other studies reversely stated that malaria infection decreased with increasing age group (70); it reasoned out higher body temperature individuals had more susceptible to malaria, which are incapable of clearing parasites more effectively as adults. Even though, the study area is an active investment and agricultural hot spot areas in Ethiopia whereby there are frequent adult immigration from another part of the country.
The 29.2% cases from the total cases were recorded from July 2015 to December 2016. It is supported by other studies in Ethiopia [9,15,25], because of high malaria burden in high-risk geographic areas [26].
The major transmission period were observed from the beginning of mid-April and peaked in June and July, declining at the end of December. The second peak of was observed between October and November. Other studies agreed that in Ethiopia season there are two peak malaria transmission periods, but the occurrence months were varied [27,28]. However, whatever the starting point all agreed that malaria transmission were high at the first wet season (summer rains) while the latter followed the end of the wet season [28].
The spatio-temporal models indicated that the most likely cluster was located in Dembia district and in East Belesa. All these identified clusters are closely related to a specific geographical area and share similar geographical parameters, such as altitude and weather conditions. Spatial clusters were detected in Metema, Tach Armachiho and Gendawuha town administration districts, all of which are located in the Ethiopia-Sudan border regions of the country. These districts are located far from referral hospitals and contain a high number of seasonal migrants due to the presence of agricultural investments and malaria is poorly controlled in these travelers and hard to reach populations. This highlights the risk of cross-border transmission of malaria in Ethiopia and Sudan, particularly in predominantly male migrant populations.
The highest predicted malaria incidence was observed in the region of Dembia, Takusa, Metema district (North West part of the zone) this region can be attributed to sesame (local name "selit") growing which is a predominant economic activity. The sesame paddies are suitable habitats for malaria vector breeding. In southeastern region and central North part of malaria is low largely due to its location in the highlands [29].

Conclusions
Malaria is still a major public health problem in the study area with an overall average cumulative annual incidence of 30 per 100 populations at risk. This malaria incidence proportion in the region was higher than the country statistics. Malaria incidence has also showed a significant spatio-temporal clustering. Most of the northern part of the study area is predicted to have a low median incidence.