Introduction

Global climate change has an unprecedented impact on forests worldwide, causing changes in ecosystems functions and services, species abundance, and biodiversity1. Its effects include phenological changes and change local and global species distribution, increasing the risk of plant species extinction on a local and global scale1,2. As a result, understanding the effects of climate change on the distribution and abundance of a plant species in the current and future time is critical3. Understanding many environmental issues and predicting species responses to environmental change are facilitated by using the species distribution models (SDMs)4,5. The ability of SDMs to predict the probability of species presence has many ecological applications, such as conservation of many threatened and endangered species and prediction of suitable habitats under alternative climate change scenarios6,7. Ensemble modelling is an alternative to the single SDM algorithms, and through it, we use multiple algorithms simultaneously to generate an ensemble SDM8. The advantage of the ensemble model approach is its ability to improve the model predictions and reduce the overfitting considering the variability in individual modeling techniques' predictions9,10.

Soil properties significantly impact plant growth and distribution11. For example, several studies have reported the importance of soils in driving species distribution12,13,14. The inclusion of quantitative edaphic variables, such as pH, inorganic carbon, and volumetric soil water content, can significantly improve the prediction quality of the distribution of a single plant species15,16,17. As a result, focusing solely on climate predictors may result in an inadequate quantification of the niche of a given species. However, the availability of ecophysiologically significant variables, such as microclimate and edaphic variables, is hard to be sourced18. Therefore, the accurate fine-scale predictions of species distributions require developing predictors that depend on significant ecological factors, especially those better reflecting the local soil properties and climatic conditions, and allow for more18,19.

Juniperus phoenicea L. (Cupressaceae family) is an evergreen monoecious coniferous tree20. Its distribution covers the North African Mediterranean countries and it extends to the Arabian coast of the Red Sea toward the east and to the Canary Islands and Madeira to the west21. It grows on hills and dunes in North Africa and in arid mountainous regions20. The current status of J. phoenicea is recorded as Least Concern (LC) at the IUCN website (https://www.iucnredlist.org/species/16348983/99965052). The conservation status of a "least concern" species indicates neither threatened nor near-threatened within the species range. However, the species is subjected to several pressures, including overcutting for biofuel and medicinal purposes, natural habitat degradation, and repeated drought, especially in arid and semi-arid regions14. Although the species is found in many protected areas22, there is evidence of an ongoing decline of J. phoenicea populations in many global regions, including Egypt, where it was classified as a very rare species. During the past few decades, J. phoenicea has suffered from over-collection, habitat destruction, and degradation in North Sinai, Egypt23,24.

Furthermore, there were limited seedling recruitment and high mortality rates in both old and young individuals of J. phoenicea at the same sites of North Sinai. Moreover, the youngest tree was 50and 96 years old in two populations in Northern Sinai, indicating that these populations are declining25. The main reported important factors that limit the distribution of Juniper in North Sinai populations are altitude, topography, and soil factors, especially pH and salinity26,27. El-Banna23 and Farahat25 found that Juniper individuals at Sinai's Mountains have poor vitality at the high altitudes (600–1000 m a.s.l.), but better vitality, with most healthy foliage and reproductive branches at lower altitudes (350–500 m a.s.l.). Moreover, in arid and semi-arid regions (e.g., southern Saudi Arabia, Oman, and Algeria), the juniper populations are critically threatened, especially in areas facing repeated drought14,27,28. Accordingly, this reflects that J. phoenicea is locally abundant in the West Mediterranean countries while it might be highly threatened in the East Mediterranean countries and Arabian Peninsula.

The potential distribution of J. phoenicea had been well addressed recently using the bioclimatic variables at global scale29 or using bioclimatic, and soil variables at a local scale14. However, more attention should be paid to the effects of other relevant, meaningful variables, such as potential evapotranspiration and aridity, and human influence. Studying these factors is very important for J. phoenicea, which prefers growing in humid habitats, yet growing in arid regions. Therefore, conservation of this species in arid mountainous areas in the Sinai, Red Seas, and Saudia mountains is a real challenge. Another challenge facing the conservation of this species is its limited ability of dispersal and regeneration23,25,29,30.

Accordingly, the objectives of the present study were to (1) evaluate the relative importance of climatic, edaphic, and human-influence variables in explanation of the potential distribution of J. phoenicea, (2) determine the most influential predictor factors that might be limiting the regeneration of J. phoenicea, (3) identify the most suitable areas which could be proposed as priority conservation sites for restoration planning. We expect that the current study's findings will help find out the most suitable areas to support the survival of this species and be used for future conservation, especially in declining populations.

Methods

Species distribution data

We obtained a total of 8220 occurrence record data of J. phoenicea from three sources: GBIF.org (https://doi.org/10.15468/dl.abpfdb, accessed on 01 January 2021), Moustafa et al. (2016), and Farahat (2020). In ESRI ArcGIS 10.5, we removed duplicates and the records outside the shapefile of the study area (ESRI world map), resulting in 7162 occurrence records (Fig. 1A). After deleting the reciprocated missing values of the environmental variables of climate, topography, soil, and human influence, the occurrence of J. phoenicea was reduced further into 7067 records.

Figure 1
figure 1

Global distribution map of Juniperus pheonicea generated in ArcGIS 10.5 (A) and a growing plant at a dry, rocky habitat in north Sinai (Egypt) (B).

Environmental and human influence data and multicollinearity

The digital elevation model (DEM) was obtained from the U.S. geological survey (https://www.usgs.gov) at 30 arc-seconds spatial resolution. The nineteen bioclimatic variables of the current climate (1970–2000) were taken from WorldClim v 2.0 (https://www.worldclim.org/data/) at a resolution of 30 arc-seconds. The data of potential evapotranspiration (PET), actual evapotranspiration (AET), and aridity index (AI) were downloaded from the CGIAR-CSI Global database31 (www.cgiarcsi.org) at a spatial resolution of 30 arc-s (~ 1 km at the equator). Then, the elevation and climate data layers were resampled into resolution 2.5 arc-min using ArcGIS 10.5. The physical and chemical soil properties were represented by nine quantitative variables downloaded from the ISRIC-World Soil Information database at 0–2 m depth and a spatial resolution of 30 arc-s32. We used the spatial analyst toolbox to generate the mean raster layers of the different soil depths. The layers were then resampled to 2.5 arc-min (~ 5 km) resolution using ArcGIS10.5.

The data of Global Human Modification of Terrestrial Systems (anthropogenic stressors) that could influence the terrestrial ecosystems were downloaded from the NASA Socioeconomic Data and Applications Center (SEDAC) at 1 Km2 spatial resolution33. This database comprises 13 anthropogenic stressors, including human settlement, agriculture, transportation, and infrastructures.

To avoid overfitting the models, we carried out correlation tests between all the selected 31 predictor variables (environmental and human influence variables, Supplementary Material). The uncorrelated variables were kept after using the variance inflation factor (VIF) that measures how strongly each predictor can explain the rest of the predictors34. The VIFcor and VIFstep functions of the package "usdm" were used to accomplish VIF analysis35 in R 4.1.1. This analysis helped exclude the variables with the VIF values > 5 and a correlation threshold of 0.7536.

The multicollinearity analysis resulted in 15 variables, including altitude, human influence, seven climatic variables, and six soil variables (Table 1). The resolution of 2.5 arc min was used to accept more flexibility of the interactive geographical relationship between the species and its environment37.

Table 1 Relative importance and range of predictor variables explaining potential distribution.

Ensemble modeling and model accuracy

The ensemble-modeling (EM) approach was combined with the Generalized Linear Model (GLM), Random Forest (RF), and Boosted Regression Trees (BRT), which have high stability and transferability5,38. We projected each model under the current climate-related data using 70% and 30% of the training data and performance evaluation, respectively. The availability of data on both species' presence and available environment-related data (pseudo-absence data) are essential to achieve the most effective SDMs. Thus, the number of pseudo absences was randomly sampled for each species and equaled ten times the number of presences39,40.

The ensemble modeling (EM) approach reduces uncertainty in the model predictions. The EM is prevalent to standard models in optimizing the model performance and transferability41,42. We weighted the ensemble models by the True Skill Statistic (TSS) using the "sdm" package in R 4.1.134. The maximum training sensitivity plus specificity (MTSS) was used as a recommended threshold to minimize commission and omission errors43,44. The value of the area under the curve (AUC) closer to 1 demonstrates superior model performance45. The evaluation of the model performance was also assessed by True Skill Statistic (TSS) that assesses the model accuracy45,46. The TSS is better than AUC as it is threshold-dependent and accounts for both sensitivity and specificity, with values ranging from 0 to 145.

Results

Model performance

The ensemble models showed excellent fits and high performance in predicting J. phoenicea distribution. Although we have used many environmental predictor variables of climate, soil, and human influence, the model performance fitted perfectly with mean values of AUC and TSS greater than 0.95 (Table 1), indicating the high preference of J. phoenicea to the climate and soil conditions (environmental filters).

Global potential suitability of J. phoenicea

Most of the presence records were found in the West-Mediterranean region, particularly in Spain (Fig. 1), with the highest potential suitable areas compared to other countries. Moreover, some countries of the West-Mediterranean region, either with very few occurrence records or without any records, showed high suitability, such as Tunisia, Algeria, Morocco, France, Portugal, Italy, and Malta. On the other hand, the East-Mediterranean region showed relatively less potential suitability, which may be attributed to the few occurrence records. Still, some countries of the East-Mediterranean region, such as Turkey, Greece, Lebanon, Palestine, and Jordan, with very few occurrence records, showed high suitability compared to other countries, particularly of the South-Mediterranean region, such as Egypt and Libya (Fig. 2).

Figure 2
figure 2

Global potential habitat suitability generated from the ensemble modelling and visualized in ArcGIS 10.5 using the maximum training sensitivity plus specificity threshold (MTSS).

Key factors determining the potential global distribution of J. phoenicea

Based on the relative importance of the predictor variables generated by the ensemble models, potential evapotranspiration (PET) was the most important climatic factor contributed with 34.8% of the potential distribution of J. phoenicea (Table 1), followed by isothermality (Bio3, 16.5%), and precipitation of the warmest quarter (Bio18, 15.2%), aridity index (AI, 9.2%), and finally temperature seasonality (Bio4, 8.6%). Also, the percent of soil texture clay fraction was a critical predictor variable contributed by 14% in the probability of the species presence (Table 1).

The response curves showed that the maximum presence probability of J. phoenicea was at the narrow range of PET, from 500 mm to 900 mm. A sharp decline in the presence probability was up to 1500 mm, after which it became constant (Fig. 3). Furthermore, the likelihood of J. phoenicea occurrence decreased gradually with the increase of temperature seasonality (Bio4), but the presence probability increased at a narrow range of approximately 5–7.5 °C. On the other hand, the potential occurrence suitability of J. phoenicea showed a gradual increase with the increase of aridity index ranging from 0.1 up to 0.6 (i.e., towards humid climate), indicating the preferability of J. phoenicea to humid habitats (Fig. 3). Moreover, the potential presence of the species increases with the increase of clay content, particularly at the narrow range (30–40%) (Fig. 3).

Figure 3
figure 3

Response curves of the most important predictor variables explaining the potential global distribution of Juniperus phoenicea.

Discussion

As evidenced by an independent test dataset, the selected environmental variables in this study resulted in a well-performed model with high AUC and TSS, indicating a high predictive accuracy46 for the potential distribution of J. phoenicea. This study revealed that predictor variables with narrow ranges such as clay content, temperature seasonality, and potential evapotranspiration could be considered limiting factors for J. phoenicea potential distribution. The total contribution of PET and AI was 44%. In addition, the temperature-related variables (Bio3 and Bio4) contributed with 25.1%, soil clay fraction with 14%, and precipitation of the warmest quarter (Bio18) with 15.2% of the variation in the potential distribution J. phoenicea. On the other hand, very low contribution percentages represented human influence and topography. Such results indicate the negative impact of high temperature and low precipitation on the distribution of J. phoenicea in the world. Significantly, changes in PET, Bio4, and aridity are associated with global warming, inferring a significant impact of climate change on the potential distribution of J. phoenicea. Similarly29, found that temperature-defined climatic factors (Bio1–Bio11) explain approximately 24% of the current global potential distribution of J. phoenicea L. sensu stricto (s.s.), while precipitation factors (Bio12–Bio19) explain more than 75% of the distribution. Bio 19 had a high contribution (precipitation during the coldest quarter, 30.9%), and it was the main driving factor for the ecological niche of the species.

At the local scale, the main key factors for the distribution of J. phoenicea in Algeria were the total soil carbon (22.1%), driest month precipitation (Bio14, 19.2%), slope (11.1%), seasonality of precipitation (Bio15, coefficient of variation) (10.3%), total soil nitrogen (7%), and soil available water capacity during summer (6.3%)14. Besides, it was reported that the populations of this species undergo severe drought conditions and showed dieback of its foliage in many countries such as Egypt24,25,30, Libya47, Algeria14, as well as few sites in Spain48,49. Cramer50 reported that the future global warming in the Mediterranean region is expected to exceed global rates by 25%, indicating that the conservation priority should be given for the arid and semi-arid North-Mediterranean African and Middle East countries, in addition to Southern European countries; the temperature increase has been projected to range between 2 °C and 4 °C by the 2080s in Southern Europe51. This provides an urgent need for conservation planning such as reforestation or afforestation programs for this endangered species in its natural habitats or the potential climatically and edaphically suitable habitats based on the ensemble model output of the current study.

Based on this literature, extreme drought episodes (high temperature, very low precipitation) are the main reason behind the current deterioration of the J. phoenicea populations. Accordingly, the impact of drought seems to be a more pronounced factor at the southern distribution range of the species where high temperature, low precipitation, and severe drought episodes are more frequent than the western Mediterranean sites52,53. The presence of more potential suitable habitats for J. phoenicea by increasing the value of the aridity index, i.e., more humid climate and more soil clay content, could explain its extensive distribution in the West-Mediterranean region compared to the East-Mediterranean region. The sensitive response of J. phoenicea to aridity determines its probability of occurrence by previous studies such as54. Besides, the presence of high soil clay content help in more retention of water and support the growth of the plants for a long time compared to sandy or rocky soil that is predominant in many areas of J. phoenicea in the eastern Mediterranean countries25,47. It is apparent from the results that altitude is not a limiting factor for the distribution of J. phoenicea and human influences. On the contrary, we believe, according to our observations in many field sites, that the human influences, including destruction, grazing, wood, and seed collections, have a strong effect on the distribution and regeneration of the species25,30,47, but it masked by the high contribution percentages of other variables in the analysis.

In conclusion, it is revealed from the results that the area of the potential current distribution of J. phoenicea in the West Mediterranean region is still higher than that in the East Mediterranean. This entails instant conservation and protection of the current distribution areas of declined J. phoenicea populations in eastern Mediterranean countries, including Egypt, Jordon, Saudi Arabia, Libya, and Algeria. Restoration actions, including reforestation and afforestation, should be applied particularly in the arid and semi-arid ecosystems . Strict regulations must be put in place to prevent the logging of juniper's wood and the collection of its seeds for medicinal and commercial purposes. Moreover, due to the greater sensitivity of this species to hotter droughts49, its responses to the predicted future global warming should be investigated at a global scale under different scenarios of climate change.