1. Introduction
In recent decades, the increasing need for research-grade, near-real-time, and spatially dense environmental observations has propelled the number of in situ mesoscale networks for meteorological, agricultural, and hydrological monitoring. Mesoscale networks record multiple essential climate variables (Bojinski et al. 2014) and operate at a horizontal spatial resolution ranging from a few to several hundred kilometers (American Meteorological Society 2019). Information from mesoscale environmental monitoring networks is critical and has been widely used to track climate trends (Garbrecht et al. 2014), better understand soil moisture–rainfall feedbacks mechanisms (Findell and Eltahir 1997; Ford et al. 2015b), validate remote sensing soil moisture and evapotranspiration products (Liu et al. 2011), quantify surface and subsurface soil water storage (Ochsner et al. 2019; Lollato et al. 2016; Patrignani and Ochsner 2018; Ochsner et al. 2013; Swenson et al. 2008), estimate potential drainage recharge rates (Wyatt et al. 2017), and support natural hazard forecasting such as droughts (Ford et al. 2015a; Mozny et al. 2012), flash flooding (Basara 2001; Terti et al. 2019), wildfires (Carlson et al. 2002; Krueger et al. 2015, 2016; Reid et al. 2010), and severe thunderstorms (Gagne et al. 2012). Furthermore, the decision support tools, outreach programs, and scientific discoveries that stem from mesoscale networks propagate through their surrounding community by helping decision-makers and policy makers and contributing to increasing public safety (Ziolkowska et al. 2017). Thus, the spatial configuration of mesoscale networks plays a pivotal role for collecting spatially unbiased observations and for studying and detecting both large-scale and localized environmental phenomena that may otherwise go undetected.
The World Meteorological Organization and steering committees of mesoscale networks have developed extensive guidelines and protocols detailing best practices for selecting the location of new monitoring stations (Shafer et al. 1993) based on dominant land cover, terrain slope, site accessibility, presence of obstacles, and soil properties [Brock et al. 1995; American Association of State Climatologists (AASC); AASC 2019]. There is, however, a lack of a practical method for guiding and planning the spatial configuration at the network level. The location of additional monitoring stations must be in concert with the spatial arrangement of existing stations and aimed at strengthening the ability of the network to effectively capture environmental phenomena across the spatial domain of the network.
One of the earliest methods used for designing the spatial configuration of mesoscale networks consisted of political boundaries. During the inception of the Oklahoma Mesonet (Brock et al. 1995; McPherson et al. 2007) in the early 1990s, county boundaries were used as a template for defining the location of stations. The initial goal of the network’s steering committee was to deploy a monitoring station in each of the 77 counties across Oklahoma (roughly one station per 32 km). The use of political boundaries is a simple and attractive method that has the advantage of covering the entire geographical domain of the network with nonoverlapping subregions. Additional motivations for the use of political boundaries is that private and public local sponsors commonly take pride of ownership and that usually state and federal emergency and disaster assistance programs are based on county level boundaries. However, areas defined by political boundaries can unevenly represent the spatial domain of the network and may still contain contrasting topographical features, land covers, soil types, and even microclimate patterns that make the selection of siting locations not trivial. For instance, the area of counties within the contiguous United States ranges between 5 and 52 000 km2, with counties of western states having a notoriously larger area than counties of eastern states. So, by following political borders, areas with frequent and potentially hazardous environmental events may end up underrepresented, weakening the ability of the network to increase public safety.
More sophisticated optimization approaches such as spatial decorrelation and spatial variance-reduction methods that account for the spatial structure of the observation variables have been widely used to instrument catchment-scale rain gauge networks (Bastin et al. 1984; Bras and Rodríguez-Iturbe 1976; Pardo-Igúzquiza 1998), solar ultraviolet radiation networks (Schmalwieser and Schauberger 2001), groundwater quality monitoring networks (Herrera and Pinder 2005), and air pollution monitoring networks (Kumar 2009). However, these methods have two major drawbacks. The first drawback is that sophisticated optimization approaches are mostly oriented at short-term and application-specific networks, while modern automated mesoscale networks are aimed at long-term (30–50 yr) data collection of multiple variables at several heights (e.g., wind speed at 2 and 10 m), depths (e.g., soil moisture at 5, 10, 20, 50, and 100 cm), and time intervals (e.g., 1, 5, and 60 min along with daily), while also generating numerous value-added products (e.g., reference evapotranspiration, vapor pressure deficit, growing degree days) (Table 1). In application-specific networks, the problem often resides in minimizing a single objective function (e.g., semivariogram, correlogram), but the multivariate character of modern mesoscale networks would require minimization of a complex multiobjective function that not only accounts for a large number of variables, but also for the different degree of spatial autocorrelation of each variable. For instance, daily observations of air and surface soil temperature across Kansas can exhibit a strong (r > 0.9) spatial correlation for stations located even more than 500 km apart, whereas daily precipitation and soil moisture have much noisier and shorter correlation distances (Fig. 1). Multiobjective (Trujillo-Ventura and Hugh Ellis 1991) and entropy-based state-space optimization models (Bueso et al. 1998) have been proposed to integrate multiple observation variables. However, these methods have not gained traction in the community driving the deployment of mesoscale networks. The second drawback is that an existing dataset for the same observation variable and study area is required to compute the parameters of models describing the spatial structure. This is a major limitation in regions where no prior data are available—which may very well be the reason for deploying or augmenting a mesoscale network in that region (Arsenault and Brissette 2014). In the United States, extensive daily records and monthly summaries of precipitation and air temperature are available from nationwide networks such as the National Weather Service (NWS) Cooperative Observer Program (COOP). For instance, monthly air temperature and precipitation from the COOP network have been used to design the spatial configuration of the U.S. Climate Reference Network using an iterative degradation method (Vose and Menne 2004). However, in situ data availability is often limited in other regions of the world, particularly for less ubiquitous variables such as soil moisture and solar radiation.
Essential variables recorded by new 10-m towers of the Kansas Mesonet.
In light of these limitations and the multifunctional facet of modern mesoscale environmental monitoring networks, our objective was to study the application of a new method that solely relies on the geometric configuration of the network to identify the largest unmonitored area of the network, and from it, derive the tentative location of a new station. Our idea departs from previous methods that relied on the spatial structure of observation variables and is aimed at demonstrating a practical method that can be used to guide the deployment of future monitoring stations. The specific question that we will try to answer is: What is the optimal siting location of the next N monitoring stations given the current spatial configuration of a given mesoscale network? Note that we are not concerned with finding the optimal density of the network, but rather with searching for the optimal location of future monitoring stations, regardless of any long-term target network density.
We first describe the problem of finding the largest unmonitored area and the method for identifying the optimal location of future stations using the Kansas Mesonet as an example. We also demonstrate how this method can be easily applied as a marching process to create a road map of the location of N future siting locations. We have not found in the literature any other method providing a long-term road map for siting future networks stations. For a more comprehensive analysis, the method was applied to other statewide and nationwide environmental monitoring networks of the United States. Last, we present a few examples on how to extend the proposed approach in combination with georeferenced information of wildland fires, soil type, and frequency of drought events. For convenience, area is represented throughout this paper following the XX2 format to indicate the length of one side of a square of the given area, for example, 1002 km2 = 10 000 km2 (e.g., Ochsner et al. 2013).
2. Materials and methods
a. Description of the Kansas Mesonet
The Kansas Mesonet is a mesoscale environmental monitoring network across Kansas that was established in 1984 when the Kansas State University Kansas Research and Extension network began a small network of 13 automated weather monitoring stations consisting of electronic sensors mounted on 2-m tripods. As a consequence of the growing number of public and private supporters in recent years, the network underwent a major expansion and currently consists of 62 active stations. Most of these stations have been upgraded to 10-m permanent monitoring towers (Fig. 2) to meet the highest standards of scientific research and make observations comparable to surrounding networks (viz., the Oklahoma Mesonet and Nebraska Mesonet). Stations consisting of 10-m towers monitor precipitation, air temperature, relative humidity, barometric pressure, incident solar radiation, wind speed and direction, soil temperature, and soil moisture at multiple vertical levels (i.e., depths and heights) and time intervals (Table 1). As a test-case scenario for this analysis we used the spatial configuration of the network as of 2016, which only consisted of 56 stations (Fig. 3). This choice was made to best represent the scenario that motivated this study.
b. Method description
The location of a new monitoring station was determined by finding the centroid of the largest unmonitored area of the network. In this paper, we defined the largest unmonitored area as the largest empty circle (LEC) among the existing stations of the network (Fig. 4). Finding the LEC is a problem that belongs to the realms of computational geometry and is a method that solely relies on the spatial configuration of the network (Schuster 2008). Formally, if we let the discrete set of points P = {p1, p2, …, pn}, located on a two-dimensional Euclidean plane, then the problem consists of finding the largest empty circle C, such that no point of P lies in the interior of C (Toussaint 1983). In the context of this study, the set of points P is represented by the set of monitoring stations and the two-dimensional plane is represented by the planar spatial extent of the network. The tentative centroids for the LEC can consist of randomly generated points (i.e., brute force approach) or by simply using the vertices of Voronoi diagrams (also known as Thiessen polygons) (Voronoi 1909; De Berg et al. 2008; Preparata and Shamos 2012) that partition the plane into subregions of convex polygons containing a single monitoring station. We favored the use of Voronoi diagrams, which is another common structure of computational geometry that seamlessly integrates with the LEC method and is computationally faster.
Specifically, each station of the network belongs to a single Voronoi polygon and each Voronoi polygon consists of three or more vertices that represent the support area of each station. Each Voronoi vertex is equidistant (i.e., the circumcenter) to three stations determined by a Delaunay triangle. Voronoi vertices are of central importance for solving the LEC problem since they constitute the candidate centroids for the largest empty circle. However, a limitation of using Voronoi diagrams compared to randomly selected points is that Voronoi vertices are confined within the convex hull of the network. To resolve this issue, we intersected the edges of Voronoi polygons that extended beyond the convex hull with the network boundary (e.g., state or country border). The intersected points at the network boundary were added to the existing pool of tentative centroids for searching the largest empty circle.
Given the current spatial configuration of the network, the problem of finding the largest empty circle was solved by following these six steps: (i) partition the spatial domain of the network into P subregions using Voronoi diagrams (the resulting Voronoi vertices constitute the initial pool of tentative centroids for computing the LEC), (ii) intersect Voronoi diagrams with network boundaries and add the intersected boundary points to the existing pool of tentative centroids, (iii) compute the empty circles using each tentative centroid as the center of the circle and using the distance to the nearest station as the radius of the circle (Fig. 4), (iv) intersect circle boundaries with state boundaries if the empty circle extended beyond the spatial extent of the network, (v) compute the area of each empty circle (some intersected circles in the previous step will no longer preserve the shape and perimeter of the initial circle), and (vi) sort empty circle areas and find the LEC for the network. The geographic coordinates of the centroid of the LEC were used as the suggested optimal location of the next station. Once a new station is deployed, the new LEC needs to be recomputed by repeating the same six steps. This recomputation is required since the newly deployed station will change the configuration of the surrounding Voronoi polygons. Another advantage of computing the LEC using Voronoi diagrams is that these diagrams can be used to study the support area of each network station. All computations were programmed using MATLAB R2019a including Mapping Toolbox, version 4.8.
c. Auxiliary datasets
Georeferenced datasets of wildland fires, severe drought, and soil textural class were coupled with the set of empty circles to better capture unmonitored network areas with frequent natural hazards and underrepresented soil types. Soil texture was particularly included in this study because of the growing interest in measuring variables such as soil moisture to improve our understanding of land-atmosphere interactions, increase the forecasting skill of land surface models, and due to the strong connection of soil type with multiple agricultural and hydrological processes (e.g., erosion and runoff) (Ochsner et al. 2013).
Wildland fire records from 2000 to 2018 were obtained at 1-km spatial resolution from the Moderate Resolution Imaging Spectroradiometer (MODIS) Fire and Thermal Anomalies daily product generated by the National Aeronautics Space Administration (NASA) Terra (MOD14) and Aqua (MYD14) satellites that is available through the NASA Fire Information for Resource Management System web portal. The resulting dataset for Kansas contained a total of 78 222 events, which includes human-caused (mostly prescribed fires) and naturally occurring or accidental fires. Other sources of thermal anomalies (e.g., structure fires, industrial sources) are also contained in the dataset, but make up a minimal number of detections. To minimize the number of misclassified pixels due to unusual thermal signatures we used the confidence level associated with each pixel in the MOD14 and MYD14 products to only select fires detected with a confidence equal or higher than 50%. In this study we favored MODIS data due to high geolocation accuracy compared to other sources such as the federal fire occurrence dataset (Brown et al. 2002) that contains unverified location records (Hawbaker et al. 2013).
Drought data for the conterminous United States were obtained from the U.S. Drought Monitor (Svoboda et al. 2002) for the period 2000–18. The drought dataset consisted of weekly (N = 978 weeks) vector maps containing polygon areas delimiting the spatial extent of different drought categories across the United States. The U.S. Drought Monitor generates five drought intensity classes: abnormally dry (D0), moderate drought (D1), severe drought (D2), extreme drought (D3), and exceptional drought (D4). In our case, polygons for classes D2, D3, and D4 were converted from vector to raster using a 1-km grid to approximate the shape of the polygon. The resulting raster maps were stacked and then the total number of weeks under any of these three drought categories were computed for each pixel. This layer was intended to represent the most frequently affected areas of the United States by severe-or-worse droughts.
Soil texture for Kansas was determined based on 1-km gridded maps of sand and clay content at 5-cm depth obtained from the International Soil Reference and Information Centre SoilGrids product (Hengl et al. 2014), which for the United States is largely based on the U.S. Department of Agriculture (USDA) Natural Resources Conservation Service (NRCS) Soil Survey Geographic Database (Soil Survey Staff 2014). Soils were classified into one of the 12 soil textural classes defined by the U.S. Department of Agriculture.
Simplification of boundary density points for the conterminous United States was achieved using the Douglas–Peucker line-simplification algorithm setting a tolerance of 0.11° of arc. This was necessary to reduce the number of points and speed the computation process at the national level. For instance, the number of points defining the border of the United States was simplified from 301 839 points to 706 points, resulting in nearly identical boundaries and a difference in area of 1.5%, a value that can be assumed to be negligible for the purpose of this study.
To provide a broader context for the results of our study, we also included five statewide networks [Oklahoma Mesonet (McPherson et al. 2007; Brock et al. 1995), Nebraska Mesonet (Shulski et al. 2018), South Dakota Mesonet, North Dakota Mesonet, and New York Mesonet] and two nationwide networks [the U.S. Climate Reference Network (Diamond et al. 2013; Bell et al. 2013) and the Soil Climate Analysis Network (SCAN; Schaefer et al. 2007)].
d. Distribution analysis of unmonitored areas and station-support areas
3. Results and discussion
a. Unmonitored network area
On the basis of the spatial configuration of the Kansas Mesonet we identified a total of 145 empty circles with an area ranging from 252 to 1312 km2 (shaded circle in Fig. 5) and a median area of 622 km2 (Fig. 6a). Our findings revealed that the largest unmonitored area represents a stunning 8% of the area of the state, a value that is also comparable to the total area of about eight Kansas counties. The centroid of the LEC (red cross mark in Fig. 5) represents the point farthest (i.e., remotest) from any station across the entire network and indicates the ideal siting location of the next monitoring station of the Kansas Mesonet. The LEC centroid is located at geographic coordinates 38.3456°, −96.3599°, about 13 km to the southeast of Emporia, Kansas and right at the intersection of Lyon and Chase counties. The method of finding the LEC only provides tentative siting coordinates since the definitive location of the station will depend on land availability and network-specific site selection criteria that typically account for land cover, depth of the soil profile, landscape position, and presence of obstacles. The resulting LEC is almost entirely circumscribed into the Flint Hills ecoregion, an area dominated by natural grassland vegetation, rolling landscapes, and rocky soils that stretches about 400 km in the north–south direction between the northern part of Oklahoma and the Kansas–Nebraska border. The absence of Kansas Mesonet stations across much of this region is mostly due to the lack of Kansas Research and Extension Experiment Stations, low interest from public and private sponsors, and frequent prescribed wildland fires that increase the risk of accidental station damage.
On the other hand, station-support areas for the Kansas Mesonet ranged from 202 to 932 km2, with a median value of 602 km2 (Fig. 6b). The largest support area was for the Hill City station in the northwest portion of the state (shaded gray area in Fig. 5), which did not overlap with the largest unmonitored area and represented only 50% of the area of the LEC. This unexpected finding infringes upon the perception that new stations should be deployed so as to “split” the largest support area. In this particular case, the largest empty circle and the largest support area are on opposite sides of the network, meaning that the largest support area would remain unchanged after deploying the next monitoring station at the centroid of the LEC.
To put the analysis of the Kansas Mesonet into a broader perspective, we applied the method of identifying the LEC to five additional statewide and two nationwide mesoscale networks (Table 2). Not surprisingly, the Oklahoma Mesonet and the New York Mesonet with more than 120 stations each had the smallest LEC with an area of 612 km2. The LEC from these two networks are lower by almost a factor of 5 relative to that of the Kansas Mesonet (i.e., 1312 km2) and are smaller by nearly a factor of 100 relative to the LEC area for nationwide networks such as the SCAN and the U.S. Climate Reference network. The small unmonitored areas and the more uniform distribution of support areas give the Oklahoma Mesonet and the New York Mesonet the ability to capture localized natural phenomena such as convective storms, tornadic supercells, and even the development of flash droughts associated with patched irregular areas of coarse-textured soils, that in sparser networks may go completely undetected.
Summary of network area, LEC area, station-support area, and support area distribution as represented by the Gini coefficient for multiple statewide and nationwide mesoscale environmental monitoring networks across the United States. The areas are represented by XX2 to indicate the length of one side of a square of the given area; e.g., 1002 km2 = 10 000 km2. Station counts are for 2016.
The Kansas Mesonet, despite being one of the studied statewide networks with the lowest number of stations, resulted in a more uniform (lower Gini coefficient) support area distribution than comparable and denser networks. For instance, the Kansas Mesonet resulted in a Gini coefficient of 0.27 (Table 2), while the Nebraska Mesonet with 68 stations and the North Dakota Mesonet with 69 stations resulted in Gini coefficients of 0.35 and 0.28, respectively. Something similar occurred with nationwide networks, where the uneven distribution of the 185 stations of the SCAN resulted in a 47% greater Gini coefficient compared to the U.S. Climate Reference Network with just 110 stations across the conterminous United States (Table 2), highlighting the effectiveness of the iterative degradation method supporting the design of the U.S. Climate Reference Network (Vose and Menne 2004). From this analysis, it seems logical to us that the spatial resolution of mesoscale networks should not be only characterized by the mean size of the station-support area, but also by including a metric of spatial uniformity of the station-support areas (e.g., Gini coefficient) and the size of the largest unmonitored area.
b. Station deployment road map
The computation of the LEC can also be applied recursively to identify the optimal sequence of future stations (Fig. 7). The recursive implementation consisted of finding the centroid of the LEC to deploy the next monitoring station and then adding this new station to the list of all stations before starting the next iteration. In practice, it could take several years or even decades for a mesoscale network to reach its largest size, but the recursive implementation of the LEC method can enable researchers and network managers to plan the evolution of the network, identify priority areas, and create a long-term road map for better allocation of limited resources. The recursive analysis could be expanded to deploy N number of stations, but in practice mesoscale networks are often limited by funding and manpower, making this maximum value a complex variable that strongly depends on contemporaneous political–economic situation of the supporting institution.
The recursive method was applied to the Kansas Mesonet by growing the network from 56 stations to an idealized maximum of 105 stations, which equals the number of Kansas counties and is an arbitrary, but reasonable, number of long-term stations for this network. Deploying additional 49 stations resulted in a reduction of the largest empty circle from 1312 to 642 km2, a change of 76% relative to the current value (Fig. 8a). The deployment of the first station had the greatest impact, reducing the LEC area by 17%. To a lesser extent, the largest support area (LSA) was reduced from 932 to 602 km2, which is equivalent to a 58% percent reduction compared to the current value. After nearly doubling the size of the network, the areas of the LEC and the LSA tended to converge, suggesting that the configuration of the network tends to have similar and uniformly distributed monitored and unmonitored areas. This increased in uniformity can be clearly visualized by a decreased Gini coefficient from 0.27 for the current spatial configuration of the network to 0.14 for the hypothesized future configuration of the network. This change is also depicted by the reducing gap between the Lorenz curve corresponding to the network with 105 stations and the line of perfect support area distribution (Fig. 8b). The Gini coefficient is a useful metric to quantify the progress of the spatial support of the network during the augmentation stage, but the Gini coefficient is only a relative measure of support area distribution. Thus, it provides little information about the absolute magnitude of the spatial support of the stations, which is important from the point of view of capturing mesoscale phenomena. In other words, two networks with identical Gini coefficients can have completely different support areas.
The resulting road map contained some stations right at the network boundary. This is a consequence of including boundary points into the pool of tentative empty circle centroids. We identified at least three different ways of addressing the area between the convex hull and the boundary of the network. Our first alternative was to identify the intersecting points between the Voronoi polygons and the network boundaries (i.e., state line). A second alternative involved the generation of random points on the area between the convex hull and the network boundaries. A third alternative involved the use of stations from neighboring mesonet networks. We favored the addition of points from the intersection between Voronoi polygons and the network border (i.e., state border) to allow the network to grow beyond the boundaries imposed by the convex hull and improve the spatial coverage of unmonitored gaps near network boundaries. However, the selected approach is slightly more computationally demanding since borders delimited by natural boundaries often require more points than straight political boundaries. Our analysis treated mesoscale networks in isolation from other networks, but with the increasing number of mesoscale networks it is advisable that the deployment of new monitoring stations should involve the location of stations from neighboring networks as if any given network is part of a network of networks.
Two alternative geometric methods for finding the largest unmonitored area of a network considered in earlier stages of this study included computing the largest empty rectangle and the largest empty convex polygon. A common weakness of these two alternative methods is the susceptibility to detect large and elongated unmonitored areas, in which the polygon centroid can result adjacent to existing stations.
c. Application of the LEC with georeferenced environmental information
Up to this point the selection of the next monitoring station was solely determined by the geometric configuration of the existing stations. In this section we take the LEC method a step further and we demonstrate how it can be applied with georeferenced data of natural hazards and resources.
With the growing interest in monitoring soil-related variables such as soil moisture (Ochsner et al. 2013), knowledge of the spatial distribution of the predominant soil and detailed characterization of soil physical properties are becoming essential for accurate measurements (Scott et al. 2013). Soil textural class is a variable that synthesizes many of the continuous soil physical properties into a manageable and practical number of soil classes that have been widely used in hydrology and agriculture. Eight of the 12 USDA soil textural classes are present in Kansas as based on the percentage of sand and clay content at the 5-cm depth (Fig. 9). The predominant soil classes are loam (38.5%), silty clay loam (26.9%), and silt loam (24.8%), which together account for about 90% of the land area of the state (Table 3). The Kansas Mesonet currently represents five of the eight soil textural classes. Most monitoring stations are deployed in loam (28 stations), silt loam (18 stations), and silty clay loam (7 stations) soils. Silty clay loam is the predominant soil within the LEC, accounting for 91% (22 702 km2/24 981 km2) of the LEC area and representing nearly 27% (22 702 km2/84 555 km2) of the total area of silty clay loam soils across Kansas (Table 3, Fig. 9). Deploying a new station in the centroid of the LEC shown in Fig. 9 will not only decrease the size of the largest unmonitored area of the network, but will also increase the representativeness of the third most common soil textural class across Kansas, which is clearly underrepresented. In fact, eleven additional stations would need to be deployed in silty clay loam soils [100(7 + 11)/(56 + 11) = 26.9%, Table 3] to align the proportion of stations in each soil textural class with the areal proportion of each soil textural class across the state.
Area of the eight USDA soil textural classes present in Kansas and the LEC.
The utilization of prescribed fires over the Flint Hills ecoregion is another prominent use of weather data in the state. These prescribed fires are a useful management strategy to suppress invasive plants and improve grassland yield and palatability for cattle grazing. Accurate environmental monitoring is equally important over areas with prescribed fires to guide firefighters, land managers, smoke management, and increase awareness for conditions conducive for wildland fire. Wind speed and direction, air temperature, relative humidity, antecedent precipitation, and dry soil and vegetation conditions are primary variables utilized for prescribed burning and fire danger models (Carlson et al. 2002), all of which are variables monitored by modern mesoscale networks. The LEC with the greatest number of satellite-detected thermal anomalies was the same as the LEC identified using the spatial configuration of the network (Fig. 10). There were a total of 19 711 MODIS pixels representing fires in the period 2000–18 within the LEC. While the population density within this area is low with about 1.15 people per square kilometer (total of 96 937 people), there are multiple actively growing urban areas near the edge of the LEC. Deploying new monitoring stations within this LEC will not only enhance the ability of the network to better track and warn about the risk of wildland fires and air pollution in the region, but it will also complement the monitoring of underrepresented soil types.
Juxtaposing the spatial configuration of the SCAN with historical records of severe-or-worse drought events from the U.S. Drought Monitor revealed that some of the largest unmonitored areas of the SCAN also match regions with frequent and persistent droughts. The SCAN stems from the need to better monitor agricultural areas of the United States susceptible to drought conditions and currently consists of 185 stations within the conterminous United States. The SCAN effectively covers most agricultural areas within the United States susceptible to drought, but our analysis also revealed noticeable gaps in the network. The two largest and nonoverlapping unmonitored circles in the SCAN are located in areas of the country that also exhibit some of the highest frequency of severe-or-worse droughts (Fig. 11) in the period from 2000 to 2018. Both of these regions have a low population density and are characterized by rocky mountainous terrain dominated by bare soil or natural grasslands and shrublands. The centroid of the LEC area is located at coordinates 43.8749°, −107.9829° (cross mark in Fig. 11), which point is in Wyoming. The LEC has an area of 6352 km2 and encompasses parts of Wyoming, Montana, Idaho, and South Dakota. The median number of weeks under severe or worse drought conditions within the LEC was 280 weeks, with some localized areas reaching a maximum of 403 weeks, which is about 41% of the studied period. The second largest empty circle was located on Arizona and New Mexico. This is somewhat expected since part of this area belongs to the Sonoran Desert. Although the SCAN was deployed with the goal of monitoring environmental conditions and encourage research and decision making in agricultural areas of the United States (Schaefer et al. 2007), deployment of new stations may be required in the identified gaps to better understand the onset, development, and persistence of droughts as natural phenomena beyond the realms of agricultural production.
While the SCAN does not have presence in certain drought-prone areas of the United States, the Snowpack Telemetry (SNOTEL) network also developed and maintained by the USDA NRCS has multiple stations within these unmonitored SCAN regions. The SNOTEL network was developed to supply near-real-time snowpack information to forecast streamflow volumes and many sites also monitor soil moisture, soil temperature, solar radiation, and wind speed similar to sites of the SCAN, highlighting the importance of generating a network of networks to complement environmental observations. Another important remark from our network analysis of the Kansas Mesonet and SCAN suggests that, in addition to the spatial variability of essential variables, the design of future mesoscale environmental monitoring networks should account for the spatial distribution of natural resources and environmental hazards.
The previous analyses also exposed some of the limitations of the proposed geometric method for siting future monitoring stations. The method was described using the Kansas Mesonet, which is located inland surrounded by a rather flat landscape with gradual elevation changes, and may not be adequate to guide the placement of future monitoring stations in regions dominated by a large number of islands or by mountainous terrain. In regions presenting high elevation, meteorological processes such as orographic precipitation events may not be accurately represented. In coastal environmental monitoring networks, such as the National Observatory of Athens Automatic Network in Greece that extends to numerous islands, the limitation of the LEC method could be circumvented by considering islands as separate spatial domains. Then, the size of the different continental and insular LEC could be ranked to define the optimal location across the entire domain of the environmental monitoring network. Similarly, subregions of mesoscale monitoring networks characterized by mountainous terrain could be delineated and treated separately. In this context, an alternative solution that requires further exploration may consist of generating elevation-weighted LEC.
4. Conclusions
We investigated the problem of identifying the optimal location of future monitoring stations in modern automated mesoscale networks. The method consisted of finding the largest unmonitored area of the network by searching for the largest empty circle, in which its center represents the remotest location of the network and the tentative location for siting a new station. We demonstrated that simple and well-established concepts from computational geometry and statistical economics can be used to analyze the spatial distribution of stations and to objectively determine the siting location of future monitoring stations across an existing network, removing the need for subjective decisions and complicated optimization functions.
Spatially dense statewide mesoscale networks in the United States with more than 120 stations per state had largest unmonitored areas of about 60 km2, while nationwide mesoscale monitoring networks had largest unmonitored areas of 500–600 km2. Across multiple statewide and nationwide networks, the largest unmonitored area was spatially unrelated to the station-support area, which implies that both concepts are somewhat independent and should be considered together to characterize the spatial resolution of mesoscale networks. The proposed method was effective to generate a long-term road map of the location of future monitoring stations to help scientists, network managers, and state climatologists better plan the spatial coverage of the network and the allocation of limited resources for network growth and maintenance. The road map for the case of the Kansas Mesonet generated the optimal geographic location of the next 49 stations of the network. To our knowledge this is the first paper describing the location of N monitoring stations within an existing mesoscale network.
While we still recommend the use of geostatistical approaches that account for the spatial structure of one or few variables for siting stations in application-specific networks, the proposed method of identifying the largest unmonitored area solely based on the geometric configuration of the network provides an alternative, more practical, and amenable method for siting new stations in multivariate mesoscale networks. Due to the geometric nature, the proposed approach can be easily integrated with network boundaries and georeferenced information of natural resources and environmental phenomena such as soil type, wildland fires, and severe droughts. Future studies could focus on environmental hazards not covered in our study such as flooding events and severe thunderstorms. Our study focused on terrestrial environmental monitoring networks, but we do not see any major obstacle on why the proposed approach could not be extended to multifaceted oceanic networks.
Acknowledgments
Partial support for this project has been provided by the Kansas State University Agricultural Experiment Station (Contribution 20-175-J), the Kansas Soybean Commission through Agreement KSC/KSU 1782 and Award A00-0635; and the Kansas Corn Commission through Awards A00-0767-001 and A00-1110. The authors also thank three anonymous reviewers for their helpful and constructive feedback.
REFERENCES
AASC, 2019: Recommendations and best practices for mesonets, version 1. American Association of State Climatologists Rep., 37 pp., https://stateclimate.org/best-practices/.
American Meteorological Society, 2019: Mesoscale. Glossary of Meteorology, http://glossary.ametsoc.org/wiki/mesoscale.
Arsenault, R., and F. Brissette, 2014: Determining the optimal spatial distribution of weather station networks for hydrological modeling purposes using RCM datasets: An experimental approach. J. Hydrometeor., 15, 517–526, https://doi.org/10.1175/JHM-D-13-088.1.
Basara, J. B., 2001: Soil moisture observations for flash flood research and prediction. Coping with Flash Floods, E. Gruntfest and J. Handmer, Eds., Kluwer Academic, 231–241.
Bastin, G., B. Lorent, C. Duqué, and M. Gevers, 1984: Optimal estimation of the average areal rainfall and optimal selection of rain gauge locations. Water Resour. Res., 20, 463–470, https://doi.org/10.1029/WR020i004p00463.
Bell, J. E., and Coauthors, 2013: U.S. Climate Reference Network soil moisture and temperature observations. J. Hydrometeor., 14, 977–988, https://doi.org/10.1175/JHM-D-12-0146.1.
Bojinski, S., M. Verstraete, T. C. Peterson, C. Richter, A. Simmons, and M. Zemp, 2014: The concept of essential climate variables in support of climate research, applications, and policy. Bull. Amer. Meteor. Soc., 95, 1431–1443, https://doi.org/10.1175/BAMS-D-13-00047.1.
Bras, R. L., and I. Rodríguez-Iturbe, 1976: Network design for the estimation of areal mean of rainfall events. Water Resour. Res., 12, 1185–1195, https://doi.org/10.1029/WR012i006p01185.
Brock, F. V., K. C. Crawford, R. L. Elliott, G. W. Cuperus, S. J. Stadler, H. L. Johnson, and M. D. Eilts, 1995: The Oklahoma Mesonet: A technical overview. J. Atmos. Oceanic Technol., 12, 5–19, https://doi.org/10.1175/1520-0426(1995)012<0005:TOMATO>2.0.CO;2.
Brown, T. J., B. L. Hall, C. R. Mohrle, and H. J. Reinbold, 2002: Coarse assessment of federal wildland fire occurrence data: Report for the national wildfire coordinating group. CEFA Rep. 02-04, 30 pp., https://cefa.dri.edu/Publications/fireoccurrencereport.pdf.
Bueso, M. C., J. M. Angulo, and F. J. Alonso, 1998: A state-space model approach to optimum spatial sampling design based on entropy. Environ. Ecol. Stat., 5, 29–44, https://doi.org/10.1023/A:1009603318668.
Carlson, J. D., R. E. Burgan, D. M. Engle, and J. R. Greenfield, 2002: The Oklahoma Fire Danger Model: An operational tool for mesoscale fire danger rating in Oklahoma. Int. J. Wildland Fire, 11, 183–191, https://doi.org/10.1071/WF02003.
De Berg, M., O. Cheong, M. Van Kreveld, and M. Overmars, 2008: Computational Geometry: Algorithms and Applications. Springer, 386 pp.
Diamond, H. J., and Coauthors, 2013: U.S. Climate Reference Network after one decade of operations: Status and assessment. Bull. Amer. Meteor. Soc., 94, 485–498, https://doi.org/10.1175/BAMS-D-12-00170.1.
Findell, K. L., and E. A. B. Eltahir, 1997: An analysis of the soil moisture-rainfall feedback, based on direct observations from Illinois. Water Resour. Res., 33, 725–735, https://doi.org/10.1029/96WR03756.
Ford, T. W., D. B. McRoberts, S. M. Quiring, and R. E. Hall, 2015a: On the utility of in situ soil moisture observations for flash drought early warning in Oklahoma, USA. Geophys. Res. Lett., 42, 9790–9798, https://doi.org/10.1002/2015GL066600.
Ford, T. W., A. D. Rapp, S. M. Quiring, and J. Blake, 2015b: Soil moisture-precipitation coupling: Observations from the Oklahoma Mesonet and underlying physical mechanisms. Hydrol. Earth Syst. Sci., 19, 3617–3631, https://doi.org/10.5194/hess-19-3617-2015.
Gagne, D. J., II, A. McGovern, J. B. Basara, and R. A. Brown, 2012: Tornadic supercell environments analyzed using surface and reanalysis data: A spatiotemporal relational data-mining approach. J. Appl. Meteor. Climatol., 51, 2203–2217, https://doi.org/10.1175/JAMC-D-11-060.1.
Garbrecht, J. D., X. C. Zhang, and J. L. Steiner, 2014: Climate change and observed climate trends in the Fort Cobb experimental watershed. J. Environ. Qual., 43, 1319–1327, https://doi.org/10.2134/jeq2013.07.0286.
Gini, C., 1921: Measurement of inequality of incomes. Econ. J., 31, 124, https://doi.org/10.2307/2223319.
Hawbaker, T. J., V. C. Radeloff, S. I. Stewart, R. B. Hammer, N. S. Keuler, and M. K. Clayton, 2013: Human and biophysical influences on fire occurrence in the United States. Ecol. Appl., 23, 565–582, https://doi.org/10.1890/12-1816.1.
Hengl, T., and Coauthors, 2014: SoilGrids1km—Global soil information based on automated mapping. PLOS ONE, 9, e105992, https://doi.org/10.1371/journal.pone.0105992.
Herrera, G. S., and G. F. Pinder, 2005: Space-time optimization of groundwater quality sampling networks. Water Resour. Res., 41, W12407, https://doi.org/10.1029/2004WR003626.
Krueger, E. S., T. E. Ochsner, D. M. Engle, J. D. Carlson, D. Twidwell, and S. D. Fuhlendorf, 2015: Soil moisture affects growing-season wildfire size in the southern Great Plains. Soil. Sci. Soc. Amer. J., 79, 1567–1576, https://doi.org/10.2136/sssaj2015.01.0041.
Krueger, E. S., T. E. Ochsner, J. D. Carlson, D. M. Engle, D. Twidwell, and S. D. Fuhlendorf, 2016: Concurrent and antecedent soil moisture relate positively or negatively to probability of large wildfires depending on season. Int. J. Wildland Fire, 25, 657, https://doi.org/10.1071/WF15104.
Kumar, N., 2009: An optimal spatial sampling design for intra-urban population exposure assessment. Atmos. Environ., 43, 1153–1155, https://doi.org/10.1016/j.atmosenv.2008.10.055.
Liu, W., Y. Hong, S. Khan, M. Huang, T. Grout, and P. Adhikari, 2011: Evaluation of global daily reference ET using Oklahoma’s environmental monitoring network—Mesonet. Water Resour. Manage., 25, 1601–1613, https://doi.org/10.1007/s11269-010-9763-0.
Lollato, R. P., A. Patrignani, T. E. Ochsner, and J. T. Edwards, 2016: Prediction of plant available water at sowing for winter wheat in the southern Great Plains. Agron. J., 108, 745–757, https://doi.org/10.2134/agronj2015.0433.
Lorenz, M. O., 1905: Methods of measuring the concentration of wealth. Publ. Amer. Stat. Assoc., 9, 209–219, https://doi.org/10.2307/2276207
McPherson, R. A., and Coauthors, 2007: Statewide monitoring of the mesoscale environment: A technical update on the Oklahoma Mesonet. J. Atmos. Oceanic Technol., 24, 301–321, https://doi.org/10.1175/JTECH1976.1.
Mozny, M., M. Trnka, Z. Zalud, P. Hlavinka, J. Nekovar, V. Potop, and M. Virag, 2012: Use of a soil moisture network for drought monitoring in the Czech Republic. Theor. Appl. Climatol., 107, 99–111, https://doi.org/10.1007/s00704-011-0460-6.
Ochsner, T. E., and Coauthors, 2013: State of the art in large-scale soil moisture monitoring. Soil Sci. Soc. Amer. J., 77, 1888–1919, https://doi.org/10.2136/sssaj2013.03.0093.
Ochsner, T. E., E. Linde, M. Haffner, and J. Dong, 2019: Mesoscale soil moisture patterns revealed using a sparse in situ network and regression kriging. Water Resour. Res., 55, 4785–4800, https://doi.org/10.1029/2018WR024535.
Pardo-Igúzquiza, E., 1998: Optimal selection of number and location of rainfall gauges for areal rainfall estimation using geostatistics and simulated annealing. J. Hydrol., 210, 206–220, https://doi.org/10.1016/S0022-1694(98)00188-7.
Patrignani, A., and T. E. Ochsner, 2018: Modeling transient soil moisture dichotomies in landscapes with intermixed land covers. J. Hydrol., 566, 783–794, https://doi.org/10.1016/j.jhydrol.2018.09.049.
Preparata, F. P., and M. I. Shamos, 2012: Computational Geometry: An Introduction. Springer Science and Business Media, 398 pp.
Reid, A. M., S. D. Fuhlendorf, and J. R. Weir, 2010: Weather variables affecting Oklahoma wildfires. Rangeland Ecol. Manage., 63, 599–603, https://doi.org/10.2111/REM-D-09-00132.1.
Schaefer, G. L., M. H. Cosh, and T. J. Jackson, 2007: The USDA Natural Resources Conservation Service Soil Climate Analysis Network (SCAN). J. Atmos. Oceanic Technol., 24, 2073–2077, https://doi.org/10.1175/2007JTECHA930.1.
Schmalwieser, A. W., and G. Schauberger, 2001: A monitoring network for erythemally-effective solar ultraviolet radiation in Austria: Determination of the measuring sites and visualisation of the spatial distribution. Theor. Appl. Climatol., 69, 221–229, https://doi.org/10.1007/s007040170027.
Schuster, M., 2008: The largest empty circle problem. Swarthmore Rep., 10 pp., https://www.cs.swarthmore.edu/~adanner/cs97/s08/papers/schuster.pdf.
Scott, B. L., T. E. Ochsner, B. G. Illston, J. B. Basara, and A. J. Sutherland, 2013: New soil property database improves Oklahoma Mesonet soil moisture estimates. J. Atmos. Oceanic Technol., 30, 2585–2595, https://doi.org/10.1175/JTECH-D-13-00084.1.
Shafer, M. A., T. Hughes, and J. D. Carlson, 1993: The Oklahoma Mesonet: Site selection and layout. Preprints, Eighth Symp. on Meteorological Observations, Anaheim, CA, Amer. Meteor. Soc., 231–236.
Shulski, M., S. Cooper, G. Roebke, and A. Dutcher, 2018: The Nebraska Mesonet: Technical overview of an automated state weather network. J. Atmos. Oceanic Technol., 35, 2189–2200, https://doi.org/10.1175/JTECH-D-17-0181.1.
Soil Survey Staff, 2014: Soil survey field and laboratory methods manual, version 2.0. Soil Survey Investigations Rep. 51, 488 pp., https://www.nrcs.usda.gov/Internet/FSE_DOCUMENTS/stelprdb1244466.pdf.
Svoboda, M., and Coauthors, 2002: The Drought Monitor. Bull. Amer. Meteor. Soc., 83, 1181–1190, https://doi.org/10.1175/1520-0477-83.8.1181.
Swenson, S., J. Famiglietti, J. Basara, and J. Wahr, 2008: Estimating profile soil moisture and groundwater variations using GRACE and Oklahoma Mesonet soil moisture data. Water Resour. Res., 44, W01413, https://doi.org/10.1029/2007WR006057.
Terti, G., I. Ruin, J. J. Gourley, P. Kirstetter, Z. Flamig, J. Blanchet, A. Arthur, and S. Anquetin, 2019: Toward probabilistic prediction of flash flood human impacts. Risk Anal., 39, 140–161, https://doi.org/10.1111/risa.12921.
Toussaint, G. T., 1983: Computing largest empty circles with location constraints. Int. J. Comput. Inf. Sci., 12, 347–358, https://doi.org/10.1007/BF01008046.
Trujillo-Ventura, A., and J. Hugh Ellis, 1991: Multiobjective air pollution monitoring network design. Atmos. Environ., 25A, 469–479, https://doi.org/10.1016/0960-1686(91)90318-2.
Voronoi, G., 1909: Nouvelles applications des paramétres continus á théorie des formes quadratiques. Deuxième Mémoire. Recherches sur les paralléloèdres primitifs. J. Reine Angew. Math., 1909, 67–182, https://doi.org/10.1515/crll.1909.136.67.
Vose, R. S., and M. J. Menne, 2004: A method to determine station density requirements for climate observing networks. J. Climate, 17, 2961–2971, https://doi.org/10.1175/1520-0442(2004)017<2961:AMTDSD>2.0.CO;2.
Wyatt, B. M., T. E. Ochsner, C. A. Fiebrich, C. R. Neel, and D. S. Wallace, 2017: Useful drainage estimates obtained from a large-scale soil moisture monitoring network by applying the unit-gradient assumption. Vadose Zone J., 16, 1–15, https://doi.org/10.2136/vzj2017.01.0016.
Ziolkowska, J. R., and Coauthors, 2017: Benefits and beneficiaries of the Oklahoma Mesonet: A multisectoral ripple effect analysis. Wea. Climate Soc., 9, 499–519, https://doi.org/10.1175/WCAS-D-16-0139.1.