1. Introduction
The International Comprehensive Ocean–Atmosphere Data Set (ICOADS) (originally COADS) was first released in 1985 (Slutz et al. 1985) and covered the period 1854–1979 (Woodruff et al. 1987, 1993). The most recent release, R3.0.0, spans from 1662 to 2014 (Freeman et al. 2017) with near-real-time (NRT) monthly updates from 2015 to the present, provided as ICOADS R3.0.1. The ICOADS is an integrated source of environmental observations such as sea surface temperature (SST), air temperature (AT), sea level pressure (SLP), and wind speed and direction. These observations are gathered from a variety of observing platforms including ships, moored and drifting buoys, and fixed platforms like oil rigs and coastal offshore structures. ICOADS provides access to a range of observation types, globally, and through the entire marine instrumental record.
As a foundational dataset, ICOADS has been widely used in various climate-related data and assessment products, such as the U.S. National Oceanic and Atmospheric Administration’s (NOAA’s) NOAAGlobalTemp (Zhang et al. 2019; Vose et al. 2012); the NOAA Extended Reconstructed Sea Surface Temperature (ERSST) (Huang et al. 2015, 2017); the NOAA Optimum Interpolation Sea Surface Temperature (OISST) (Reynolds et al. 2007; Banzon et al. 2016; Huang et al. 2021); the Met Office (UKMO) Hadley Centre Sea Ice/Sea Surface Temperature/global surface temperature datasets HadISST (Rayner et al. 2003), HadSST4 (Kennedy et al. 2019), and HadCRUT5 (Morice et al. 2021); Nighttime Marine Air Temperature datasets HadNMAT2 (Kent et al. 2013), CLASSnmat (Cornes et al. 2020), and UAHNMATv1 (Junod and Christy 2019); the Intergovernmental Panel on Climate Change Climate Assessment Report (IPCC 2021); the Japan Meteorological Agency centennial-scale SST analysis (Hirahara et al. 2014); the NOAA–CIRES–DOE Twentieth Century Reanalysis (Slivinski et al. 2019); and the European Centre for Medium-Range Weather Forecasts (ECMWF) reanalysis ERA 5 (Hersbach et. al. 2020).
In 2003, the World Meteorological Organization (WMO) approved a policy that all observational data exchanged internationally on the WMO Global Telecommunication System (GTS) be migrated from the Traditional Alphanumeric Codes (TAC) to the table-driven code forms (TDCF), such as the Binary Universal Form for Representation of Meteorological Data (BUFR) format (WMO 2019). To align with the WMO policy, many countries and programs have moved, or are currently in the process of moving, to the BUFR format in transmitting NRT marine observational data.
BUFR is designed for efficient exchange and storage of meteorological and oceanographic data. BUFR’s table-driven structure provides greater flexibility compared to the TAC format; BUFR can easily be extended and is self-describing. These table-driven and self-descriptive features in BUFR allow for greater reporting precision and more metadata (e.g., Pelletier 2008). In TAC, limited by the coding method, the observational values have limited precision; in BUFR, value precisions are prescribed in separate WMO tables thus the data can have their native precisions. For example, SST precision is 0.1°C in TAC but 0.01 K in BUFR; wind direction is in tens of degrees in TAC versus one degree in BUFR. Similarly, the platform metadata, instrument information, and data meanings can be fully documented in BUFR, while TAC only provides observation datetime, location, and maybe limited instrument information. Furthermore, the TDCF allows the modification of tables, providing a route for changes and additions in precision, metadata, and data type.
The marine in situ reports distributed over the GTS in the older TAC format, such as the FM-13 SHIP code (WMO 2019), have been available for decades and have been a major input source for ICOADS monthly operational updates. With more GTS marine data transmissions being made in BUFR starting in 2016, there were fewer observations in TAC available for inclusion in R3.0.1. This was particularly true for the observations from drifting buoys, as they were among the first marine observation systems fully transitioned from TAC to BUFR. Even in those cases, however, the TAC streams still serve as backup sources in cases of BUFR transmission failures or other unexpected BUFR data distribution issues, so combining different data streams from various GTS data collection centers further enhances the reliability of the merged data.
To ensure that the ICOADS NRT dataset meets the needs of users, including product developers (Freeman et al. 2019; Kent et al. 2019; Huang et al. 2020), the NOAA National Centers for Environmental Information (NCEI), in collaboration with ICOADS partners, began the process of recovering and decoding the NRT BUFR reports from ships, moored buoys, and drifting buoys. Realizing the impacts of the international format transitions to the operational R3.0.1 product, NCEI began working with the NOAA National Data Buoy Center (NDBC), the Canadian Department of Fisheries and Oceans (DFO) Global Data Assembly Center for Drifting Buoys (GDAC-CA), and the NOAA National Weather Service Telecommunications Gateway (NWSTG). The objectives of the collaboration are to collect previously transmitted BUFR reports but no longer available on the GTS, to establish operational ingest of real-time BUFR transmissions from the NWSTG, and to ensure that the BUFR data are preserved in the NCEI archives. Meanwhile, a BUFR tool (see section 3a) was installed to decode the BUFR reports, then convert them to the ICOADS International Maritime Meteorological Archive format (IMMA1) (Smith et al. 2016). The decoded drifting buoy data were immediately made publicly available as an auxiliary dataset starting in January 2016 until February 2019. Concurrently further development to decode other platform types transitioning to BUFR transmissions, such as moored buoys and ships, as well as needed adjustment to the quality control processes was undertaken to produce the new R3.0.2 with a goal to blend the NRT TAC and BUFR for a more complete marine in situ dataset.
In this paper, we describe the ICOADS input data sources in section 2. In section 3, we describe the methodologies of BUFR decoding; data processing for the data collected by drifting buoys, moored buoys, and ships; and the data merging and duplicate record elimination to produce ICOADS R3.0.2 “Total” and “Final” files (Freeman et al. 2017). Section 4 compares the number of reports and coverage for essential climate variables (ECVs) between R3.0.1 and the newly merged TAC + BUFR dataset, R3.0.2. Section 5 describes both quantitative and qualitative gains from this much-needed improvement for the ICOADS NRT stream. Section 6 provides insight into the future of ICOADS.
2. Input data sources
The ICOADS R3.0.2 covers the period from 1 January 2015 to the present, and is updated every month. Input data are collected in TAC and BUFR formats from the various sources of the WMO GTS data and archived at NCEI. The BUFR input data (Fig. 1) include messages provided by NWSTG, NDBC, and GDAC-CA. The BUFR data from NWSTG were less complete from January 2015 to September 2017; thus, in order to make a more complete dataset, data from NDBC and GDAC-CA BUFR streams were collected and combined to help fill in missing data. Duplicated records were subsequently identified in the Total dataset and removed from the Final dataset (see section 3c). Starting from September 2017, operational collection of the NWSTG stream was set up and this was used as the operational source for R3.0.2.
Historically, the TAC data were acquired from both NCEI and NOAA’s National Centers for Environmental Prediction (NCEP). There was a period in which R3.0.1 and R3.0.2 overlapped (January 2015 to December 2021). For this period, the R3.0.1 blend of NCEI and NCEP TAC streams serve as the TAC input source for R3.0.2, and were merged with BUFR data records. Starting in January 2022 and onward, R3.0.2 became operational and R3.0.1 production as a stand-alone version ceased.
3. Methodologies
a. BUFR decoding method
Figure 2 shows the process flow of the BUFR decoding and TAC + BUFR merging. The ECMWF ecCodes software (https://confluence.ecmwf.int/display/ECC/ecCodes+Home) was used for decoding the BUFR data. The first step was to identify which observed variables were transmitted in the BUFR messages. Next, the corresponding key for each ICOADS variable was located in the BUFR tables and mapped to the associated IMMA1 format field. The appropriate WMO tables provide the correct unit and scale factor for each variable. Appendix A shows the ICOADS IMMA1 variables and their designated BUFR keys, where direct mappings were available. More BUFR keys can be found in the WMO Manual on Codes (WMO 2019).
b. Converting decoded BUFR data to the ICOADS common format
Following the mapping of the BUFR data to the ICOADS IMMA1 data and metadata fields, the BUFR values were converted to IMMA1 format. Due to the different category schemes between BUFR and IMMA1, new mapping tables were required (appendix A).
Additional checks were conducted to quality control the BUFR platform type assignments. For example, through detailed examination and testing of the buoy ID rules from the WMO (https://community.wmo.int/rules-allocating-wmo-numbers) and visual inspection of reports, we were able to determine that some buoys (e.g., 4802008 and 4802009), distributed as moored buoys based on the WMO-assigned ID structure in the BUFR messages, were not stationary. Quality control allowed these buoys to be reassigned the correct platform type of drifting buoy (see https://rda.ucar.edu/datasets/ds539.5/docs/drift_buoy_bufr2imma1_v5.pdf for more information). There were reports which missed platform types; these platform types were inferred and assigned from their station IDs.
c. UID assignments
The ICOADS unique identifier (UID) is a unique six-digit, base-36 value assigned to each individual ICOADS report. The UID provides a record tracking mechanism since the UID assignments are permanently assigned to the report (Freeman et al. 2017). This makes locating specific records possible, as well as being able to associate error adjustments or other information (i.e., reanalysis feedback, or observational metadata) back to a specific ICOADS report. Additional background information on UIDs can be found in Smith et al. (2016). UIDs are assigned to all reports prior to duplicate removal and were first assigned in release 2.5.1, and UIDs are also assigned in the R3.0.0 and R3.0.1 Total files. From January 2015 to December 2021, UIDs from ICOADS R3.0.1 are carried forward into ICOADS R3.0.2 in order to preserve existing UIDs. New UIDs were assigned to the reports from the BUFR stream. From January 2022, after cessation of R3.0.1, each source will be processed separately and a new UID assigned to every report, i.e., the NCEI TAC and NCEI BUFR streams.
d. Data QC
ICOADS data originate from many different types of observing platforms, such as ships, drifting buoys, moored buoys, coastal stations, and fixed ocean platforms. Random errors and systematic biases may be introduced during the observation and transmission processes (Kennedy et al. 2011). This has been more common in ship observations e.g., due to poor data management of historical data and poor siting of instrumentation (Kent et al. 2017). Quality controls (QCs) have been used starting from COADS release 1 (Slutz et al. 1985) onward. Updated trimming limits were used for R3.0 as described at https://icoads.noaa.gov/e-doc/R3.0-stat_trim.pdf. Data that exceed climatological thresholds are also flagged during the QC process (Wolter 1997; Smith et al. 2021).
Using the ICOADS QC and trimming software, we assigned QC flags as shown in appendix B. QC flags were assigned to 14 selected weather and ocean variables, and the sum of the weighted flags (see quality code in appendix C) determined which report to retain among the duplicates. The QC procedures were slightly different for the TAC and BUFR streams, as described below.
Quality control of the BUFR data used the standard ICOADS QC software, with an additional step which compared the platform types between the TAC and BUFR streams to ensure that the platform types assigned to observation with the same ID were consistent between the TAC and BUFR streams.
To make the older TAC data compatible with the modern BUFR reports, the following checks and modifications were made to the TAC data: (i) dates were validated, and if an erroneous date was detected, they were kept in the Total files but removed from the Final files; (ii) platform types were evaluated and corrected for observations in which TAC and BUFR reported the same platform ID but different platform types; approximately 300 IDs were found with this issue; (iii) missing platform types were filled in wherever possible; (iv) five-digit buoy IDs in the TAC format were converted to seven-digit buoy IDs; (v) attachment C98 was added, which provides the UID attachment for each R3.0.2 IMMA1 report; (vi) ice platform type reports from TAC drifting and moored buoy reports were identified and marked with the correct platform type (the ice buoy platform type was previously unused in R3.0.1 and was defined simply as drifting or moored buoys, but defined as ice buoys in BUFR); (vii) fixing incorrect IDs, e.g., converting ID 9077070 to ID C6OM7 due to the International Maritime Organization IMO number (9077070) erroneously being encoded as the ID rather than the true call sign (C6OM7); and (viii) fixing incorrect ID indicators (II) (e.g., “II” should be “11” if it is a seven-digit buoy ID; “II” was assigned to zero in ICOADS R3.0.1 when it was a seven-digit buoy ID).
e. Duplicate elimination
An important step in the production of R3.0.2 is the identification of duplicate observational records for flagging and removal from the Final files (duplicates are retained in the Total files). The QC is important in the duplicate elimination process as QC flags (see appendix C) are assigned to weather variables (see appendix B) automatically by the ICOADS QC software suite. The sum of the weighted flags helps determine which report to retain as unique or as the “best” duplicate in cases where multiple exact or near-duplicate matches occur. Ideally, duplicate reports can be identified by exact comparison (exact match) of location, time, and observed elements within the reports. Due to the different input data sources and transmission formats, however, slight differences in TAC and BUFR precisions (retained digits) and encoding/decoding techniques have made duplicates more challenging to identify. Reports that previously were exact duplicates now have precision-related differences in one or more of their fields. For example, the latitude and longitude values from some reports in the TAC stream have a precision of 0.1°, but the same reports from the BUFR stream have a precision of 0.01°. This apparent discrepancy would distinguish and identify them as separate and unique reports in the R3.0.1 algorithm. To account for potential near-duplicates due to these changes in precision, we explicitly allow a 0.05° tolerance level of latitude or longitude to define a duplicate in the R3.0.2 algorithm. By relaxing the location comparison criteria in the BUFR data, most of the near-duplicate matches can now be detected.
Similar checks are applied to other variables during the duplicate flagging process: First, potential duplicates (dups) are defined within the same 1° × 1° box and within ±1 h (“hour cross”) or 1 day (“day cross”). A check is then performed for seven observed elements (wind speed, visibility, present weather, past weather, SLP, AT, and SST) to identify duplicates. The checks involving present and past weather fields uses “allowances,” which consider weather elements that match under some environmental circumstances even though they were not identical as discussed above [see the ICOADS duplicate elimination (dupelim) documentation at https://icoads.noaa.gov/e-doc/R3.0-dupelim.pdf].
Next, we determine which duplicate report should be retained in the Final dataset. The quality code (release 1, supp. K; Slutz et al. 1985), as computed by the NCDC-QC, is the basis for the selection of one duplicate report over another. If quality codes are identical, a priority list by deck is used to select one of the duplicates. If priorities are identical, the report with the highest deck number (DCK) is preferred. The deck number for BUFR data is 798. SID for BUFR is 172. The priority for BUFR is 1. If the decks are identical, the report with the highest source ID (SID) is preferred. If the SIDs are identical, the second report in sort order is selected. The duplicate status is assigned from 0 to 14, from unique to worst duplicate (Table 1), respectively. No reports, however, are deleted at this stage and all duplicates are retained in a “Total” file in case further examination is needed at a later time. BUFR data were given higher priority over TAC, as BUFR retains higher precisions, contains more metadata, and provides additional variables sometimes not available in the TAC stream.
Duplicate status (DUPS) assignments as noted in Table D8 from Smith et al. (2016). For more information, see the ICOADS R3.0 Duplicate Elimination and Related Processing documentation at https://icoads.noaa.gov/e-doc/R3.0-dupelim.pdf. Italic items are dup status not found in this TAC + BUFR merge practice.
Finally, reports that are not flagged as landlocked and have a duplicate status of ≤2, where 0 is unique, 1 is most unique, and 2 is best duplicate with substitute (DUPS = 2 not assigned in R3.0.2), are retained for the Final file without duplicates.
4. Comparisons between ICOADS R3.0.2 and R3.0.1
a. Number of reports in R3.0.2 versus R3.0.1
Figure 3 shows the monthly number of reports identified as unique from ship, surface drifting buoy and moored buoy observations in the ICOADS R3.0.2 (TAC + BUFR) and ICOADS R3.0.1 (TAC-only). An important feature of ICOADS NRT R3.0.2 is that it has much higher numbers of drifting buoy observations (Fig. 3b) because the transition from TAC to BUFR formats started earlier for surface drifters. The number of surface drifting buoy observations in the R3.0.1 Final file decreased from about 1.1 million (M) in 2015 to less than 0.1 M by the end of 2020 (Fig. 3b) as TAC transmission was phased out quickly for these data (Fig. 4). In contrast, the observation numbers of surface drifting buoys in the R3.0.2 Final file were steadily above about 1.2 M from 2015 onward. The number of ship observations was slightly higher in the R3.0.2 Final file than in R3.0.1 after January 2016 (Fig. 3a), with a slow but steady increase, in particular in 2020.
The TAC ship reports showed an obvious drop at the end of 2019. The moored buoy report numbers showed relative consistency between R3.0.2 and R3.0.1, largely due to the transition from TAC to BUFR formats started later for surface moored buoys than drifters. Many moored buoys are still transmitting in TAC format. We also noticed that the number of moored buoy observations was slightly lower in R3.0.2 than in R3.0.1 between January 2015 and September 2017. Three factors explain the slightly reduced report numbers of moored buoy in the R3.0.2 Final file: First, some reports with invalid dates were included in R3.0.1 but were removed in R3.0.2 Final files. Second, duplicate identification criteria in R3.0.2 were slightly different from R3.0.1 (section 3). Third, note that there was no specific ice buoy category in R3.0.1 and ice buoys were classified as moored or drifting buoys, while in R3.0.2 ice buoys are their own classification, causing the number of moored and drifting buoys to be lower in R3.0.2.
b. Detailed contribution to R3.0.2: BUFR versus TAC
Figures 4a–c show the report numbers for the raw BUFR and R3.0.1 Total files as input to the ICOADS NRT R3.0.2. Figure 4a shows that there were little BUFR reports for ship observations before January 2016. After January 2016, the number of BUFR reports began to appear (2 × 105) and increased gradually to approximately 3 × 105 by the end of 2020 except for a sudden drop in the mid-2016. In contrast, the raw TAC ship reports maintained a steady level until near the end of 2019, decreased rapidly into early 2020, and then remained steady again.
Figure 4d shows the relative contributions from the TAC and BUFR reports for ship observations after duplicate elimination for the merged and processed ICOADS NRT R3.0.2. Recall that when there are duplicated reports between TAC and BUFR, BUFR reports were given a higher priority selection for R3.0.2 (section 3). This procedure is shown in Fig. 4d: When BUFR report numbers were low (most of 2015 and a dip in mid-2016), contributions from TAC reports were high, and vice versa. Combined with duplicate elimination, the TAC-BUFR merged reports remained quite steady with an overall slow increase during 2015–20.
Similar results were noted for the surface drifting buoys and moored buoys. For the surface drifting buoys, the TAC remained the primary distribution format through most of 2015, followed by a steady decrease (Fig. 4b). The raw BUFR surface drifting buoy reports showed a rapid increase in mid-2015; by mid-2016, the BUFR reports exceeded the TAC reports. Once BUFR was the predominant distribution format for drifting buoys, BUFR reports were then given a higher priority in R3.0.2. Figure 4e shows that the BUFR report contribution surpassed the TAC report contribution by mid-2015 and since late 2016 the TAC report contribution for the R3.0.2 Final file has been small.
The moored buoy reports (Figs. 4c,f) showed a persistent and dominant contribution from TAC reports until late 2019 when the number of BUFR reports exceeded TAC reports and remained higher thereafter. The TAC report contribution has a clear seasonal cycle; this was due to the seasonal summer mooring deployments in the U.S. Great Lakes and coastal (including lake) moored buoys being primarily serviced in the summer months. In contrast, the number of moored buoy observations from BUFR had been slowly increasing from 2015 to near the end of 2019, at which time there was a noticeable shift from TAC observations to a majority of BUFR observations.
c. ECV reports in R3.0.2 versus R3.0.1
The numbers of ECV observations are generally higher in the ICOADS NRT R3.0.2 Final file than in R3.0.1, as quantified below. The numbers of wind speed and direction (Fig. 5a), AT (Fig. 5c) and relative humidity (RH; Fig. 5e) observations were slightly higher in R3.0.2 than in R3.0.1. The number of SLP observations (Fig. 5b) was about 2.8 M in R3.0.1, but increased in R3.0.2 to 3.7 M by 2019 and has remained relatively steady since then. The number of SST observations (Fig. 5d) decreased in R3.0.1 from about 2.7 M in 2015 to 2.2 M in 2019, but increased in R3.0.2 from about 2.7 M in 2015 to 3.5 M since mid-2019. In addition, there were no salinity observations (Fig. 5f) in R3.0.1, but 0.05 M are now available in R3.0.2 from the BUFR stream.
d. Spatial coverage: R3.0.2 versus R3.0.1
Figure 6 shows the spatial coverage of the ECVs in May 2019, which exhibited an increase in SST and SLP coverage and a smaller increase for AT.
Figure 7 presents the difference between the number of months with observations (SST, AT, SLP, and wind) in R3.0.2 and R3.0.1 in 2021. The metric is expressed as the difference of the number of months with data relative to the 12 months (in %). For example, 50% means that R3.0.2 has six additional months with observations compared to R3.0.1. Both the SST and SLP show significant increases (>20%) in most of the areas, particularly in the South Pacific Ocean, Indian Ocean, the North Atlantic Ocean, and the Arctic Ocean. AT and wind speed have the same pattern but smaller changes with much less spatial coverage. This is because the observation number increase mainly comes from drifting buoys in the BUFR stream. Drifting buoys usually report SST and SLP but not AT and wind speed. Since AT and wind are mostly from the ship reports, their spatial patterns are similar to the ship report availability (shown in Fig. 10).
Figure 8 shows the percentage of ocean 2° boxes covered by ECVs. SST coverage increased about 20% (Fig. 8a), SLP coverage increased about 15% (Fig. 8b), and AT and wind coverage increased about 3% (Fig. 8c,d). The small dip in R3.0.2 in 2018 is likely due to the relaxed duplicate removal criteria, but more investigation is required.
In addition to the ECV coverage, Fig. 9 shows the overall increase in the number of observations in the R3.0.2 Final file (Fig. 9b) over R3.0.1 (Fig. 9a) for May 2019. The increase in spatial coverage is mostly attributed to the increase in surface drifting buoys as discussed previously. Studies indicate that spatial data coverage is critical to downstream higher-level SST analysis and other data products, such as in the widely used Optimum Interpolation Sea Surface Temperature (OISST) product from combined satellite and in situ observations as the increased coverage of in situ observations in R3.0.2 is critical to correct satellite biases (e.g., Huang et al. 2019, 2021). The increased observation numbers and spatial coverages also increase the quality of uncertainty qualifications in surface marine observations and many products with ICOADS as an input dataset, such as SST (Kent et al. 2019), air–sea fluxes (Berry and Kent 2011), the World Ocean Database (Boyer et al. 2018), nighttime marine air temperature (e.g., Kent et al. 2013; Hirahara et al. 2014; Kennedy et al. 2019; Huang et al 2017), and climate change assessments (e.g., Hausfather et al. 2017; IPCC 2021). The increase in sea level pressure is also critical to the quality of weather and climate forecast and reanalysis (e.g., Slivinski et al. 2019).
Figure 10 shows the relative difference between the number of months with observations (drifting buoys, moored buoys, and ships) in R3.0.2 and R3.0.1 in 2021. For drifting buoys, there is an obvious increase of 20% in nearly all the oceans, particularly in most of the midlatitude oceans and almost all the North Atlantic Ocean, the increase is over 50%. Spatial coverage of moored buoy is patchy but the data availability has increased.
It is worth mentioning that in a small region of the North Sea, there seems a 100% decrease in moored buoy coverage. This is a spurious signal due to incorrect platform type assignment in some TAC data and corrected in R3.0.2. For example, ID 6200114 was incorrectly assigned as moored buoy in TAC but in R3.0.2 it was correctly assigned as a fixed ocean platform.
The coverage increase of ship data is overall less than 20% but over 20% along some tracks (e.g., in the Atlantic Ocean between Europe and North America, in the South Pacific between Australia and New Zealand, and Antarctica).
5. Data access
ICOADS NRT R3.0.2 data include the following:
-
R3.0.2 Total files: Raw TAC + BUFR blended, duplicate status marked.
-
R3.0.2 Final files: Processed TAC + BUFR merged data with duplicate reports removed. Land mask checked. R3.0.2 Final files are also available in netCDF format.
-
Monthly Summary Groups (MSG) statistic files, as described below.
Both the standard and enhanced trimming level data are available on demand from the National Center for Atmospheric Research (NCAR) ICOADS RDA dataset ds548.0 (ICOADS 2017). Users can request data subsets and customize their own data QC trimming and filtering on ICOADS records.
The enhanced and standard trimming level data are used in turn to calculate the statistics for the monthly summaries on 2° × 2° and 1° × 1° grids. Ten statistical measures (such as the mean and standard deviation) are calculated for 22 observed and derived variables (Slutz et al. 1985). These variables and their statistics computed using the available data are shown in Tables 2 and 3. These noninterpolated summary data provide a quick look of the key environmental variables and their statistical values.
Variables in the ICOADS NRT R3.0.2 Monthly Summary Group (MSG). For more information, see the ICOADS R2.5 General Information about Statistics documentation at https://icoads.noaa.gov/e-doc/R2.5-stat_doc.pdf.
Statistics in MSG. Units refer to Table 2 if the units depend on a variable. For more information, see the ICOADS R2.5 General Information about Statistics documentation at https://icoads.noaa.gov/e-doc/R2.5-stat_doc.pdf.
ICOADS NRT R3.0.2 data access options have been updated on the ICOADS Data and Products page at https://icoads.noaa.gov/products.html, where several file format and data access options are available.
6. Summary and future work
By combining new BUFR data with existing TAC data, an improved ICOADS NRT product has been developed and released as ICOADS NRT R3.0.2. The number of reports in ICOADS NRT R3.0.2 increased by about 30% as compared to R3.0.1. The spatial ocean coverage for essential climate variables (ECVs) increased by about 20% on monthly 2° × 2° grids, and added a new variable for salinity. The new release will improve downstream products significantly, as demonstrated in OISST v2.1 (Huang et al. 2021). With enhanced ingest monitoring, monthly generated plots to assess the health of the observing system to identify any abnormal platform type distributions, and daily use/monitoring by products such as OISST, NCEI is better equipped to handle future NRT format changes. NCEI will also be able to more speedily include new platform types from the BUFR format as they become available.
In the future, NRT GTS data streams from other GTS data collection centers [such as the Navy FNMOC and European Centre for Medium-Range Weather Forecasts (ECMWF)] will be utilized as well. Although all are collected from the same WMO GTS transmission system, collections from different centers are slightly different; merging them will increase the dataset completeness, and it is a good practice to monitor the input data streams for dropped or missing data and respond appropriately to the data providers about such issues.
We will include other observing platforms such as Argo floats, and the NDBC’s Coastal Marine Automated Network (C-MAN) stations in NRT dataset at a future date. Inclusion of Argo floats will increase ocean coverage in the open oceans, outside of main shipping lanes. CMAN stations will enhance coastal data availability for both ocean and land data users.
The QC process can also be improved, by using new climatology data for the trimming, and including additional checks such as internal consistency check (e.g., track checks and spike checks), and mutual consistency check or buddy check (e.g., Lorenc and Hammon 1988). The merging and duplicate identification processes need to be modernized. In the current TAC and BUFR merge process, whole reports were either excluded or kept in the R3.0.2 Final files, resulting in the possible loss of any additional parameters contained in the excluded report. We will examine the possibility of blending near-duplicate reports in order to composite reports for improved quality and completeness of the data.
Acknowledgments.
We would like to thank NOAA NDBC, the DFO Canadian Global Data Assembly Centre (GDAC-CA) for Drifting Buoys, and the NOAA NWSTG for providing BUFR data. We appreciate the consistent NRT data support for ICOADS provided by Diane Stokes over decades. We would also like to especially thank the ICOADS international partners and users for their input and evaluation of the R3.0.2 data and the NCEI Graphics Team for help with graphics. We thank David Sallis and Xungang Yin for their scientific code reviews and help in coding diagnoses. We thank two NCEI internal reviews for improving this manuscript. Finally, we are grateful for the ICOADS user community that continues to provide their data requirements and feedback necessary to improve ICOADS. This study was supported by the NOAA Grant NA19NES4320002 [Cooperative Institute for Satellite Earth System Studies (CISESS)] at the University of Maryland/ESSIC. EK was supported by NERC Grant Reference NE/R015953/1.
Data availability statement.
ICOADS R3.0.2 data are available at https://www.ncei.noaa.gov/data/marine/nrt/, https://www.ncei.noaa.gov/data/marine/msg/, and https://rda.ucar.edu/datasets/ds548.0/.
APPENDIX A
ICOADS IMMA1 Variables and Their Keys in the BUFR Reports
Table A1 presents ICOADS IMMA1 variables and their keys in the BUFR Reports, where direct mappings were available.
ICOADS IMMA1 variables and their keys in the BUFR reports. Note: For more detailed information on these parameters, see the IMMA1 format documentation at https://icoads.noaa.gov/e-doc/imma/R3.0-imma1.pdf.
APPENDIX B
Selected Variables and Their Possible Flag Values
Table B1 shows the QC flags of the 14 selected weather and ocean variables.
Selected variables and their possible flag values. Modified from ICOADS release 1, supplement J. See appendix C for definitions of the QC flag values. For more information, see the ICOADS release 3.0 Quality Control and Related Processing document at https://icoads.noaa.gov/e-doc/R3.0-stat_trim.pdf.
APPENDIX C
QC Flag Meaning, Value, and Weight
Table C1 presents QC flag meaning, value, and weight. The sum of the weighted flags is quality code, which determines which report to retain among the duplicates.
QC flag meaning, value, and weight. Modified from ICOADS release 1, supplement J. For more information, see the ICOADS release 3.0 Quality Control and Related Processing document at https://icoads.noaa.gov/e-doc/R3.0-stat_trim.pdf.
REFERENCES
Banzon, V., T. M. Smith, T. M. Chin, C. Liu, W. Hankins, 2016: A long-term record of blended satellite and in situ sea-surface temperature for climate monitoring, modeling and environmental studies. Earth Syst. Sci. Data, 8, 165–176, https://doi.org/10.5194/essd-8-165-2016.
Berry, D. I., and E. C. Kent, 2011: Air–sea fluxes from ICOADS: The construction of a new gridded dataset with uncertainty estimates. Int. J. Climatol., 31, 987–1001, https://doi.org/10.1002/joc.2059.
Boyer, T. P., and Coauthors, 2018: World Ocean Database 2018. NOAA Atlas NESDIS Tech. Rep., 87 pp.
Cornes, R. C., E. C. Kent, D. I. Berry, and J. J. Kennedy, 2020: CLASSnmat: A global night marine air temperature data set, 1880–2019. Geosci. Data J., 7, 170–184, https://doi.org/10.1002/gdj3.100.
Freeman, E., and Coauthors, 2017: ICOADS release 3.0: A major update to the historical marine climate record. Int. J. Climatol., 37, 2211–2232, https://doi.org/10.1002/joc.4775.
Freeman, E., and Coauthors, 2019: The International Comprehensive Ocean-Atmosphere Data Set—Meeting users needs and future priorities. Front. Mar. Sci., 6, 435, https://doi.org/10.3389/fmars.2019.00435.
Hausfather, Z., K. Cowtan, D. C. Clarke, P. Jacobs, M. Richardson, and R. Rohde, 2017: Assessing recent warming using instrumentally homogeneous sea surface temperature records. Sci. Adv., 3, e1601207, https://doi.org/10.1126/sciadv.1601207.
Hersbach, H., and Coauthors, 2020: The ERA5 global reanalysis. Quart. J. Roy. Meteor. Soc., 146, 1999–2049, https://doi.org/10.1002/qj.3803.
Hirahara, S., M. Ishii, and Y. Fukuda, 2014: Centennial-scale sea surface temperature analysis and its uncertainty. J. Climate, 27, 57–75, https://doi.org/10.1175/JCLI-D-12-00837.1.
Huang, B., and Coauthors, 2015: Extended Reconstructed Sea Surface Temperature version 4 (ERSST.v4). Part I. Upgrades and intercomparisons. J. Climate, 28, 911–930, https://doi.org/10.1175/JCLI-D-14-00006.1.
Huang, B., and Coauthors, 2017: Extended Reconstructed Sea Surface Temperature, version 5 (ERSSTv5): Upgrades, validations, and intercomparisons. J. Climate, 30, 8179–8205, https://doi.org/10.1175/JCLI-D-16-0836.1.
Huang, B., C. Liu, G. Ren, H.-M. Zhang, and L. Zhang, 2019: The role of buoy and Argo observations in two SST analyses in the global and tropical Pacific oceans. J. Climate, 32, 2517–2535, https://doi.org/10.1175/JCLI-D-18-0368.1.
Huang, B., C. Liu, V. Banzon, E. Freeman, G. Graham, B. Hankins, T. Smith, and H.-M. Zhang, 2020: Improvements of the Daily Optimum Interpolation Sea Surface Temperature (DOISST) version 2.1. J. Climate, 34, 2923–2939, https://doi.org/10.1175/JCLI-D-20-0166.1.
Huang, B., C. Liu, E. Freeman, G. Graham, T. Smith, and H.-M. Zhang, 2021: Assessment and intercomparison of NOAA Daily Optimum Interpolation Sea Surface Temperature (DOISST) version 2.1. J. Climate, 34, 7421–7441, https://doi.org/10.1175/JCLI-D-21-0001.1.
ICOADS, 2017: International Comprehensive Ocean-Atmosphere Data Set (ICOADS) release 3, individual observations. NCAR CISL Research Data Archive, accessed 6 May 2021, https://doi.org/10.5065/D6ZS2TR3.
IPCC, 2021: Annex I: Observational products. Climate Change 2021: The Physical Science Basis, V. Masson-Delmotte et al., Eds., Cambridge University Press, 2061–2085.
Junod, R. A., and J. R. Christy, 2019: A new compilation of globally gridded night-time marine air temperatures: The UAHNMATv1 dataset. Int. J. Climatol., 40, 2609–2623, https://doi.org/10.1002/joc.6354.
Kennedy, J. J., N. A. Rayner, R. O. Smith, D. E. Parker, and M. Saunby, 2011: Reassessing biases and other uncertainties in sea surface temperature observations measured in situ since 1850: 2. Biases and homogenization. J. Geophys. Res., 116, D14104, https://doi.org/10.1029/2010JD015220.
Kennedy, J. J., N. A. Rayner, C. P. Atkinson, and E. E. Killick, 2019: An ensemble data set of sea‐surface temperature change from 1850: The Met Office Hadley Centre HadSST.4.0.0.0 data set. J. Geophys. Res. Atmos., 124, 7719–7763, https://doi.org/10.1029/2018JD029867.
Kent, E. C., N. A. Rayner, D. I. Berry, M. Saunby, B. I. Moat, J. J. Kennedy, and D. E. Parker, 2013: Global analysis of night marine air temperature and its uncertainty since 1880: The HadNMAT2 data set. J. Geophys. Res. Atmos., 118, 1281–1298, https://doi.org/10.1002/jgrd.50152.
Kent, E. C., and Coauthors, 2017: A call for new approaches to quantifying biases in observations of sea surface temperature. Bull. Amer. Meteor. Soc., 98, 1601–1616, https://doi.org/10.1175/BAMS-D-15-00251.1.
Kent, E. C., and Coauthors, 2019: Observing requirements for long-term climate records at the ocean surface. Front. Mar. Sci., 6, 441, https://doi.org/10.3389/fmars.2019.00441.
Lorenc, A. C., and O. Hammon, 1988: Objective quality control of observations using Bayesian methods: Theory, and a practical implementation. Quart. J. Roy. Meteor. Soc., 114, 515–543, https://doi.org/10.1002/qj.49711448012.
Morice, C. P., and Coauthors, 2021: An updated assessment of near-surface temperature change from 1850: The HadCRUT5 data set. J. Geophys. Res. Atmos., 126, e2019JD032361, https://doi.org/10.1029/2019JD032361.
Pelletier, Y., 2008: A primer on writing BUFR templates, version 1.4. Meteorological Service of Canada National Prediction Operations Division Doc., 25 pp., https://www.eumetnet.eu/wp-content/uploads/2017/04/OPERA_BUFR_template_primer_V1.4.pdf.
Rayner, N. A., D. E. Parker, E. B. Horton, C. K. Folland, L. V. Alexander, D. P. Rowell, E. C. Kent, and A. Kaplan, 2003: Global analyses of sea surface temperature, sea ice, and night marine air temperature since the late nineteenth century. J. Geophys. Res., 108, 4407, https://doi.org/10.1029/2002JD002670.
Reynolds, R. W., T. M. Smith, C. Liu, D. B. Chelton, K. S. Casey, and M. G. Schlax, 2007: Daily high-resolution-blended analyses for sea surface temperature. J. Climate, 20, 5473–5496, https://doi.org/10.1175/2007JCLI1824.1.
Slivinski, L. C., and Coauthors, 2019: Towards a more reliable historical reanalysis: Improvements for version 3 of the Twentieth Century Reanalysis system. Quart. J. Roy. Meteor. Soc., 145, 2876–2908, https://doi.org/10.1002/qj.3598.
Slutz, R. J., S. J. Lubker, J. D. Hiscox, S. D. Woodruff, R. L. Jenne, R. H. Joseph, P. M. Steurer, and J. D. Elms, 1985: Comprehensive Ocean-Atmosphere Data Set: Release 1. NOAA Environmental Research Laboratories Climate Research Program Doc., NTIS PB86-105723, 268 pp.
Smith, S. R., and Coauthors, 2016: The International Maritime Meteorological Archive (IMMA) format. Accessed 31 May 2016, http://icoads.noaa.gov/e-doc/imma/R3.0-imma1.pdf.
Smith, S. R., R. Richardson, J. P. Stow, M. A. Bourassa, and H. McMillan, 2021: A new technique for century-scale wind component indices. Front. Mar. Sci., 9, 661473, https://doi.org/10.3389/feart.2021.661473.
Vose, R. S., and Coauthors, 2012: NOAA’s Merged Land–Ocean Surface Temperature analysis. Bull. Amer. Meteor. Soc., 93, 1677–1685, https://doi.org/10.1175/BAMS-D-11-00241.1.
WMO, 2019: Manual on codes. World Meteorological Organization, accessed 1 May 2020, https://library.wmo.int/doc_num.php?explnum_id=11283.
Wolter, K., 1997: Trimming problems and remedies in COADS. J. Climate, 10, 1980–1997, https://doi.org/10.1175/1520-0442(1997)010<1980:TPARIC>2.0.CO;2.
Woodruff, S. D., R. J. Slutz, R. L. Jenne, and P. M. Steurer, 1987: A comprehensive ocean-atmosphere dataset. Bull. Amer. Meteor. Soc., 68, 1239–1250, https://doi.org/10.1175/1520-0477(1987)068<1239:ACOADS>2.0.CO;2.
Woodruff, S. D., S. J. Lubker, K. Wolter, S. J. Worley, and J. D. Elms, 1993: Comprehensive Ocean–Atmosphere Data Set (COADS) release 1a: 1980–92. Earth Syst. Monit., 4, 1–8.
Zhang, H.-M., and Coauthors, 2019: Updated temperature data give a sharper view of climate trends. Eos, 100, https://doi.org/10.1029/2019EO128229.