A key BEDI-wide strategy to enhance data usability is the organization of data collections into twelve Societal Benefit Areas, or SBA. The BEDI SBA align with nine environmental fields adopted internationally by the Global Earth Observation System of Systems (GEOSS) project conducted by the Group on Earth Observations (GEO), of which the U.S. is one of more than 100 member nations. The BEDI SBA are overarching environmental fields of interest (such as disasters, energy, climate, and agriculture) that organize Earth science data into discrete fields where they can be more easily accessed and discovered by the global user community. EOSDIS DAACs are mapping individual datasets in their collections into these SBA, including ensuring that dataset imagery are available through GIBS, when applicable.
Just as EOSDIS will continue to increase the number of images available through GIBS, data users also can expect improvements to clients that use GIBS imagery, like Worldview. Recent Worldview enhancements based on SBA include the ability to search for imagery related to specific hazards and as well as by science disciplines.
Data usability also is improved through systems designed to deliver data rapidly and with minimal processing for use in managing time-critical events, such as wildfires and ice floe assessments. One such service is the EOSDIS Land, Atmosphere Near real-time Capability for EOS (LANCE), which provides imagery of Earth observations generally within three hours of an observation. While not intended for scientific research, LANCE products help further the BEDI objectives of making Earth science data more easily usable by a broader base of user communities, such as resource managers, policy analysts, and local governments.
NASA BEDI Specifics: Enhancing EOSDIS Data Discoverability
A critical element to accomplishing BEDI goals and objectives is ensuring that the metadata associated with federal civil Earth observation data are complete and consistent. Improving EOSDIS data discovery begins with a thorough review of the metadata associated with EOSDIS data and datasets to verify that they are complete and meet international metadata standards.
Metadata are data about data, and include (but are not limited to) data attributes such as quality, lineage, and acquisition parameters. Metadata are used in all aspects of NASA’s Earth science data lifecycle, from initial measurements to the search and discovery of processed data. Earth observing missions use metadata in science data products when describing information such as the instrument/sensor, operational plan, or geographic region sampled. DAACs use metadata for preservation, access, and distribution of data and data products.
As the EOSDIS data collection grew over the years, this led to EOSDIS metadata based on multiple and disparate systems, each requiring different formats and different mechanisms for submitting and updating data entries. This not only reduced the value of the metadata, but led to users having difficulty discovering relevant data and datasets. To correct this problem, EOSDIS created the Common Metadata Repository (CMR).
The CMR is a single, shared, scalable metadata repository for all NASA Earth science data that merges all existing capabilities and metadata from existing NASA Earth science metadata systems, such as the Global Change Master Directory (GCMD) and the EOSDIS’ Common Metadata Repository (CMR). In addition, the CMR serves as the definitive management system for EOSDIS metadata, and includes metadata from EOSDIS data collections as well as from Earth science data collections outside EOSDIS (such as the GCMD). CMR metadata also are formatted to meet the International Standards Organization (ISO) 19100 series of standards, which applies to geophysical metadata (such as ISO 19115 and 19139).
Assembling metadata from Earth observing data collections into a single repository based on international metadata standards requires a review of DAAC dataset metadata, including metadata that are part of ECHO and GCMD datasets. This review includes the use of tools and techniques to:
- Compare metadata recommendations and dialects
- Identify the structure of metadata collections
- Compare the structure of metadata collections
- Evaluate and measure metadata completeness with respect to recommendations
- Evaluate and measure metadata completeness with respect to specific organization goals
As of the end of August 2016, 95.8% of ECHO and GCMD BEDI metadata have been evaluated for completeness and accuracy; this process is still ongoing for CMR datasets and no metrics are currently available. Ensuring that EOSDIS metadata are complete, accurate, and adhere to international standards vastly improves the discovery, access, and use of Earth science data across organizations and, through this, significantly enhances data discovery.
The CMR also is the foundation for EOSDIS Earthdata Search. Earthdata Search provides access to EOSDIS services for data discovery, filtering, and visualization, and uses the CMR to conduct sub-second searches through the entire EOSDIS metadata catalog. Once BEDI is fully implemented, an EOSDIS data user will be able to use Earthdata Search to discover data across multiple agencies.
Along with ensuring that EOSDIS metadata are complete, easily searchable, and based on international standards, data discoverability is further enhanced by registering appropriate DAAC data with a digital object identifier (DOI). A DOI is a unique sequence of numbers and letters that identify an object, such as a dataset or journal article. DOIs are assigned and regulated by the International DOI Foundation (IDF) and based on international standards (ISO 26324, Digital Object Identifier System). According to the IDF, approximately 130 million DOIs have been assigned worldwide as of June 2016.