A public/private partnership involving NASA and IBM Research has led to the release of NASA's first open-source geospatial artificial intelligence (AI) foundation model for Earth observation data. Built using NASA's Harmonized Landsat and Sentinel-2 (HLS) dataset, the release of the HLS Geospatial Foundation Model (HLS Geospatial FM) is a milestone in the application of AI for Earth science. The model has a wide range of potential applications, including tracking changes in land use, monitoring natural disasters, and predicting crop yields. The HLS Geospatial FM is available at Hugging Face, a public repository for open-source machine learning models.
NASA's Interagency Implementation and Advanced Concepts Team (IMPACT) played a major role in this work. Located at NASA's Marshall Space Flight Center in Huntsville, Alabama, IMPACT is a component of NASA's Earth Science Data Systems (ESDS) Program and is charged with expanding the use of NASA Earth observation data through innovation, partnerships, and technology, including the application of AI to these data.
"AI foundation models for Earth observations present enormous potential to address intricate scientific problems and expedite the broader deployment of AI across diverse applications,” says Dr. Rahul Ramachandran, IMPACT manager and a senior research scientist at Marshall. “We call on the Earth science and applications communities to evaluate this initial HLS foundation model for a variety of uses and share feedback on its merits and drawbacks."
Along with NASA and IBM Research, this collaborative effort included Clark University's Center for Geospatial Analytics, ESA (European Space Agency), USGS, and the U.S. Department of Energy's Oak Ridge National Laboratory. This work is part of NASA's Open-Source Science Initiative (OSSI), a commitment to building an inclusive, transparent, and collaborative open science community over the next decade. Development of the HLS Geospatial FM began in January 2023, and the FM was released in July 2023.
The Significance of Foundation Models
Foundation models (FMs) are types of AI models trained on a broad set of unlabeled data. They can be used for different tasks and can apply information about one situation to another. The goal of the NASA/IBM work is to provide an easier way for researchers to analyze and draw insights from large NASA datasets related to Earth processes.
"We believe that foundation models have the potential to change the way observational data are analyzed and help us to better understand our planet," says NASA Chief Science Data Officer Kevin Murphy. "And by open-sourcing such models and making them available to the world, we hope to multiply their impact."
AI FMs have the potential to play a pivotal role in understanding our planet's interconnected processes and the climate effects of ongoing natural and human-caused changes. FMs that are pretrained on Earth observation data can accelerate the analysis of tremendous amounts of data in two primary ways.
First, FMs do not need large training datasets, which can be laborious and resource-intensive to create. The ability to train FMs on much smaller datasets can save time and money. Second, FMs can reduce redundant efforts to build downstream applications, which use FM output to perform a specific task, such as tracking changes in land use or monitoring natural disasters.