X

NV5 Geospatial Blog

Each month, NV5 Geospatial posts new blog content across a variety of categories. Browse our latest posts below to learn about important geospatial information or use the search bar to find a specific topic or author. Stay informed of the latest blog posts, events, and technologies by joining our email list!



Not All Supernovae Are Created Equal: Rethinking the Universe’s Measuring Tools

Not All Supernovae Are Created Equal: Rethinking the Universe’s Measuring Tools

6/3/2025

Rethinking the Reliability of Type 1a Supernovae   How do astronomers measure the universe? It all starts with distance. From gauging the size of a galaxy to calculating how fast the universe is expanding, measuring cosmic distances is essential to understanding everything in the sky. For nearby stars, astronomers use... Read More >

Using LLMs To Research Remote Sensing Software: Helpful, but Incomplete

Using LLMs To Research Remote Sensing Software: Helpful, but Incomplete

5/26/2025

Whether you’re new to remote sensing or a seasoned expert, there is no doubt that large language models (LLMs) like OpenAI’s ChatGPT or Google’s Gemini can be incredibly useful in many aspects of research. From exploring the electromagnetic spectrum to creating object detection models using the latest deep learning... Read More >

From Image to Insight: How GEOINT Automation Is Changing the Speed of Decision-Making

From Image to Insight: How GEOINT Automation Is Changing the Speed of Decision-Making

4/28/2025

When every second counts, the ability to process geospatial data rapidly and accurately isn’t just helpful, it’s critical. Geospatial Intelligence (GEOINT) has always played a pivotal role in defense, security, and disaster response. But in high-tempo operations, traditional workflows are no longer fast enough. Analysts are... Read More >

Thermal Infrared Echoes: Illuminating the Last Gasp of a Dying Star

Thermal Infrared Echoes: Illuminating the Last Gasp of a Dying Star

4/24/2025

This blog was written by Eli Dwek, Emeritus, NASA Goddard Space Flight Center, Greenbelt, MD and Research Fellow, Center for Astrophysics, Harvard & Smithsonian, Cambridge, MA. It is the fifth blog in a series showcasing our IDL® Fellows program which supports passionate retired IDL users who may need support to continue their work... Read More >

A New Era of Hyperspectral Imaging with ENVI® and Wyvern’s Open Data Program

A New Era of Hyperspectral Imaging with ENVI® and Wyvern’s Open Data Program

2/25/2025

This blog was written in collaboration with Adam O’Connor from Wyvern.   As hyperspectral imaging (HSI) continues to grow in importance, access to high-quality satellite data is key to unlocking new insights in environmental monitoring, agriculture, forestry, mining, security, energy infrastructure management, and more.... Read More >

1345678910Last
17714 Rate this article:
5.0

Big Data Science with Climate Analytics-as-a-Service

NASA Center for Climate Simulation Demonstrates On-demand Analytic Processing for Climate Change Research

Anonym

As one might easily imagine, the Big Data domain of climate science has been faced with unprecedented growth. At the NASA Center for Climate Simulation (NCCS), data scientists carefully curate the Modern-Era Retrospective Analysis for Research and Applications (MERRA) data collection, a synthesis of more than thirty years of observational data integrated with numerical models that is of increasing importance to climate change research.

 

NCCS has set up MERRA Analytic Services (MERRA/AS) to perform analyses using the MapReduce parallel computing approach running on Hadoop technology. In order to integrate the capabilities of the system for practical use, the Climate Model Data Services (CDS) API has been provided to support web service access for consumer applications, basic instructions from a command line interface, and advanced programmatic capabilities through python development. The Climate Analytics-as-a-Service (CAaaS) technology stack can be deployed on local enterprise hardware or on the cloud.

 Climate models project 21st century global temperatures.

credit: NASA's Scientific Visualization Studio and NASA Center for Climate Simulation

 

MERRA spans across 160 terabytes, so it makes perfect sense that the analytical services are backed by some serious computational horsepower. In fact, the Hadoop MapReduce operations are running on a computing cluster powered by 36 Dell R710 servers, each with twelve 3 terabyte hard drives and an internal OS disc. Everything is connected through a 36-port InfiniBand switch and a 48-port Gigabit Ethernet switch. Overall, the cluster is capable of around 11 teraflops.

 

CAaaS provides a climate research specialization of the business-process-as-a-service concept, something that promises to continue gaining popularity as the cloud computational universe evolves. It provides capabilities which themselves demonstrate the power that such an approach may yield: high-performance and adaptive data proximal analytics, scalable data management, software as a virtualized appliance, and a generalized API that exposes reusable data services. NCSS's hope is that it will serve as a useful resource for developing and evaluating the next generation of climate data analysis tools and capabilities. With a promised reduction in the time spent in the preparation of data used to compare different data models – a long sought goal of the climate research community – MERRA/AS and CAaaS are a great real world example of Hadoop and MapReduce being used to drive experimental development of high-performance analytical applications in the climate science domain.

Please login or register to post comments.