NASA logo
Goddard Space Flight Center



Sun and Earth picture

NASA's Heliophysics Digital
Resource Library (HDRL):

Data and Services for the Heliophysics
System Observatory

NASA HDRL

Highlights

Find data, Browse data, Access data

See the Sun!!

Get Data:
Space Solar
OMNI Orbits

Register Datasets with "SPASE"

Analyze Data: SPEDAS, SolarSoft, Autoplot

Find Python tools: SunPy, SpacePy, Pysat, PySPEDAS ...

Cite Data with DOIs

Data: The Core of HP Research

Heliophysics (HP) studies the nature and dynamical interactions of the Sun, the heliosphere, and the plasma environments of the planets based on data from a fleet of spacecraft termed the "Heliophysics System Observatory" (HSO). This requires easy access to data and tools from a distributed set of active archives. The Heliophysics Digital Resource Library (HDRL -- formerly The Heliophysics Data Environment or HPDE) is the collective set of data, documentation, and tools resulting from and using the HSO. The HDRL maintains openly accessible data that are independently scientifically usable. The HDRL is expanding the HPDE to incude new services for solving larger ("big data") problems and for supporting open science research with software, research data, and data from a wide variety of platforms, large and small.

How It Works

A quick summary of the HDRL can be found on NASA's main Heliophycs Data page, which has links to important documents such as a Project Data Management Plan Template and the NASA Heliophysics Division Science Data Management Policy. The latter defines the HDRL as "the public face to access data and related documentation, tools, and services of NASA’s HSO and research efforts," and notes that its archives are the final location for all NASA HP scientific data and documentation. The discussion and links below provide a much more extensive view of what the HDRL encompasses. The HDRL is built from data systems driven by community needs and guidance and founded on community-based standards. Consistent with this approach, data providers and data users share responsibility for the quality and proper use of the data for research. Here's a 3 minute intro to the the HDRL (HPDE). Also, here is a report on what the HP community would like to see to improve the HDRL.

Data Discovery: Finding the Data You Need

How can I search for (and browse and access) HP datasets?

The Heliophysics Data Portal (HDP) provides access to a registry of HP data products that can be searched by time, measurement type, observed region, time resolution, spacecraft or observatory name, any text, and other means, in any order. It is made possible by the use of uniform metadata provided by the uniform "SPASE" terminology. A simple Help page will get you started. Many datasets can be accessed and plotted directly from the HDP, but in all cases the HDP provides the most direct access routes to the data.


Where does HP archive NASA (and other) data, and what else do the archives do?

NASA has a responsibility to keep data easily available long-term, and for this it cannot depend on mission archives after missions end. NASA HP now expects that missions will deliver data in standard formats (CDF, FITS, NetCDF) to the the Space Physics Data Facility (SPDF; space physics and ITM data) or the Solar Data Analysis Center (SDAC; solar data) as the data are produced. Working with the archives improves the data quality, accessibility, and documentation. The archives are also responsible for a number of tools including SolarSoft, CDAWeb, and SSCWeb, as well as the maintenance and upgrading of the CDF data format standard.


What other archives serve data of relevance to HP?

There are many other archives that serve data of relevance to HP, both from other agencies and from other nations. Some of the larger ones are listed here. In addition, many missions serve data directly; check the HDP for access routes to specific products.

  • NOAA's National Centers for Environmental Information (formerly the National Geophysical Data Center) (NGDC/NCEI) is a source for ground-based activity indices, GOES X-ray, magnetic field, and particle data, cosmic ray datasets, DMSP images and particle data, and other Heliophysics related data. Time series plots, data inventories, and data downloads are available, and a Web service provides easy direct access from applications such as IDL.

  • The ESAC Science Data Center (ESDC), located at the European Space Astronomy Center in Madrid, Spain, provides web browser and web service access to a large collection of datasets from Astronomy and Planetary Science in addition to HP missions such as Cluster, Ulysses, SOHO, and Proba-2.

  • JAXA's Data Archives and Transmission System (DARTS) is a rich source of Japanese solar and space physics data (among other things). The site provides Geotail, Hinode, Yohkoh, Akebono, and other datasets and a "Conjunction Event Finder" (CEF) that allows convenient data browsing, determination of satellite footpoints, and other useful services.

  • The French Centre de Données de Physique des Plasmas (CDPP) has large amounts of HP data from earlier (ARCAD, Interball, etc.) and current (Cluser, STEREO, Ulysses, etc.) missions. Search by keywords or quicklooks, and download selected data.

  • Madrigal is an upper atmospheric science database used by groups throughout the world. Madrigal is a robust, World Wide Web based system capable of managing and serving archival and real-time data, in a variety of formats, from a wide range of upper atmospheric science instruments. Data at each Madrigal site is locally controlled and can be updated at any time, but shared metadata between Madrigal sites allow searching of all Madrigal sites at once from any Madrigal site.

  • SuperMAG is a worldwide collaboration of organizations and national agencies that currently operate more than 300 ground based magnetometers. SuperMAG provides easy access to validated ground magnetic field perturbations in the same coordinate system, identical time resolution and with a common baseline removal approach.

Data Services: Access, Visualization, and Analysis

What services does the HDRL provide for Solar Physics?

Helioviewer (and JHelioviewer): Overlays, movies, and other info for solar images

A general visualization tool for registered solar images from many missions, including some ground-based. Available ata products include multi-wavelength SDO, SOHO, Hinode, and other images in multiple wavelengths, with easy overplotting and movie-making. Many videos are already archived on YouTube. The European version, JHelioviewer, runs on desktops/laptops and adds such things as potential-field modeled magnetic field lines. Go to Helioviewer or JHelioviewer to try them out. Both applications include links to the Heliophysics Events Knowledgebase (not supported by HDRL but by HP missions) to provide solar events and structures.


Virtual Solar Observatory: Web browser and application access to solar data

Web browser and API access to most solar physics data. Searches can be made by time range, mission, instrument, observables, nicknames (e.g., “H-alpha”) and spectral ranges. VSO is integrated into SolarSoft to provide API access to data via IDL. See an info link and a basic data search link.


SolarSoft: Access and analysis routines covering most of solar physics (mostly IDL)

A mostly IDL-based collection of routines that provide everything from basic data access and processing of level-zero data to general plotting tools, to advanced analysis tools tailored to the needs of particular solar missions. This is gradually being complemented by routines in the scientific Python ecosystem and developed by the PyHC (see above), but it still serves as the major workhorse for solar data analysis. See the main page and an installation link.


What services does the HDRL provide for Space Physics?

Space Physics Environment Data Analysis Software (SPEDAS): IDL access, plotting, and analysis (also in Python as pySPEDAS)

Roughly speaking, SolarSoft for non-solar physics. A set of IDL routines, along with a version that does not require IDL, for loading, plotting, analyzing, and integrating data from many ground and space-based observatories. The latter include all the CDAWeb accessible data, with enhancements for some analysis routines (e.g., visualization of some 3D distributions) and continual expansion and improvement. A Python version of SPEDAS (pySPEDAS) has been developed. A general SPEDAS webpage can be found here, and a overview publication is by Angelopoulos et al., 2018.


Autoplot: Multiformat access and plotting tool

A Java application that reads and displays/plots all formats used in Heliophysics plus many others. Autoplot can be used to plot data for a server or as a standalone Webstart application. It can be used to form a data access layer, and can make “png walks” of images of data plots to provide rapid data surveys. A variety of output formats are also possible, and it is compatible with the HAPI access method. Autoplot is a downloadable application.


HAPI: A general method for access to simple and complex time-ordered data

The Heliophysics Application Programming Interface is a generic API with a full and mature specification that provides uniform access to a wide range of HP data, including time series of scalars, vectors, spectrograms, and more complex matrix fields. It has a number of implementations, with servers including CDAWeb, iSWA at CCMC, LISRD at LASP, and a growing number of other places. Data access using a simple API from clients exists for IDL, Matlab, and Python. It is being built and maintained by an informal (but funded) group at a variety of institutions. Details including a specification of the protocol can be found here.


creader: Load binned, spike-edited, labeled CDAWeb data into IDL directly

The "creader" software is a set of routines to take advantage of CDAWeb's Web Service access. One line of IDL code issues a command to the CDAWeb server to bring in data from a particular dataset for a given time range, time resolution, and set of variables, and the variables are renamed to whatever is desired. The binning to a uniform resolution allows multiple datasets to be compared directly, no matter what the original time resolution was. Putting a creader saveset of routines into an accessible place in the IDL path makes the routines callable without a need for compilation. For more details, go to the "how to" page.


CDAWeb/CDAS Data Access and Display; Browsing and accessing the SPDF archive

A web browser interface (CDAWeb) provides access to most of the SPDF active archive of old and new NASA mission products; it provides a variety of browse plots, quality plots, binning, bad point editing, and ASCII and CDF downloads. Direct application data access to subsets (by variable and time) and supersets (across file boundaries) is provided by web services of both “SOAP” and “RESTful” varieties in what is called the Coordinated Data Analysis System CDAS. IDL and other libraries of routines provide various user capabilities.


OMNIWeb: Access and tools for the "OMNI" and other interplanetary data

The "OMNI" dataset is a collection of datasets at 1-min and hourly resolution that allow users to have uniform access to values of many variables at the nominal bow shock of the Earth, thus useful for both solar wind and magnetospheric studies. Many indices (KP, Dst, AE, etc) and other quantities are provided, along with tools for plotting, subsetting by various filters, and variable intercomparisons. Many of the data products are also accessible through CDAWeb, thus providing API access via applications such as IDL and Python.


CDF Maintenance and Upgrades

SPDF provides continual upgrades in conjunction with user needs to assure that the standard CDF data format continues to provide efficient and complete access to HP data. This effort includes maintaining and updating the “ISTP” guidelines for the metadata in CDF (and now NetCDF) files.


What general services does the HDRL provide?

Python in Heliophysics Community (PyHC)

There is an increasing use of Python for data access and analysis, particularly among younger researchers. The HDRL is unifying these efforts and has initiated funding of some projects. This will be a community-directed effort that builds on the models of other open-source communities. The SunPy effort is part of PyHC, along with a host of other projects.


SSCWeb & 4-D Orbit Viewer: Two and three dimensional visualization of orbits through time

The SSCWeb browser interface provides spacecraft orbit views in planar projections with options to plot multiple spacecraft orbits and to apply region and other query restrictions. A very large set of spacecraft are included. Also provided are 3-D space and time interactive views using "TIPSOD".


Data/model comparison activities; model and output registries

The HDRL is collaborating with the CCMC to develop SPASE-based descriptions of simulation code and its output with the goal of both making CCMC simulation and verification activities easier, and also to facilitate data-model comparisons. (See: https://ccmc.gsfc.nasa.gov.)


Support for various maintenance/upgrade tasks including the CHIANTI atomic database

Small levels of funding are being supplied to keep some projects up-to-date and available to the community, such as the CHIANTI database, DSCOVR high-resolution data, and various others as needed and possible.


SPASE Descriptions, Registration, and Tools

The SPASE Data Model needs to be continually updated, and new or revised product descriptions are needed frequently; the SPASE group provides these, partly as a contract-funded activity for continuity and community service, and partly as an open consortium that continues to refine and update the model. SPASE descriptions increasingly include Digital Object Identifiers (DOIs) for data reference and “product and parameter keys” to aid the use of APIs. (See: http://spase-group.org, and the publication by Roberts et al. (2018); https://doi.org/10.1029/2018SW002038.)


What Cloud-based tools are available for open science analysis of "big data"?

HelioCloud is an effort to create a next generation environment for doing heliophysics research based on a NASA instance of AWS CLoud web services. This project has the dual goals of increasing research team collaboration and unlocking the science which may be done with high end computing and big data. Together these goals will help support the need for Open Science in Heliophysics. Our project has a number of thrusts which we are using to pursue these goals:

  • The creation of a cloud-based, browser accessible research analysis environment with scalable compute resources.
  • Placing a large (~Pb) volume of heliophysics research data in the cloud where it may be easily analyized using cloud-based compute.
  • The development of a unified and sharable heliophysics research analysis environment which may be either downloaded or run in the cloud.
  • The development of best practices and user-facing software to support publication and consumption of Open Science projects and data.

Those interested in using these tools for a wide variety of purposes, but especially for science projects otherwise difficuot or impossible, should contact Brian Thomas

HP Dataset Registration

Why and how should HP data products be registered?

Central to the success of the Data Environment is a uniform set of terminology to describe products and their sources. This allows us to make registry of data products that is useful for search and discovery of relevant data. A number of groups in the US and elsewhere have worked on this problem. To foster the interoperability of the various partners in the HDRL, NASA HP has sponsored the SPASE (Space Physics Archive Search and Extract) collaborative, consisting of scientists and software designers from a number of US and international institutions, to develop the SPASE Data Model that will allow uniform descriptions of products and services. The current official version of the Data Model is available for use, and suggestions for improvement are always welcome.


How do I get help registering products using SPASE descriptions?

Since the SPASE effort grew up after the initiation of most of the current NASA missions, there was no mandate in their contracts or PDMPs for the provision of standardized metadata. Thus the NASA HDRL has formed a SPASE Metadata Working Team (SMWT) to work with missions to make the required XML files for describing data products. Much of this work for current and many past missions has been completed, but if you see any difficulties or need help with new SPASE product descriptions, please contact the SMWT via email to shing.fung@nasa.gov or brian.a.thomas@nasa.gov; more streamlined access is in the works.


Why and how would I use Digital Object Identifiers for data citation?

The recognition and documentation of data use in publications has become more formalized, with many journals adopting increasingly specific requirements for citation and referencing of datasets. Most people agree that these changes are beneficial to all concerned, including readers who want to reproduce research and data providers who want an easy way to keep track of and get credit for data use. To help NASA's Heliophysics (HP) division with adapting to this change, the SPASE group is offering a service that makes acquiring a "Digital Object Identifier" easy for data providers. DOIs have long been routinely used for assigning unique identifiers to journal articles or books, but they are now the de facto standard for registering datasets. What will be needed from data providers will be the author, publisher, and publication date. We will work with missions to provide starting points for this information using current registry information. Click here for more details.


I'm writing a paper/proposal that involves making a new dataset or Event List; what should I do with it?

NASA HP proposals now require Data Management Plans in most proposals that states what new datasets will be produced during the work. Such datasets need to be provided to NASA HP Archives (SPDF, SDAC, and, for simulation output, CCMC). Here is a Research Data Management Plan Template that includes a discussion of the issues involved to use as a guide.

Project Data Management Plans

What information should a mission plan provide about data products?

Mission planning includes the production of Project Data Management Plan that details how products are to be produced as well as what products are going to be provided to achieve the mission goals. This plan should be developed in conjunction with the HP Archives (SPDF or SDAC). See the Project Data Management Plan Template.

Historical Notes

What are the historical origins of the HDRL?

Many of the basic ideas about access to large, distributed data holdings were well-understood by those attempting to make a "Space Physics Data System" in the early 1980's. The technology has improved since, but these documents still are useful:


What Data Upgrade and Value Added Services work has HDRL undertaken?

Here are the projects that have been funded by competition in the Heliophysics Data Environment Enhancements (HDEE) offering in ROSES, given by category of award and the by the year of the ROSES call that is relevant for the task. Also included are a number of tasks that are not HDMC funded, but are relevant to the HDRL. Here also are the abstracts for the all the awards:



NASA HDRL

Connections

IHDEA:
International HPDE


ESA/ESDC:
European Science Data Center


JAXA/DARTS:
Data from Japan


AMDA/CDPP:
Tools and Data from France


NOAA-SWPC:
Space Weather


CCMC:
Models
& on Request!


Responsible NASA official: Brian A. Thomas
E-mail: brian.a.thomas@nasa.gov

Webmaster: Tami Kovalick
E-mail: tamara.j.kovalick@nasa.gov
Last revised 23 May 2023
Privacy Policy and Important Notices