From the Oceans to the Cloud: Opportunities and Challenges for Data, Models, Computation and Workflows

ocean-obs

From the Oceans to the Cloud: Opportunities and Challenges for Data, Models, Computation and Workflows

As part of the publication series OceanObs’19: An Ocean of Opportunity, the article From the Oceans to the Cloud: Opportunities and Challenges for Data, Models, Computation and Workflows was recently published in Frontiers in Marine Science. PacIOOS Data Management Lead, Jim Potemra, contributed to the paper.

Advances in ocean observations and models mean increasing flows of data. Integrating observations between disciplines over spatial scales from regional to global presents challenges. Running ocean models and managing the results is computationally demanding. The rise of cloud computing presents an opportunity to rethink traditional approaches. This includes developing shared data processing workflows utilizing common, adaptable software to handle data ingest and storage, and an associated framework to manage and execute downstream modeling. Working in the cloud presents challenges: migration of legacy technologies and processes, cloud-to-cloud interoperability, and the translation of legislative and bureaucratic requirements for “on-premises” systems to the cloud.

To respond to the scientific and societal needs of a fit-for-purpose ocean observing system, and to maximize the benefits of more integrated observing, research on utilizing cloud infrastructures for sharing data and models is underway. Cloud platforms and the services/APIs they provide offer new ways for scientists to observe and predict the ocean’s state. High-performance mass storage of observational data, coupled with on-demand computing to run model simulations in close proximity to the data, tools to manage workflows, and a framework to share and collaborate, enables a more flexible and adaptable observation and prediction computing architecture. Model outputs are stored in the cloud and researchers either download subsets for their interest/area or feed them into their own simulations without leaving the cloud. Expanded storage and computing capabilities make it easier to create, analyze, and distribute products derived from long-term datasets.

The authors provide an introduction to cloud computing, describe current uses of the cloud for management and analysis of observational data and model results, and describe workflows for running models and streaming observational data. Topics that must be considered are discussed when moving to the cloud: costs, security, and organizational limitations on cloud use. Future uses of the cloud via computational sandboxes and the practicalities and considerations of using the cloud to archive data are explored. The authors are also considering the ways in which the human elements of ocean observations are changing – the rise of a generation of researchers whose observations are likely to be made remotely rather than hands on – and how their expectations and needs drive research towards the cloud. In conclusion, visions of a future where cloud computing is ubiquitous are discussed.

Recent Posts

Did you know?

PacIOOS is the first regional association that was certified as a Regional Information Coordination Entity (RICE) by the U.S. Integrated Ocean Observing System (IOOS). Certification provides NOAA and its interagency partners a means to verify that a regional association’s organizational and operational practices, including data management, meet recognized and established standards set by NOAA.