Checking for non-preferred file/folder path names (may take a long time depending on the number of files/folders) ...

Cloud-based Jupyter Notebooks for Water Data Analysis

Owners: This resource does not have an owner who is an active HydroShare user. Contact CUAHSI ( for information on this resource.
Type: Resource
Storage: The size of this resource is 34.2 KB
Created: Dec 07, 2017 at 8:06 p.m.
Last updated: Jan 16, 2018 at 4:57 p.m.
Citation: See how to cite this resource
Content types: Single File Content 
Sharing Status: Public
Views: 2036
Downloads: 69
+1 Votes: 1 other +1 this
Comments: 2 comments


The development and adoption of technologies by the water science community to improve our ability to openly collaborate and share workflows will have a transformative impact on how we address the challenges associated with collaborative and reproducible scientific research. Jupyter notebooks offer one solution by providing an open-source platform for creating metadata-rich toolchains for modeling and data analysis applications. Adoption of this technology within the water sciences, coupled with publicly available datasets from agencies such as USGS, NASA, and EPA enables researchers to easily prototype and execute data intensive toolchains. Moreover, implementing this software stack in a cloud-based environment extends its native functionality to provide researchers a mechanism to build and execute toolchains that are too large or computationally demanding for typical desktop computers. Additionally, this cloud-based solution enables scientists to disseminate data processing routines alongside journal publications in an effort to support reproducibility. For example, these data collection and analysis toolchains can be shared, archived, and published using the HydroShare platform or downloaded and executed locally to reproduce scientific analysis. This work presents the design and implementation of a cloud-based Jupyter environment and its application for collecting, aggregating, and munging various datasets in a transparent, sharable, and self-documented manner. The goals of this work are to establish a free and open source platform for domain scientists to (1) conduct data intensive and computationally intensive collaborative research, (2) utilize high performance libraries, models, and routines within a pre-configured cloud environment, and (3) enable dissemination of research products. This presentation will discuss recent efforts towards achieving these goals, and describe the architectural design of the notebook server in an effort to support collaborative and reproducible science

This was presented as an EPoster at the 2017 American Geophysical Union and can be found at:

Subject Keywords



Start Date:
End Date:


Related Resources

This resource belongs to the following collections:
Title Owners Sharing Status My Permission
JupyterHub Example Notebooks Anthony Castronova  Public &  Shareable Open Access

How to Cite

Castronova, A., l. brazil, M. Seul (2018). Cloud-based Jupyter Notebooks for Water Data Analysis, HydroShare,

This resource is shared under the Creative Commons Attribution CC BY.


Richard Hooper 6 years, 6 months ago

Great paper Tony. Can you reply to this comment?

+1 Votes: Be the first one to 

Richard Hooper 6 years, 6 months ago

So a reply is possible. THis could be good.

+1 Votes: Be the first one to 

New Comment