Checking for non-preferred file/folder path names (may take a long time depending on the number of files/folders) ...

Cloud-based Jupyter Notebooks for Water Data Analysis


Authors:
Owners: This resource does not have an owner who is an active HydroShare user. Contact CUAHSI (help@cuahsi.org) for information on this resource.
Type: Resource
Storage: The size of this resource is 34.2 KB
Created: Dec 07, 2017 at 8:06 p.m.
Last updated: Jan 16, 2018 at 4:57 p.m.
Citation: See how to cite this resource
Content types: Single File Content 
Sharing Status: Public
Views: 2329
Downloads: 71
+1 Votes: 1 other +1 this
Comments: 2 comments

Abstract

The development and adoption of technologies by the water science community to improve our ability to openly collaborate and share workflows will have a transformative impact on how we address the challenges associated with collaborative and reproducible scientific research. Jupyter notebooks offer one solution by providing an open-source platform for creating metadata-rich toolchains for modeling and data analysis applications. Adoption of this technology within the water sciences, coupled with publicly available datasets from agencies such as USGS, NASA, and EPA enables researchers to easily prototype and execute data intensive toolchains. Moreover, implementing this software stack in a cloud-based environment extends its native functionality to provide researchers a mechanism to build and execute toolchains that are too large or computationally demanding for typical desktop computers. Additionally, this cloud-based solution enables scientists to disseminate data processing routines alongside journal publications in an effort to support reproducibility. For example, these data collection and analysis toolchains can be shared, archived, and published using the HydroShare platform or downloaded and executed locally to reproduce scientific analysis. This work presents the design and implementation of a cloud-based Jupyter environment and its application for collecting, aggregating, and munging various datasets in a transparent, sharable, and self-documented manner. The goals of this work are to establish a free and open source platform for domain scientists to (1) conduct data intensive and computationally intensive collaborative research, (2) utilize high performance libraries, models, and routines within a pre-configured cloud environment, and (3) enable dissemination of research products. This presentation will discuss recent efforts towards achieving these goals, and describe the architectural design of the notebook server in an effort to support collaborative and reproducible science

This was presented as an EPoster at the 2017 American Geophysical Union and can be found at:
https://agu2017fallmeeting-agu.ipostersessions.com/default.aspx?s=2B-C4-70-3C-B8-A0-0D-77-35-04-7C-F2-A4-1B-36-10

Subject Keywords

Coverage

Temporal

Start Date:
End Date:

Content

Related Resources

This resource belongs to the following collections:
Title Owners Sharing Status My Permission
JupyterHub Example Notebooks Anthony Castronova  Public &  Shareable Open Access

How to Cite

Castronova, A., l. brazil, M. Seul (2018). Cloud-based Jupyter Notebooks for Water Data Analysis, HydroShare, http://www.hydroshare.org/resource/96d5ca4012be43e282eb258f2ac1d525

This resource is shared under the Creative Commons Attribution CC BY.

http://creativecommons.org/licenses/by/4.0/
CC-BY

Comments

Richard Hooper 6 years, 10 months ago

Great paper Tony. Can you reply to this comment?

Reply
+1 Votes: Be the first one to 
 this.

Richard Hooper 6 years, 10 months ago

So a reply is possible. THis could be good.

Reply
+1 Votes: Be the first one to 
 this.

New Comment

required