Checking for non-preferred file/folder path names (may take a long time depending on the number of files/folders) ...

Supporting Materials for: Advancing Open and Reproducible Water Data Science by Integrating Data Analytics with an Online Data Repository


Authors:
Owners: This resource does not have an owner who is an active HydroShare user. Contact CUAHSI (help@cuahsi.org) for information on this resource.
Type: Resource
Storage: The size of this resource is 1.6 MB
Created: Oct 11, 2024 at 7:04 p.m.
Last updated: Nov 15, 2024 at 4:53 p.m.
Citation: See how to cite this resource
Sharing Status: Public
Views: 108
Downloads: 40
+1 Votes: Be the first one to 
 this.
Comments: No comments (yet)

Abstract

This HydroShare resource was created as a demonstration of how a reproducible data science workflow can be created and shared using HydroShare. The hsclient Python Client package for HydroShare is used to show how the content files for the analysis can be managed and shared automatically in HydroShare. The content files include a Jupyter notebook that demonstrates a simple regression analysis to develop a model of annual maximum discharge in the Logan River in northern Utah, USA from annual maximum snow water equivalent data from a snowpack telemetry (SNOTEL) monitoring site located in the watershed. Streamflow data are retrieved from the United States Geological Survey (USGS) National Water Information System using the dataretrieval package. Snow water equivalent data are retrieved from the United States Department of Agriculture Natural Resources Conservation Service (NRCS) SNOTEL system. An additional notebook demonstrates how to use hsclient to retrieve data from HydroShare, load it into a performance data object, and then use the data for visualization and analysis.

Subject Keywords

Coverage

Spatial

Coordinate System/Geographic Projection:
WGS 84 EPSG:4326
Coordinate Units:
Decimal degrees
Place/Area Name:
Logan River Gage, Utah
Longitude
-111.7827°
Latitude
41.7433°

Content

readme.md

Resource Content

This HydroShare resource contains supporting materials for a manucript titled "Advancing Open and Reproducible Water Data Science by Integrating Data Analytics with an Online Data Repository" submitted for publication to the Environmental Modelling & Software journal. It contains Jupyter notebooks and data files required to reproduce the analyses presented in the paper.

Preparing to Execute the Notebooks in this Resource

Before executing the notebooks contained in this resource using the HydroShare linked JupyterHub environments, you will first need a HydroShare user account. If you do not have one already, visit https://www.hydroshare.org to sign up and create an account.

For gaining access to HydroShare's linked JupyterHub servers, visit HydroShare's help documentation for apps and click on the links to the following in the navigation on that page. These links will take you to documentation on how to gain access to the JupyterHub servers.

  • Cyber-GIS Jupyter for water
  • CUAHSI JupyterHub
  • CIROH JupyterHub

You can also download this HydroShare resource and execute the provided notebooks in a Python environment on your local machine.

Executing the Notebooks using JupyterHub

To execute the interactive notebooks in this resource using the CUAHSI JupyterHub:

  1. At the top of this page in HydroShare, click the "Open with" button and select "CUAHSI JupyterHub" .
  2. When prompted, select the first server option in the list of server options - "Python v3.9.7 - JupyterLab Interface"
  3. Scroll to the bottom of the server options and click the "Start" button
  4. Wait for your server to start up and then double click on the notebook you want to run in the file browser panel on the left.

The instructions are similar for the other two JupyterHub environments.

Files Included in this Resource

  • HydroShare_SnowtoFlow_Notebook.ipynb: Juputer notebook with the snow-to-flow analysis described in the paper.
  • discharge.csv: Original USGS NWIS streamflow data for the gage used in the snow-to-flow analysis.
  • snotel.csv: Original snow water equivalent data for the SNOTEL site used in the snow-to-flow analysis.
  • HydroShare_hsclient_Notebook1.ipynb: Jupyter notebook showing how to use hsclient to create a HydroShare resource, create and edit metadata, and add content files.
  • HydroShare_hsclient_Notebook2.ipynb: Jupyter notebook with example code for accessing data from an existing HydroShare resource using hsclient, loading it into a performant data object, and then performing analysis.
  • readme.md: This readme file

Additional Example Notebooks

For additional examples of how to use the hsclient Python package for HydroShare, visit the HydroShare resource at https://www.hydroshare.org/resource/7561aa12fd824ebb8edbee05af19b910/ or the GitHub repository for the hsclient package at https://github.com/hydroshare/hsclient.

Credits

Funding Agencies

This resource was created using funding from the following sources:
Agency Name Award Title Award Number
National Science Foundation Collaborative Research: Elements: Advancing Data Science and Analytics for Water (DSAW) OAC 1931297

How to Cite

Horsburgh, J. S. (2024). Supporting Materials for: Advancing Open and Reproducible Water Data Science by Integrating Data Analytics with an Online Data Repository, HydroShare, http://www.hydroshare.org/resource/7440c7feae1f428d91c1e510d23d3e54

This resource is shared under the Creative Commons Attribution CC BY.

http://creativecommons.org/licenses/by/4.0/
CC-BY

Comments

There are currently no comments

New Comment

required