Checking for non-preferred file/folder path names (may take a long time depending on the number of files/folders) ...

Parsimonious and Transferrable Parameterization of Reservoir Operations: A Modular Approach for Large-Scale Modeling


Authors:
Owners: This resource does not have an owner who is an active HydroShare user. Contact CUAHSI (help@cuahsi.org) for information on this resource.
Type: Resource
Storage: The size of this resource is 231.6 MB
Created: Apr 15, 2025 at 1:24 p.m.
Last updated: Apr 15, 2025 at 1:36 p.m.
Citation: See how to cite this resource
Content types: CSV Content 
Sharing Status: Public
Views: 70
Downloads: 0
+1 Votes: Be the first one to 
 this.
Comments: No comments (yet)

Abstract

This HydroShare Resorce provides the data, code, and results for the manuscript under review by JAMES. The abstract of the manuscript is as follows: Accurately representing daily reservoir operations in large-scale hydrological and water resource modeling remains challenging due to both the complex and vague nature of real-world operations and very limited availability of operation records for many reservoirs worldwide. To address this gap, this study introduces MODROM (MOdular Data-driven Reservoir Operation Model), a parsimonious reservoir parameterization scheme that conceptualizes reservoir operations through simple operation modules and their seasonal transitions. These operation modules are designed to be simple and parsimonious for easier generalizing from data-rich to data-scarce reservoirs. MODROM is calibrated using high-quality long-term operation records from more than 400 data-rich reservoirs across the Contiguous United States (CONUS), and a Random Forest model is developed to provide calibrated parameters for data-scarce reservoirs based on a suite of static reservoir characteristics. Results demonstrate MODROM’s strong and robust performance when calibrating using all available data for each reservoir, though the performance generally declines for reservoirs with larger regulation capacity. The generalization performance is strong under favorable sampling conditions but is affected by sampling uncertainty due to the limited reservoir dataset. Benchmarking against existing models shows that MODROM offers distinct advantages in generalizing parameters to data-scarce reservoirs using readily available static reservoir characteristics, with the potential for global-scale application.

Subject Keywords

Coverage

Spatial

Coordinate System/Geographic Projection:
WGS 84 EPSG:4326
Coordinate Units:
Decimal degrees
Place/Area Name:
CONUS
North Latitude
49.0000°
East Longitude
-125.0000°
South Latitude
24.5000°
West Longitude
-67.0000°

Content

readme.md

Introduction

This HydroShare Resource provides the codes, processed data, and results for the following study under review at JAMES: "Parsimonious and Transferrable Parameterization of Reservoir Operations: A Modular Approach for Large-Scale Modeling" by Donghui Li and Gabriele Villarini, in which we developed a MOdular Data-driven Reservoir Operation Model (MODROM) that can be implemented for large-scale hydrological and water resource modeling.

The files are organized as below: - readme.md: this markdown file. - code: the folder of codes for raw data retrieval, process, and analysis. - data: the folder of processed data. - results: the folder of results.

code

  • benchmark.ipynb: this notebook contains the scripts to implement the benchmarking reservoir operation models compared in the study.
  • fit_modrom.ipynb: this notebook contains the scripts to calibrate our MODROM with all available operation data.
  • parameter_generalization.ipynb: this notebook contains the scripts to train the random forest model to transfer calibrated MODORM parameters from data-rich reservoirs to data-scarce reservoirs using static reservoir characteristics.
  • gdrom_rules.py: this python script contains the "if-else" operation rules for 452 large reservoirs across the CONUS, which is used to run the GDROM, one of the benchmarking models.
  • istarf.R: this R script contains the implementation of ISTARF, one of the benchmarking models.

data

  • daily_records/: the folder of daily operation data.
  • istarf_simulation/: the folder of ISTARF simulation, including both calibration and generalization results.
  • classification_data.csv: the file of features and targets used to train the random forest model that determines which module to apply.
  • GRAND_ID through USE_NAVI are obtained from GRanD, representing different reservoir characteristics
  • ppt: 30-year normal precipitation obtained from PRISM.
  • mode: module ID at the given month for the reservoir.
  • sin_month and cos_month: harmonic function of month.
  • regression_data_mode2.csv: the file of features and targets used to train the random forest model that determines the parameter value for each applied module 2 (note that module 1 parameters are implicitly contained in this as linear regression can collapse to constant value).
  • regression_data_mode3.csv: the file of features and targets used to train the random forest model that determines the parameter value for each applied module 3.
  • reservoir_metadata.csv: the file of reservoir metadata.
  • reservoir_norm_info.json: the file of normalization information (maximum value of inflow, storage, and outflow, in acft/day) for each reservoir.

results

  • calibration_metrics.csv: the calibration KGE for each reservoir.
  • generalization_metrics.csv: the generalization KGE for each reservoir, including the statistics out of the 100 independent random train-test splits.
  • bootstrap_metrics.csv: the KGE from the boostrap validation.
  • calibrated_parameters.json: the calibrated parameters for each reservoir.
  • benchmark_metrics.csv: the benchmark KGE for each reservoir across benchmarking reservoir operation models.
  • final_clf.pkl: the pickle file of trained classification random forest model that is used to determine which module to apply.
  • final_reg_mode2.pkl: the pickle file of trained regression random forest model that is used to determine parameter values for module 2.
  • final_reg_mode3.pkl: the pickle file of trained regression random forest model that is used to determine parameter values for module 3.

How to Cite

Li, D. (2025). Parsimonious and Transferrable Parameterization of Reservoir Operations: A Modular Approach for Large-Scale Modeling, HydroShare, http://www.hydroshare.org/resource/6c88031dc73c4f15bf3747b0a03becbc

This resource is shared under the Creative Commons Attribution CC BY.

http://creativecommons.org/licenses/by/4.0/
CC-BY

Comments

There are currently no comments

New Comment

required