Monitoring of Freshwater at High Spatio-temporal Resolutions

The overarching goal of this project is to revolutionise our understanding of the fundamental principles that govern water regimes in streams and lakes worldwide. The project will capture the multi-dimensional aspects of flow regimes (river discharge) and model hydraulics using a wide range of high resolution geo-datasets integrated with gauging station data in a machine learning framework. The preliminary research below lays the foundation for this challenging project.

Project presentation

Global Monitoring of Fresh Water at High Spatial and Temporal Resolutions:

Assessing Stream and Lake Hydrological/Physical Features within a Machine Learning Framework

JPL-NASA: Earth Science Seminar Thursday, 29 August 2019

Presented by Giuseppe Amatulli, Yale School of Forestry and Environmental Studies, Yale Center for Research Computing, Yale University

Lakes and rivers process a significant quantity of fresh water, which is among the most vulnerable resources in nature. The physico-chemical characteristics of each watershed or stream are the result of complex interactions among several environmental variables. These variables regulate discharge regimes and stream profiles, such as rainfall, evapotranspiration, soil infiltration and retention, geomorphology, land use, and snow cover, among others. Current knowledge of stream flow trends in rural areas and developing countries is limited and fragmented, and small streams are often not represented. A full geo-analysis is needed to capture these stream features. Freshwater quantification at high spatial resolution is therefore essential to this aim, and also the first step towards a comprehensive assessment of the global water cycle. The project captures the multi-dimensional aspects of the flow regimes (monthly discharge) and model hydraulics worldwide, using a broad range of 90m geo-datasets and gauging station data in a machine learning framework. This work will be the most comprehensive hydrological model based on a high dimensional data-driven approach able to assess stream network location, width, depth, and water flow. The overarching goal of this research is to revolutionize our understanding of the fundamental principles that govern freshwater discharge regimes worldwide.

Methodology (update on 1st of July 2020)

  • Task 1) Dataset archive
    • Task 1.a) Observations: gauge station data: discharge + geochemistry (GSIM + geochemistry ongoing)
    • Task 1.b) Predictors: remote sensing and geographic data (completed)
    • Task 1.c) Pre-processing predictor datasets (completed)
  • Task 2) Extraction of hydrographic features from DEM
    • Task 2.a) Stream network and basins delineation (completed, see demo)
    • Task 2.b) Compute other stream morphology properties (ongoing)
  • Task 3) Freshwater-specific environmental predictors
    • Task 3.a) Compute stream variables (TERRACLIMATE completed see demo, the rest ongoing)
    • Task 3.b) Observation snapping and stream-predictors extraction at gauge location (ongoing)
  • Task 4) Geo-data-fusion within a machine learning framework
    • Task 4.a) Training and testing splitting procedure (testing phase with TERRACLIMATE)
    • Task 4.b) Machine learning algorithm selection (testing phase with Random Forest)
    • Task 4.c) Model evaluation and error metrics (acquiring scientific knowledge)
    • Task 4.d) Predict response variables and uncertainty in ungauged locations (under development)
    • Task 4.e) Variables importance at global and local level (under development)
  • Task 5) Web-GIS development (completed prototype, see demo)

Global Monitoring of Fresh Water at High Spatial and Temporal Resolutions

Water quality

Basins delineation and stream networks extraction at global level

Geomorphometric/Topographic/Environmental variables at global level

High resolution Digital Elevation Models: extracting geomorphometric features  and hydrography.


Project team

The project’s research team brings together several years of experience in large geographic data and machine learning model implementation. The team members’ outstanding credentials in the environmental and computer sciences, statistical modelling and hydrology offer the requisite skills to achieve project objectives.

PI:  Dr. Giuseppe Amatulli
Research Scientist at Yale University (USA). Expertise: Geocomputation, data analysis, GIS, remote sensing , big geo-dataset processing, hydrology.

Co-PI: Dr. George H. Allen
Assistant Professor at Texas A&M University (USA). Expertise: hydrology, remote sensing, GIS, data analysis, water resources.

Scientists: 2 Post-Docs with expertise hydrology, machine learning and geocomputation (to be selected)

Advisor: Prof. Peter Raymond
Professor at Yale University (USA). Expertise: (USA). Expertise: hydro-chemistry, water resources, freshwater ecology.

Advisor: Dr. Longzhu Shen
Research Scientist at University of Cambridge (UK). Expertise: modeller, data analysis, big-data processing, machine learning, chemistry.

Advisor: Dr. Sami Domisch
Research Scientist at Leibniz-Institute of Freshwater Ecology and Inland Fisheries (Germany). Expertise: freshwater biodiversity, biogeography, modelling and data analysis, massive data processing, machine learning

Collaborator: Mr. Richard Barnes
PhD Student at University of Berkeley. Expertise: computer scientist, developer, data analysis

Collaborator: Mr. Paolo Corti
Geospatial Engineering consultant at Spatial Ecology. Expertise: computer scientist, web-GIS developer

Collaborator: Mr. Tushar Sethi
Head of Operations, Spatial Ecology (UK). Expertise: Water and waste management technologies; translating scientific research into applied solutions.