Skip to content

Latest commit

 

History

History
39 lines (33 loc) · 4.89 KB

File metadata and controls

39 lines (33 loc) · 4.89 KB
:alt: natcapuk-logo
:class: bg-primary mb-1
:align: right
:width: 400px
:alt: abstract-environmental-data-science
:class: bg-primary mb-1 sd-rounded-3
:align: center

Environmental Data Science Toolbox

This is a prototype version of the National Capability UK (NC-UK) Environmental Data Science Toolbox, hosted by the UK Centre for Ecology & Hydrology (UKCEH). The aim is to apply FAIR principles (Findable, Accessible, Interoperable, and Reusable) to a collection of data science methods that are generalizable across different environmental applications, with a focus on integrative modelling. The hope being that this will encourage cross-disciplinary use of methods, enhancing national environmental research.

If you're interested in contributing to this project it would be great to hear from you and you can find details of how to do so via the CONTRIBUTING.md page in the root of the repository. 🌞

The current recommended workflow for interactively engaging with the code in the methodology notebooks is to clone the {bdg-info}Notebook Repository linked at the top of each notebook to get access to the relevant files and then to create a virtual environment and test running different sections of the code in your favourite IDE, such as VS Code.

Methods Key Concepts Key Datasets
Bias Correction of Climate Models {bdg-warning-line}Ongoing Development Gaussian Processes, Bayesian Hierarchical Modelling Climate Model Output, In-situ Weather Station Measurements
Calculating Risk to Terrestrial Carbon Pool {bdg-warning-line}Ongoing Development Data Access, Data Integration MODIS Land Cover and Net Primary Production Products, European Space Agency (ESA) Climate Change Initiative (CCI) Soil Moisture Dataset, Global Standardized Precipitation-Evapotranspiration Index (SPEI) Dataset.
Understanding the error of Multispecies Biodiversity Indicators {bdg-warning-line}Ongoing Development Bias, Uncertainty Simulated Dataset (Multispecies Occupancy).
Joint Species Distribution Models with jsdmstan Stochastic Partial Differential Equations, Integrated Nested Laplace Approximations, Simulated Dataset (Multispecies Populations).
Non-target Analysis of Environmental Mass Spectrometry Data {bdg-warning-line}Ongoing Development Cheminformatics, Data Access, Non-target Analysis, Large Language Models, Principal Component Analysis, UpSet Analysis Processed LC-MS and GC-MS Data hosted on the NORMAN Digital Sample Freezing Platform (DSFP).
RO-Crate Tutorial {bdg-warning-line}Ongoing Development Data Access, Metadata, Data Integrity COSMOS Dataset from EIDC
EEX-placebased-exposure {bdg-warning-line}Ongoing Development Place based data exploration, Data Integration, Data Visualisation, Data analysis Air quality data from Defra and Water Quality data from Environment Agency
Multivariate Modelling of Censored Chemicals using jsdmstan Multivariate, Censored Data, Joint Species Distribution Modelling, Bayesian Inference Environment Agency PFAS River Monitoring & PAH Estuary Datasets, Simulated lognormal Censored Dataset
Extracting evidence-linked abundance drivers from species-account text LLM Assisted Knowledge Extraction, Evidence-linked Causal Relationships, Structured JSON Extraction, Causal DAG generation Plant Atlas-style Species Account Text, Synthetic Biodiversity Species-account Examples, Extracted Ecological Driver Relationship Table