Global emission factor dataset for Scope 3 machine learning applications

用于范围 3 机器学习应用的全球排放因子数据集

阅读:1

Abstract

The accurate and transparent estimation of greenhouse gas emissions is essential for corporate sustainability reporting and machine learning applications. Existing emission-factor datasets have restrictive licenses, insufficient spatiotemporal granularity, or outdated information, limiting their reproducibility and utility across disciplines. We present ExioML, an open-source dataset derived from Exiobase 3.8.2. It integrates environmentally extended multi-regional input-output tables with a graphics processing unit (GPU)-accelerated computational toolkit, facilitating compatibility with and extensibility to other datasets. ExioML encompasses sector-level emission factor data for 49 regions and 28 years from 1995 to 2022, structured into two aggregation schemes: a product-by-product format covering 200 categories, and an industry-by-industry format covering 163 categories. To validate dataset usability and establish a reproducible baseline, we define a regression task for predicting sectoral greenhouse gas emissions. The task is evaluated using tree-based and neural-network-based models, with mean squared error as the evaluation metric. ExioML provides openly accessible emission-factor tables and a reproducible baseline intended to support reuse and benchmarking across sustainability and machine-learning studies.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。