Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Skip to content
This is a Non-Federal dataset covered by different Terms of Use than Data.gov.

Annual Mean PM2.5 Components Trace Elements (TEs) 50m Urban and 1km Non-Urban Area Grids for Contiguous U.S., 2000-2019, v1

Metadata Updated: February 21, 2025

The Annual Mean PM2.5 Components Trace Elements (TEs) 50m Urban and 1km Non-Urban Area Grids for Contiguous U.S., 2000-2019, v1 data set contains annual predictions of trace elements concentrations at a hyper resolution (50m x 50m grid cells) in urban areas and a high resolution (1km x 1km grid cells) in non-urban areas, for the years 2000 to 2019. Particulate matter with an aerodynamic diameter of less than 2.5 �m (PM2.5) is a human silent killer of millions worldwide, and contains many trace elements (TEs). Understanding the relative toxicity is largely limited by the lack of data. In this work, ensembles of machine learning models were used to generate approximately 163 billion predictions estimating annual mean PM2.5 TEs, namely Bromine (Br), Calcium (Ca), Copper (Cu), Iron (Fe), Potassium (K), Nickel (Ni), Lead (Pb), Silicon (Si), Vanadium (V), and Zinc (Zn). The monitored data from approximately 600 locations were integrated with more than 160 predictors, such as time and location, satellite observations, composite predictors, meteorological covariates, and many novel land use variables using several machine learning algorithms and ensemble methods. Multiple machine-learning models were developed covering urban areas and non-urban areas. Their predictions were then ensembled using either a Generalized Additive Model (GAM) Ensemble Geographically-Weighted-Averaging (GAM-ENWA), or Super-Learners. The overall best model R-squared values for the test sets ranged from 0.79 for Copper to 0.88 for Zinc in non-urban areas. In urban areas, the R-squared model values ranged from 0.80 for Copper to 0.88 for Zinc. The Coordinate Reference System (CRS) used in the predictions is the World Geodetic System 1984 (WGS84) and the Units for the PM2.5 Components TEs are ng/m^3. The data are provided in RDS tabular format, a file format native to the R programming language, but can also be opened by other languages such as Python.

Access & Use Information

Public: This dataset is intended for public access and use. Non-Federal: This dataset is covered by different Terms of Use than Data.gov. License: No license information was provided.

Downloads & Resources

Dates

Metadata Created Date February 21, 2025
Metadata Updated Date February 21, 2025

Metadata Source

Harvested from nasa test json

Additional Metadata

Resource Type Dataset
Metadata Created Date February 21, 2025
Metadata Updated Date February 21, 2025
Publisher SEDAC
Maintainer
Identifier C2673738199-SEDAC
Data First Published 2023-04-28
Language en-US
Data Last Modified 2025-02-19
Category AQDH, geospatial
Public Access Level public
Bureau Code 026:00
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id 5dcf84f7-86ff-4697-835a-78a8f6d3ef4e
Harvest Source Id a73e0c30-4684-40ef-908e-d22e9e9e5f86
Harvest Source Title nasa test json
Homepage URL https://doi.org/10.7927/10.7927/1x94-mv38
Metadata Type geospatial
Old Spatial -180.0 17.0 -65.0 72.0
Program Code 026:001
Source Datajson Identifier True
Source Hash c0860d310656a0663eabcabb1fd9774528d809148dc93e5b9f8ffeec838f3054
Source Schema Version 1.1
Spatial
Temporal 2000-01-01T00:00:00Z/2019-12-31T00:00:00Z

Didn't find what you're looking for? Suggest a dataset here.