Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Skip to content
This is a Non-Federal dataset covered by different Terms of Use than Data.gov.

Crime Hot Spot Forecasting with Data from the Pittsburgh [Pennsylvania] Bureau of Police, 1990-1998

Metadata Updated: March 12, 2025

This study used crime count data from the Pittsburgh, Pennsylvania, Bureau of Police offense reports and 911 computer-aided dispatch (CAD) calls to determine the best univariate forecast method for crime and to evaluate the value of leading indicator crime forecast models. The researchers used the rolling-horizon experimental design, a design that maximizes the number of forecasts for a given time series at different times and under different conditions. Under this design, several forecast models are used to make alternative forecasts in parallel. For each forecast model included in an experiment, the researchers estimated models on training data, forecasted one month ahead to new data not previously seen by the model, and calculated and saved the forecast error. Then they added the observed value of the previously forecasted data point to the next month's training data, dropped the oldest historical data point, and forecasted the following month's data point. This process continued over a number of months. A total of 15 statistical datasets and 3 geographic information systems (GIS) shapefiles resulted from this study. The statistical datasets consist of

Univariate Forecast Data by Police Precinct (Dataset 1) with 3,240 cases Output Data from the Univariate Forecasting Program: Sectors and Forecast Errors (Dataset 2) with 17,892 cases Multivariate, Leading Indicator Forecast Data by Grid Cell (Dataset 3) with 5,940 cases Output Data from the 911 Drug Calls Forecast Program (Dataset 4) with 5,112 cases Output Data from the Part One Property Crimes Forecast Program (Dataset 5) with 5,112 cases Output Data from the Part One Violent Crimes Forecast Program (Dataset 6) with 5,112 cases Input Data for the Regression Forecast Program for 911 Drug Calls (Dataset 7) with 10,011 cases Input Data for the Regression Forecast Program for Part One Property Crimes (Dataset 8) with 10,011 cases Input Data for the Regression Forecast Program for Part One Violent Crimes (Dataset 9) with 10,011 cases Output Data from Regression Forecast Program for 911 Drug Calls: Estimated Coefficients for Leading Indicator Models (Dataset 10) with 36 cases Output Data from Regression Forecast Program for Part One Property Crimes: Estimated Coefficients for Leading Indicator Models (Dataset 11) with 36 cases Output Data from Regression Forecast Program for Part One Violent Crimes: Estimated Coefficients for Leading Indicator Models (Dataset 12) with 36 cases Output Data from Regression Forecast Program for 911 Drug Calls: Forecast Errors (Dataset 13) with 4,936 cases Output Data from Regression Forecast Program for Part One Property Crimes: Forecast Errors (Dataset 14) with 4,936 cases Output Data from Regression Forecast Program for Part One Violent Crimes: Forecast Errors (Dataset 15) with 4,936 cases. The GIS Shapefiles (Dataset 16) are provided with the study in a single zip file: Included are polygon data for the 4,000 foot, square, uniform grid system used for much of the Pittsburgh crime data (grid400); polygon data for the 6 police precincts, alternatively called districts or zones, of Pittsburgh(policedist); and polygon data for the 3 major rivers in Pittsburgh the Allegheny, Monongahela, and Ohio (rivers).

Access & Use Information

Public: This dataset is intended for public access and use. Non-Federal: This dataset is covered by different Terms of Use than Data.gov. License: us-pd

Downloads & Resources

Dates

Metadata Created Date March 12, 2025
Metadata Updated Date March 12, 2025

Metadata Source

Harvested from DOJ JSON

Additional Metadata

Resource Type Dataset
Metadata Created Date March 12, 2025
Metadata Updated Date March 12, 2025
Publisher National Institute of Justice
Maintainer
Identifier 556
Data First Published 2015-08-07T15:12:10
Language eng
Data Last Modified 2015-08-07T15:12:10
Public Access Level public
Aicategory Not AI-ready
Bureau Code 011:21
Metadata Context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
Metadata Catalog ID https://www.justice.gov/data.json
Schema Version https://project-open-data.cio.gov/v1.1/schema
Catalog Describedby https://project-open-data.cio.gov/v1.1/schema/catalog.json
Harvest Object Id f8b2386b-636d-48a5-89da-b9890f665834
Harvest Source Id 11827822-e56a-442a-9edb-6b249b7ddcc3
Harvest Source Title DOJ JSON
Internalcontactpoint {"@type": "vcard:Contact", "fn": "Jennifer Scherer", "hasEmail": "mailto:Jennifer.Scherer@usdoj.gov"}
Jcamsystem {"acronym": "OJP_EXT", "id": 8, "name": "External system not available in CSAM"}
License http://www.usa.gov/publicdomain/label/1.0/
Metadatamodified 9/2/2022 6:22:00 PM
Program Code 011:060
Publisher Hierarchy Office of Justice Programs > National Institute of Justice
Sourceidentifier https://doi.org/10.3886/ICPSR03469
Source Datajson Identifier True
Source Hash e024d753a7d72a8d1a063d5bfaec2c3db3f53bc4b4ce4baf5a49b84d4dde12d5
Source Schema Version 1.1

Didn't find what you're looking for? Suggest a dataset here.