NYC 311 Harvester
This project provides a command-line utility for downloading the full NYC 311
service request dataset (dataset id erm2-nwe9
) from the NYC OpenData Portal
using the official bulk CSV export endpoint. The script focuses on a reliable,
single-request download flow while adding a few quality-of-life improvements:
- Automatic discovery of dataset metadata so that generated files include the last update timestamp (or current UTC time when not available).
- Support for authentication via
APP_TOKEN
and optional HTTP/HTTPS proxies through standard environment variables. - Progress feedback during long-running downloads, with optional verbose mode for detailed tracing.
The repository exposes a single script, download_all.py
, plus a handful of
internal helpers documented in docs/functions.md
.