MTA Subway Time® Historical Data
This page explains how to retrieve archival GTFS data from MTA New York City Transit's feed of real-time train arrival estimates on the subway lines, the 42nd Street Shuttle, and the Staten Island Railway. This archive began on September 17, 2014, and it continues daily to the present.
There are no links to files directly from this site. You will need to know the name of the file to fetch it. The files are hosted at amazonaws.com.
Files Generated Each Day
There are three separate transit line files
How to Retrieve the Data Files
There are two ways to retrieve history data.
The first way is through a daily 'roll up' of the many individual files.
The 3 files mentioned above are collected 12 times an hour, or 864 files per day.
There is a single archive file created at 12:01 a.m. every morning that consists of the previous day's files.
The naming convention for the 'roll-up' archive URL is datamine-YEAR-MONTH-DAY.s3.amazonaws.com/gtfs.tgz
Thus, for example the roll-up for 9/17/2014 can be downloaded from this URL:
Another way to get the same data is to download each individual file from this datamine-history.s3.amazonaws.com URL.
The naming convention for files on this site is the file name (above) + a suffix:
There is one file for each transit line archived every five minutes, starting at one minute past the hour.
The files are time stamped at 01, 06, 11, 16, 21, 26, 31, 36, 41, 46, 51, and 56 minutes after the hour.
Thus, an example of a file name created on 9/17/2014 at around 9:30AM can be downloaded from this site as:
Because the file is renamed at the intervals above, there is no file stamped 9:30 a.m., but there is one at 9:31AM.
For Further Information
All data is formatted in GTFS-realtime. For further infromation about the nature of information in specific fields, and the formatting and structure of the data, visit https://developers.google.com/transit/gtfs-realtime/
This information is brought to you by: MTA HQ Information Services
- Google Translate