For the New York Metropolis Metropolitan Transportation Authority’s knowledge and analytics crew, January 5, 2025, felt lots like kismet.
Three and a half years earlier, New York state legislators had passed a law requiring the MTA to launch “simply accessible, comprehensible, and usable” knowledge to the general public; by January 2022, MTA chair and CEO Janno Lieber formally introduced the brand new crew’s formation. In the meantime, New York City’s controversial congestion pricing program, which tolls automobiles coming into Manhattan’s busiest streets, formally kicked off in 2019 however was chugging by way of a prolonged setup course of, with the transit company and state combating lawsuits, politicians, and vocal naysayers alongside the best way.
So when this system lastly began in January, the MTA’s knowledge and analytics crew had ready. They might see the second the tolling began proper within the spreadsheets. “The day that it turned on, one subject modified from ‘no income assortment’ to ‘income,’” says Andy Kuziemko, the deputy chief of the info and analytics crew.
A couple of days later, the crew was pumping out knowledge on automobile entries into the zone in 10-minute increments, and posting the info on its web site, in order that New Yorkers themselves may determine whether or not the congestion program was truly lowering site visitors on metropolis streets. The company has been doing it since. You—sure, you—can view and obtain the MTA’s knowledge right here.
The web net pages aren’t flashy, however they signify a uncommon and complete public transit win for open-data advocates, who argue that entry to well-maintained public datasets is essential to authorities transparency and effectivity.
Since 2022, the MTA’s knowledge and analytics crew has grown to 26 full-time workers, who spend their workdays centralizing data that was as soon as scattered by way of the complete MTA. The company, to be clear, is large. The nation’s largest, it carries some 5.9 million riders on subways, buses, commuter railways, and thru tunnels and bridges on daily basis. That’s a whole lot of numbers to trace.
Actually lots; MTA now publishes greater than 180 datasets. Current additions embrace greater than a decade’s worth of data on the time MTA workers spend on “productive duties,” a new dataset on subway-delay-causing incidents; and bus speeds on Manhattan’s most crowded downtown roads. Kuziemko says 30 extra datasets have gotten publicly obtainable “within the close to future.”
Counter Intelligence
In an interview, Kuziemko and MTA chief of strategic initiatives Jon Kaufman credited a brand new tradition of intra-agency knowledge sharing for the renewed program. In 2023, management inspired managers throughout the company to permit their knowledge to be ingested into the MTA’s “knowledge lake,” which will be refined, stripped of figuring out data, and ultimately revealed overtly. (A few of the MTA’s knowledge comprises the personally identifiable data of commuters; the company says this particular knowledge will not be revealed for the general public.) The company has additionally began utilizing new in-house software program and instruments, which give them technical capabilities they didn’t have earlier than. “We now have paid for zero hours of consulting time, which is a factor we’re actually pleased with—that we truly constructed in-house experience within the public sector,” says Kuziemko. “It’s actually cool.”
“It’s uncommon for a authorities company to share this degree of information granularity,” says Sarah Kaufman, who directs the NYU Rudin Middle for Transportation and as soon as led the company’s open-data program. Actually, it’s one thing like an about-face for the MTA, which before 2009 made a habit of legally pursuing developers who scraped system timetable and route knowledge to construct rider-friendly apps.
