Chicago's municipal record-keeping infrastructure has a problem hiding in plain sight. Across dozens of city departments — from the Chicago Department of Buildings to the Office of the City Clerk — digital archives accumulated over the past two decades are riddled with duplicate images: the same scanned permit applications, the same zoning maps, the same agenda attachments uploaded twice, three times, sometimes more, by staff working in disconnected systems that never talked to each other.
The issue matters now because the city is mid-transition into a consolidated open-data platform, a push that accelerated after the Chicago City Council approved updated digital governance guidelines in late 2024. Pulling duplicate files into a unified system doesn't just waste server space — it breaks search functions, inflates document counts in public-facing portals, and in some cases has caused confusion about which version of a permit or variance record is the authoritative one.
How the Duplication Built Up Over 20 Years
The roots go back to the early 2000s, when individual city departments began scanning paper records independently. The Department of Planning and Development digitized zoning records on one system. The Chicago Department of Transportation maintained its own image repository for infrastructure permits. The Cook County Assessor's Office, though a separate body, fed overlapping property images into portals that Chicagoans used interchangeably with city sources. Nobody set a unified naming convention. Nobody enforced a deduplication protocol.
By 2015, the city's open data portal — data.cityofchicago.org, launched in 2010 — had become the public face of Chicago's transparency push. But the portal pulled from those same siloed departmental feeds. A building permit for a two-flat on West Division Street in Wicker Park might appear with three separate image attachments, two of them identical scans uploaded on different dates by different clerks. The Chicago Architecture Center, which uses city permit records extensively for preservation research, flagged the redundancy issue in workshops it hosted for urban planning students as far back as 2018.
The problem compounded after the COVID-19 pandemic forced a rapid expansion of remote document processing starting in March 2020. Staff scanning and uploading from home offices, often without access to departmental intranets, defaulted to emailing files directly to supervisors who re-uploaded them. The Clerk's office alone processed tens of thousands of remote filings between April and December 2020, a period during which quality-control checkpoints were reduced.
The Cleanup Effort and What Comes Next
The city's Department of Innovation and Technology — known as DoIT — began a structured deduplication review in the first quarter of 2026, targeting the Chicago Data Portal's building and zoning image libraries first. The effort is being run in coordination with the City Clerk's digital records division and draws on hashing technology that compares image files at the binary level rather than relying on file names, which were often inconsistently assigned.
The initiative is not without cost. The 2026 budget allocated to DoIT for records modernization — covering deduplication, metadata standardization, and platform migration — runs into the millions of dollars, though the city has not published a line-item breakdown specific to the image cleanup component. Previous technology modernization contracts in Chicago, including a 2019 upgrade to the city's 311 system, ran into eight figures when implementation costs were fully counted.
For residents and researchers, the practical advice is straightforward: if you are pulling images from data.cityofchicago.org for any purpose — a FOIA request, a neighborhood association presentation, a legal proceeding — cross-reference document dates and file metadata before assuming the first result is the definitive record. The Citywide Transparency Office, based at City Hall on LaSalle Street, can confirm which image version carries official status for a given record. The deduplication review is expected to reach the transportation and public health image libraries by the end of 2026, though DoIT has not committed to a public completion date for the full project.
Chicago is not the first large American city to grapple with this. New York City's Department of Records ran a comparable audit of its digital archives between 2021 and 2023. The difference is scale and urgency: Chicago's open-data portal is increasingly the foundation for city planning decisions, and getting the underlying image library right is no longer a back-office housekeeping question.