Skip to main content
The Daily Chicago

All of Chicago, every day

News

How Chicago's Digital Archives Ended Up Flooded With Duplicate Images — And What the City Is Doing About It

A years-long accumulation of redundant photo files across city agencies has created a costly storage and transparency problem that officials are now scrambling to fix.

Share

By Chicago News Desk · Published 4 July 2026, 1:43 PM

4 min read

Updated 28 min ago· 5 July 2026, 2:05 AM

How we reported this

This article was generated by AI from the linked public sources. The Daily Chicago is independently owned and covers Chicago news free from advertiser or sponsor influence. Read our editorial standards →

How Chicago's Digital Archives Ended Up Flooded With Duplicate Images — And What the City Is Doing About It
Photo: Huneker, James, 1857-1921. / Public domain (Wikimedia Commons)

Chicago's municipal digital infrastructure is sitting on a mountain of identical images. Across city departments — from the Department of Planning and Development on West Madison Street to the Chicago Public Library's digital collections branch — thousands of duplicate photograph files have quietly multiplied for more than a decade, clogging servers, complicating public records requests, and burning through storage budgets that could otherwise fund frontline services.

The problem matters right now because the city is mid-way through a $47 million technology modernization contract awarded in early 2025 to overhaul how Chicago agencies manage and share data. Auditors reviewing the project's first phase flagged duplicate image files as a significant drag on the initiative — an unglamorous but expensive obstacle that city IT staff say predates the current administration by years.

How the Duplication Problem Built Up

The roots go back to roughly 2008, when individual Chicago departments began migrating paper records to digital formats without a unified citywide standard. The Chicago Department of Buildings digitized permit documentation. The Office of Emergency Management and Communications archived incident imagery. The Chicago Transit Authority maintained its own photo library for infrastructure inspections along the Red and Blue lines. None of these systems talked to each other, and none flagged when the same image — a pothole on South Halsted, a flooded underpass near Wacker Drive — was uploaded multiple times under different file names.

By the time the city's Department of Innovation and Technology began a formal asset inventory in late 2023, the scale of the redundancy had grown unwieldy. Storage vendors were billing the city for capacity consumed largely by copies of copies. Public records officers at the Richard J. Daley Center reported that FOIA responses were being delayed because search results for image requests returned duplicate hits that staff had to manually sort through before releasing files.

The Chicago History Museum's digital collections team encountered a related version of the same issue when it partnered with the city in 2022 on a joint digitization grant. Duplicate scans of the same neighborhood photographs from the Cabrini-Green demolition era had been submitted by at least three separate contributing agencies, each unaware the others had already uploaded the same material.

What a Fix Actually Looks Like

The 2025 modernization contract includes a dedicated image deduplication component, which uses hash-matching software to identify byte-for-byte identical files and flag near-duplicates for human review. The city's IT department estimates the cleanup could free up roughly 18 percent of current municipal storage capacity, though that figure has not been independently verified and applies only to the agencies included in the contract's first phase.

The Chicago Public Library's digital preservation unit on North Dearborn Street has been piloting a stricter file-naming protocol since January 2026 — requiring geographic metadata and a unique acquisition identifier on every image uploaded to its Contentful-based asset management system. Library staff say the protocol has already reduced duplicate submissions in the pilot program by a measurable margin, though formal results won't be published until the end of the third quarter.

The practical stakes extend beyond server costs. When journalists, researchers, or residents submit FOIA requests for photographic evidence related to city contracts or neighborhood development decisions — say, construction progress images from the Lincoln Yards site on the North Branch of the Chicago River — they can receive bloated, repetitive document packages that obscure rather than illuminate the record. Deduplication, done properly, makes the public record cleaner and more usable.

The next milestone is a cross-agency working group scheduled to meet in September 2026 at City Hall, where representatives from eight departments are expected to ratify a unified metadata standard. If adopted, new image uploads across the city's systems would carry enough identifying information to prevent the same file from being stored twice. For residents and watchdog groups tracking how the city spends public money, that meeting is worth watching. Standardized records don't make headlines, but they make accountability possible.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Chicago

Covering news in Chicago. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Chicago news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Chicago and accept our Privacy Policy. Unsubscribe anytime.