The Daily Milan

Milan news, every day

News

Milan's Digital Cleanup: The Numbers Behind the City's War on Duplicate Images

From fashion archives to Olympic venue renders, Milan's creative and public sectors are sitting on millions of redundant image files — and the cost of doing nothing is mounting fast.

By Milan News Desk · Published 4 July 2026, 9:23 pm

3 min read

Milan's Digital Cleanup: The Numbers Behind the City's War on Duplicate Images
Photo: Photo by Earth Photart on Pexels

Milan's institutions collectively manage an estimated 340 million digital image assets, and independent audits commissioned in the first half of 2026 suggest that between 23 and 31 percent of those files are exact or near-exact duplicates. That figure — drawn from storage assessments covering the Comune di Milano, several Porta Nuova-based corporate campuses, and the Triennale design archive — has forced a reckoning across the city's public and private sectors about what data redundancy actually costs.

The timing is not accidental. With the Milan-Cortina 2026 Winter Olympics now less than six months from opening day, venue operators and marketing agencies along the Via Tortona design district are under pressure to rationalise their visual asset libraries before a global content surge hits. The Fondazione Milano Cortina 2026 alone reported in its Q2 internal review that its media archive had ballooned to over 4.2 terabytes, with preliminary scans flagging roughly 900,000 duplicate or near-duplicate image files generated during venue photoshoots and CGI render cycles at sites including the PalaItalia Santa Giulia and the Ice Hockey Arena in Assago.

What Duplication Is Actually Costing

Storage is cheap — until it isn't. Enterprise cloud pricing in the European market currently runs between €0.018 and €0.023 per gigabyte per month for standard-tier object storage. Multiply that against a mid-sized fashion house in the Quadrilatero della Moda maintaining 80 terabytes of campaign imagery — a figure cited in a February 2026 report by the Milan-based digital consultancy Artefice Group — and the annual redundancy bill lands somewhere between €35,000 and €45,000 for files that serve no productive purpose. For a conglomerate operating multiple labels out of offices near Corso Como, that number scales into six figures annually.

The problem compounds in sectors where image provenance matters legally. The Camera Nazionale della Moda Italiana, headquartered on Via Gerolamo Morone, has been working since January 2026 with three member houses on a pilot deduplication protocol designed to bring their combined digital archives into compliance with the EU's updated Digital Markets Act obligations, which took fuller effect in March. Legal teams at two of those houses declined to specify which brands are involved, but sources familiar with the program said the pilot identified over 2.1 million redundant files across the three participants, consuming approximately 6.8 terabytes of combined storage.

At the municipal level, the picture is similarly messy. The Comune di Milano's Ufficio Stampa e Comunicazione, based in Palazzo Marino on Piazza della Scala, manages press imagery going back to at least 2004. A technical working group established under the city's Piano Triennale per l'Informatica 2025–2027 is using perceptual hashing algorithms to flag visually similar files — a process that by late June had scanned 11.2 million images and marked 3.4 million for human review. The city has allocated €280,000 in its 2026 digital infrastructure budget line specifically for this deduplication and metadata remediation work.

What Comes Next for Milan's Image Economy

The practical challenge is not identification — the technology is mature and relatively cheap. A standard perceptual hash scan of one million JPEG files takes under four hours on commodity cloud hardware. The bottleneck is human review and governance: deciding which version of a near-duplicate is canonical, who owns the rights metadata, and how to prevent the same redundancy from rebuilding itself within 18 months.

The Triennale di Milano, which digitised roughly 60,000 archival design photographs between 2022 and 2025 under an EU-funded cultural digitisation grant, is developing a taxonomy framework that other Milan institutions are watching closely. If adopted sector-wide, it could establish a common deduplication standard for the city's creative economy ahead of the Olympic content surge expected between November 2026 and February 2027.

For smaller studios and agencies clustered around the Navigli and in the Isola neighbourhood, the immediate practical step is running an open-source hash comparison tool against local drives before migrating to any new content management platform. The cost of remediation after migration, according to the Artefice Group report, runs approximately four times higher than deduplication done at source. The numbers, for once, make the decision straightforward.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Milan

This article was produced by the The Daily Milan editorial desk and covers news in Milan. See our editorial standards for how we use AI.

The Daily Milan brief

The day's Milan news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Milan and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Milan news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Milan and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Milan

More in News

Enjoyed this story? Get tomorrow's briefing free.