Google no longer archives: it curates, selects, and forgets
Sometimes I wonder whether we are witnessing, in silence, the end of an era. For years, Google’s index was perceived as an infinite library: a neutral archive where every page had the right to exist while waiting to be found. Today that library has changed its nature. It does not preserve everything: it chooses what to keep.
It does not catalog: it selects.
It no longer expands without limit: it optimizes.
In 2025, Google quietly established — without any official announcement — a new operating principle: indexing is no longer a given. It is conditioned by a selective logic that interweaves quality, trust, and engagement. In other words: Google is no longer an engine that archives, but a system that evaluates and shapes the digital knowledge landscape. It is the dawn of what we might call the new economy of relevance.
From Universal Index to Selective Index
Between May and June 2025, Search Console data from thousands of sites around the world showed an anomalous pattern: a sudden decrease in “indexed” pages and an exponential increase in the “Crawled — currently not indexed” status.
Marie Haynes, one of the most widely followed SEO analysts internationally, documented the phenomenon in detail:
“Google appears to have removed en masse pages that it probably would never have shown to users anyway. Old pages, duplicate pages, or pages that were simply being ignored.”
Haynes describes the event as a genuine qualitative purge of the index: an algorithmic revision that prioritizes content that is active, unique, and still capable of generating user engagement.
The phenomenon was also discussed on Search Engine Roundtable. John Mueller, Search Advocate at Google, responded to the reports by explaining that:
“Our systems regularly make adjustments to what gets crawled and indexed. It’s normal that not everything stays in the index over time.” (John Mueller on Bluesky, June 5, 2025)
A statement that, read between the lines, confirms the shift from a logic of universal inclusion to active index management. Mueller speaks not of bugs, nor of penalties, but of “adjustment.” As if Google had decided to prune the tree of the web, keeping only the branches that flower.
The Index as a Trust System
In the new paradigm, being indexed is no longer an implicit right, but an act of algorithmic trust. Every page must prove it deserves a place — not only for textual quality, but for resonance within the ecosystem.
Google cross-references direct and indirect signals that concern not only the content, but its real-world impact:
- engagement;
- freshness;
- credibility;
- accessibility.
When Martin Splitt, Developer Advocate at Google, declared in March 2025 that
“We gave certain pages a chance. Users didn’t use them. Other content worked better.”
he perfectly summarized the new logic: the index is meritocratic.
What matters is not just the intrinsic quality of a page, but empirical proof of its value: user behavior. In the absence of engagement, the algorithm interprets the page as irrelevant and removes it. Not out of malice, but as a matter of economies of scale — as I had already anticipated in a recent article.
Link to: Sustainability, AI, and the Traffic Collapse: the Hidden Environmental Cost and New Challenges for Digital Publishers
From Abundance to Scarcity
For at least two decades, SEO fed on a basic assumption: more pages = more chances of capturing an opportunity. In 2025, that paradigm has flipped.
Content production, which has grown exponentially thanks to generative artificial intelligence and the unchecked proliferation of speculative sites, has saturated the web at a speed never seen before. Every day, dozens upon dozens of new sites launch, millions of pages go live — often indistinguishable from one another. For Google, the index has become a territory to be managed, not expanded.
As Adam Gent, founder of Indexing Insight, observes:
“Indexing is no longer automatic: it has to be earned. 88% of the pages removed in our sample of 1.7 million URLs had quality issues or low engagement.” (Indexing Insight Report, June 2025)
In other words, the logic has changed: abundance has become a cost. Google prioritizes efficiency and reduces redundancy. The objective is no longer to archive everything, but to ensure the index remains useful and to enhance the user experience.
The Invisible Semantics of Algorithmic Reputation
Within this “new economy of relevance,” an invisible semantics of trust is taking shape. Google interprets the value of content through a combination of quantitative and qualitative signals that, taken together, form a genuine digital reputation.
Link to: From Brand to Source: the Silent Evolution Rewriting the Future (and Present) of Digital Publishing
It is not something that can be calculated with a score or a single metric, but an algorithmic narrative built over time:
- Who you are;
- What you publish;
- How the public responds to your content;
- How relevant you are to others.
From this perspective, de-indexing is not a failure, but a judgment of momentary irrelevance. A page can be “read but not retained” — much like an article published but not considered noteworthy by a scientific journal.
The Index as a Mirror of Trust
There is an almost anthropological dimension to this transformation: the index does not just measure content, but the credibility of the context that produces it. Google is evolving from a search engine into a curator of collective knowledge, where visibility is granted to those who maintain consistency, authenticity, and recognizability — factors we have long highlighted in our advisory work as fundamental, requiring dedicated strategies to build.
The trend is clear: the web is moving from a universal, inclusive, and indiscriminate index to an intentional index built on criteria of value, relevance, and trust. It is a structural transition: digital existence becomes a concession, not a certainty.
In the new information ecosystem, visibility is no longer taken for granted — it is negotiated. Google is no longer the memory of the web, but its selective conscience.
Fortunately, not everyone is suffering through this change: some are understanding and anticipating it. The work we carry out in close collaboration with some of the leading publishing organizations confirms that those who manage to adopt a more conscious, strategic, and value-oriented approach prove to be more resilient.



