Mosaic Started It. Agents Will Finish It.

Why the age of browsers is ending, and what’s taking its place.

In 1993, a small group of developers at the University of Illinois released a curious new piece of software: Mosaic, the first widely used web browser. It could display text and images on the same page — a novelty at the time — and it introduced millions to a new kind of digital experience: the World Wide Web.

Mosaic wasn’t flashy. Pages loaded slowly. Links were blue, underlined. Interfaces were sparse and static. But it was a revelation.

What’s surprising isn’t how far we’ve come since then — it’s how much has stayed the same.

Thirty years later, we still open a browser, type a URL or search query, and view HTML rendered into a page. We still click links, fill out forms, and scroll through documents. The underlying interaction model — document-based, input-driven, screen-bound — has barely evolved.

Yes, we’ve added speed, polish, and mobile responsiveness. We’ve stacked a leaning tower of JavaScript frameworks on top — React, Angular, Vue, Svelte — all designed to bend the browser into something it was never meant to be: an application runtime. And to some extent, it works.

But at its core, the browser is still doing what Mosaic did: rendering markup into documents. The interface hasn’t caught up with the intelligence behind it.

That’s about to shift.

The Silent Evolution

The internet as we know it — navigated through browsers, powered by search engines, connected by APIs — is being quietly dismantled and rebuilt beneath our fingertips. This transformation isn’t just technological; it’s architectural.

For decades, web browsers have served a singular purpose: interpret HTML, CSS, and JavaScript to render human-readable interfaces. These technologies weren’t designed for complex applications but were stretched and contorted to accommodate increasingly sophisticated needs. We’ve built workarounds upon workarounds, bending document-centric technologies to serve application-centric purposes.

Enter MCP and A2A

Recently, a significant development emerged: Model Context Protocol (MCP). This innovation allows applications (MCP clients) to communicate with AI services (MCP servers) in a standardized way, enabling more consistent and powerful AI interactions.

Soon after, Google’s A2A (Agent-to-Agent) protocol was released which takes things a step further and enables autonomous agents to communicate with each other directly.

Together, MCP and A2A hint at a much larger architecture shift. But we’re still early. Most current implementations focus on primitive text-based exchanges: we type commands, and we get text back. It works, but it hardly feels revolutionary.

This is where we stand today. But there’s an obvious next step that no one has taken yet.

The Missing Piece: Rich MCP Clients as the New Browser

What if MCP clients evolved beyond simple text conduits to become full-fledged interpreters of a new kind of protocol — one specifically designed for human-agent interaction?

Just as web browsers transformed the internet by interpreting HTML to create visual experiences, rich MCP clients could interpret a new kind of semantic markup from AI systems to create intuitive, interactive interfaces on demand.

Imagine a conversation with an AI where:

Instead of typing “I’m available next Tuesday at 3 PM,” a calendar interface appears automatically when scheduling is discussed
Rather than reading paragraphs describing data, you receive interactive visualizations you can manipulate directly
Complex decision trees unfold as elegant, tappable options rather than lengthy text explanations

This isn’t merely adding buttons to chatbots. It’s a fundamental reimagining of human-computer interaction where the interface itself is generated dynamically based on context.

From Typing to Tapping

Humans prefer to tap, swipe, and click rather than type lengthy commands. Our fingers evolved for manipulation, not just communication. Yet our interactions with the most advanced AI systems still rely primarily on typing. The state of the art is reminiscent of connecting on a 300-baud modem to an online bulletin board system in the late 90’s — except you don’t even get ANSI colours, just markdown output!

Rich MCP clients would bridge this gap. They would understand when to present a slider for selecting a price range, when to offer a drawing canvas for creative collaboration, or when to display interactive 3D models for spatial concepts — all based on cues from the AI system. An adaptive user interface for the agent economy, if you will.

The Ripple Effects

This evolution would transform more than just user experience:

Developers would design AI experiences rather than static interfaces
AI systems would consider not just what to say, but how information should be presented
Users would interact with digital systems through natural conversation augmented by intuitive controls

Most importantly, this approach would make agent-driven interfaces accessible to everyone, not just those comfortable with text-based prompting.

The Blueprint Is Ready

The technical foundation for this transformation already exists. MCP provides the communication framework. Modern browsers have proven that interpretive rendering engines can create rich experiences. AI systems increasingly understand context and intent.

What’s missing is the connective tissue — a fabric tailored for agent-human interaction and clients designed to interpret it. The first companies to bridge this gap will define the next era of digital experience.

The New Internet Taking Shape

This isn’t incremental improvement — it’s a fundamental rewiring of how we experience the digital world. The internet of documents gave way to the internet of applications, which is now yielding to the internet of agents.

And just as browsers were the gateway to previous iterations, rich MCP clients could be our window into this new world — a world where digital interaction feels less like operating machinery and more like collaborating with an intelligent partner.

The question isn’t if this transformation will happen, but who will lead it — and how quickly we’ll adapt to a fundamentally different way of experiencing the digital realm.