AI could herald a new digital era as apps give way to voice agents
Jaspreet Bindra
Last week, I watched the CES (Consumer Electronics Show) launch and demo of a product called the Rabbit R1. At the end of the 30-minute video, I paid $200 and became one of the ten thousand people who ordered it on Day Zero. The Rabbit looks like a small boxy phone with a camera (called the Rabbit Eye), but what makes it special is the interface. Instead of the grid of apps that smartphones have accustomed us to, the Rabbit is an AI assistant that conveys what you want to your favourite app and makes it happen (see the video here: https://bit.ly/48SMEfN).
This is not just about Siri- or Alexa-like voice commands; what breakout products like the Rabbit or Humane’s AI Pin herald is a completely new era of computing. No less than Bill Gates says so. A few decades back, he foresaw the era of a PC with a GUI (Graphical User Interface) on every desk, and he is again predicting a fundamental shift in how humans work with machines – this time with AI and a natural language interface (NLI). Thus far, you had to learn the language of a computer to communicate with it and get it to do your bidding, hence the term ‘computer languages’ for Python, JavaScript and the like. With AI and large language models, that changes: we can now use human languages such as English, French or Bengali. “To do any task on a computer,” writes Gates (https://bit.ly/3tSMNkB), “you have to tell your device which app to use. You can use Microsoft Word and Google Docs to draft a business proposal, but they can’t help you send an email, share a selfie, analyse data, schedule a party, or buy movie tickets. In the next five years, this will change completely. You won’t have to use different apps for different tasks. You’ll simply tell your device, in everyday language, what you want to do.” To put it simply, AI is the interface.
You can see that happening in another such innovation – the AI Pin. Humane revealed the AI Pin last year to great acclaim; you wear it on your chest and summon it when you need something done. There is no screen and there are no apps; you simply talk to it. It is expensive, costing as much as a phone, and it may not succeed; neither may the Rabbit. But what these products presage is the move from apps to agents. Apps are the universal user interface today, and they help us do a lot of stuff, but they are actually dumb and clunky – you need to tell them what to do, and it takes a while to navigate through them. Agents are intelligent, and they do stuff for you. As John Koetsier writes in Forbes (https://bit.ly/48NUsA5): “Apps are an interface to accomplish a task, but the best interface is simply doing the requested action. Star Trek’s virtual omniscient ship AI didn’t ask Captain Picard to install an app when he asked a question.” You can use an app to order a pizza; an agent will know from your history that you want something to eat and that you like a certain kind of pizza, and it will offer to order one for you. Bill Gates explains how agents will go farther than today’s AI bots do: “…bots are limited to one app and generally only step in when you write a particular word or ask for help. Because they don’t remember how you use them from one time to the next, they don’t get better or learn any of your preferences. Agents are smarter. They’re proactive—capable of making suggestions before you ask for them. They accomplish tasks across applications. They improve over time because they remember your activities and recognize intent and patterns in your behaviour. Based on this information, they offer to provide what they think you need, although you will always make the final decisions.”
We can see the beginning of this revolution with the Rabbit and the AI Pin, and even with Windows Copilot from Microsoft, which can significantly enhance human productivity. This profound shift to AI will upend the smartphone platforms, much as smartphones disrupted feature phones. Gates says that natural language interfaces and agents will “bring about the biggest revolution in computing since we went from typing commands to tapping on icons…Agents will be the next platform.”
ChatGPT and other bots, however impressive, are just the very beginning of the AI era. AI is far more than a technology; as Gartner says: “It is not just a technology or business trend. It is a profound shift in how humans and machines interact.” The power of natural language AI agents will reshape big tech, fundamentally upend computing, and change our lives the same way that PCs and smartphones did over the last few decades, as we go down this new Rabbit hole.