Just last week, I wrote about a bunch of young engineers working on a project called Jarvis to create an operating system that is controlled by natural input methods such as voice and gestures and based on artificial intelligence. The international team, led by two Indians, was inspired by the virtual butler of the superhero Iron Man. Obviously, they hadn’t seen the movie Her.
Her might be a work of fiction set in the future, but it gives a clear picture of where our computing technologies are headed: a world where the device, which is now the central part of all computing, is pushed to the sidelines as everyone interacts directly with the operating system. Throughout the movie, the protagonist interacts with his operating system, OS1, but rarely does he sit in front of a PC or hold a smartphone in his hand. Samantha, as his virtual assistant likes to call herself, is Siri on steroids, though she doesn’t sound like it.
Across the world, multiple teams are working on ways to make computing more natural. The idea is to take away interfaces that need traditional input methods like keyboards and mice and replace them with voice, gestures or, in extreme cases, thought. And don’t think I am taking you on an Isaac Asimov-like trip of future technologies. In fact, some of these technologies are already here. Take Apple’s Siri or Google Now, both of which are effective at taking orders from you and executing simple searches. No, they still can’t do what Samantha does, like sort your mail or analyse your voice and understand that you are in a bad mood.
Actually, Samantha does much more than sort mail. But as Vlad Sejnoha, chief technology officer of Nuance Communications, explains in a blog post, even Samantha’s first, strictly utilitarian incarnation is impressive. “Her speech recognition, natural language understanding, speech generation, dialog, reasoning, planning, and learning all far exceed the current state of the art,” Sejnoha writes. He should know, for Nuance is at the very forefront of the technologies showcased in the movie. Two-thirds of Fortune 100 companies rely on Nuance solutions, which are also used in 7 billion mobile phones and 70 million cars. But what is really crucial for the success of voice technologies is the database, and Nuance has one of the largest libraries of speech data in the world. It is this large database that now lets devices understand much more than American accents.
Even as voice control becomes more common and effective in devices, we are also seeing gesture control make its presence felt. Most top-end smartphones now understand some gestures, though the feature is still mostly gimmicky. Even televisions are tuning in, with smart TVs from Samsung able to flip channels as you wave your hands.
At least two companies are working on rings that continued…