
Character.AI, the popular platform that allows users to create and interact with AI chatbots has introduced a new diffusion model called TalkingMachines. The Google-owned startup says its newest AI model enables “real-time, audio-driven, FaceTime-style video generation.”
In a blog post, Character.AI said that users said the new model can help generate an interactive, real-time video of characters with different styles, genres and identities using just an image and a voice signal.
For audio, Character.AI is using a custom-built 1.2B parameter audio module that captures both speech and silence, with the company claiming that it can achieve “infinite-length generation with no quality degradation over time.” The company goes on to say that its new AI model supports a variety of styles, like photorealistic humans to anime to 3D avatars and builds on the core infrastructure for role-playing, storytelling and interactive world-building.
Character.AI has been constantly adding new features like a new image-to-video generator called AvatarFX, Scenes and Streams. Following OpenAI’s advanced voice mode, the startup even added a call feature that allows users to engage in voice conversations with the character of their choice to increase engagement.
Last year, the Google-owned startup was sued by the mother of a 14-year-old kid in Florida who claimed that a Character.AI chatbot encouraged her son to kill himself. Since then, the company has introduced new supervision tools to ensure the online safety of users under 18.