Google said on Wednesday that it is “aware that Gemini is offering inaccuracies in some historical image generation depictions” and that it is “working to improve these kinds of depictions immediately.” (Image: Google)

Google received a lot of flak for its Gemini chatbot’s AI image generation feature, which was launched three weeks ago. Users accused it of overdoing diversity and inclusion when generating images of people. For example, one user pointed out that Gemini produced images of people of various ethnicities when asked to depict the founding fathers of the United States, who were in fact all white men, and this “historical inaccuracy” was deemed a problem.
The search giant on Friday acknowledged the issue and said that it will work to fix it while the feature is temporarily paused.
“Three weeks ago, we launched a new image generation feature for the Gemini conversational app (formerly known as Bard), which included the ability to create images of people. It’s clear that this feature missed the mark. Some of the images generated are inaccurate or even offensive. We’re grateful for users’ feedback and are sorry the feature didn’t work well. We’ve acknowledged the mistake and temporarily paused image generation of people in Gemini while we work on an improved version,” wrote Prabhakar Raghavan, Google’s senior vice president, in a company blog post.
The image generation feature in Gemini is built on an AI model called Imagen 2. Google tuned the feature to avoid some of the problems the company says it has seen in other image generation products, such as their use to create violent or sexually explicit images, or depictions of real people.
The company also tried to build standards of diversity, equity and inclusion into the product, but it seems to have overshot the mark. Raghavan wrote that if a user prompts for something generic, such as a group of football players or someone walking a dog, it is desirable for the results to depict people of more than one ethnicity.
“However, if you prompt Gemini for images of a specific type of person — such as “a Black teacher in a classroom,” or “a white veterinarian with a dog” — or people in particular cultural or historical contexts, you should absolutely get a response that accurately reflects what you ask for,” admitted Raghavan.
Google seems to have gotten two things wrong here. While the company tuned the model to ensure that a range of people is depicted, it did not account for cases where it clearly should not show such a range. The company also says the model became far more cautious than intended, refusing to answer some prompts entirely because it wrongly interpreted innocuous prompts as sensitive.
The company has temporarily paused Gemini’s ability to generate images of people and will bring it back only after extensive work, including rigorous testing.