Gemini introduces Custom Gems and Imagen 3 for improved image generation

Gemini users can now create a team of experts to help them think through a challenging project or brainstorm ideas

Gemini, Google’s flagship artificial intelligence tool, has announced Custom Gems, a feature that lets users customise the tool, and Imagen 3, an advanced image generation model. This was disclosed in a release by Dave Citron, Senior Director of Product Management at Gemini Experiences.

According to the statement, Gemini is rolling out the new features following their preview at Google I/O. The release noted that Custom Gems, which lets users customise Gemini to create their own personal artificial intelligence experts on any topic, is now available only to Advanced, Business and Enterprise users. Imagen 3, the new image generation model, is now available to all users, including Gemini, Advanced, Business and Enterprise users.

“The ability to create custom Gems is coming to Gemini Advanced subscribers, and updated image generation capabilities with our latest Imagen 3 model are coming to everyone,” Dave Citron said.

Gemini’s custom version will provide everything from coding to career advice 

Over the coming days, Gemini Advanced, Business and Enterprise subscribers can start creating and chatting with Gems, the custom versions of Gemini first previewed at I/O. They can customise Gems to act as experts on specific topics or refine them toward their own goals.

To achieve this, users simply have to write instructions for their Gem, give it a name, and then chat with it whenever they want. 


With this new addition, users can create a team of experts to help them think through a challenging project, brainstorm ideas for an upcoming event, or write the perfect caption for a social media post. The customised Gemini can also remember a detailed set of instructions to help users save time on tedious, repetitive or difficult tasks. 

Gemini is also launching a set of premade Gems for different scenarios to help users get started. These include the Learning coach, which helps users break down complex topics and makes them easier to understand, and the Brainstormer, which gives users easy inspiration, from fresh ideas for a themed party to the perfect gift for an upcoming birthday.

Other premade Gems include the Career guide, which unlocks users’ career potential with detailed plans to refine their skills and achieve their career goals; the Writing editor, which can elevate writing through clear, constructive feedback on everything from grammar to structure; and the Coding partner, which can level up a user’s coding skills and help them build projects while learning as they go.

Gems are now rolling out on desktop and mobile devices to Gemini Advanced, Business and Enterprise users in more than 150 countries and in most languages.

Generate high-quality images with Imagen 3 

Google Gemini has also upgraded its creative image generation capabilities with the introduction of Imagen 3. Over the coming days, the company says it will be bringing this latest image generation model to Gemini Apps and expanding its availability for users in all languages. 

Image: A tiny dragon hatching from an egg in a sunlit meadow, surrounded by curious glowing butterflies, with vibrant colors and detailed scales, generated with Gemini

Imagen 3 sets a new standard for image quality, generating images with just a few words. Users can even ask Gemini to create images in various styles — like photorealistic landscapes, textured oil paintings or whimsical claymation scenes. 

“Imagen 3 brings advanced image generation capabilities that come with built-in safeguards and adhere to our product design principles. Across a wide range of benchmarks, Imagen 3 performs favourably compared to other image generation models available. And as with Imagen 2, we use SynthID, our tool for watermarking AI-generated images,” the company said.

The company said that, in line with its design principles, users will remain in control of the creative process from start to finish. If the initial image does not meet their expectations, they can simply tell Gemini what they would like to change and it will generate a new image.

Image: A vibrant abstract painting with the words “Dream Big” splashed across the canvas in bold colors

The company said that, over the coming days, it will also start to roll out the generation of images of people, with an early access version for Gemini Advanced, Business and Enterprise users, starting in English. It added that it has worked on technical improvements to the product, along with improved evaluation sets, red-teaming exercises and clear product principles.

“With Imagen 3, we’ve made significant progress in providing a better user experience when generating images of people. We don’t support the generation of photorealistic, identifiable individuals, depictions of minors or excessively gory, violent or sexual scenes. Of course, not every image Gemini creates will be perfect, but we’ll continue to listen to feedback from early-access Advanced users as we keep improving,” the company said.

Gemini said the rollout of Imagen 3 will be gradual as it aims to bring it to more users and languages soon.
