Imagen: Google AI Can Create Ultra-Realistic Images With Just A Text Description


Published on:

Google introduced the Imagen neural network, which generates images based on text. Imagen is very similar to DALL-E 2, the artificial intelligence developed by Open AI that also allows images to be generated based on a text description.

However, there are several differences between the two models, such as the level of detail and the efficiency in creating that image.

Imagen comes from the Brain Team at Google Research, and it is based on the Transformer T5 model, introduced in 2020.

The operation of Imagen is similar to that of DALL-E 2. The AI ​​converts a small text into a highly detailed image that matches what is described. The combinations are almost unlimited, and in most cases, DALL-E 2 managed to offer us an image very similar to what we asked for. Now Google says it has ironed out some of the gaps in the OpenAI tool and has managed to generate images that humans prefer.

Google Imagen AI demo

Originally the AI ​​produces 64 x 64 pixel images, but they are later scaled to 1024 x 1024 pixels. The same resolution as DALL-E 2. This idea of ​​scaling is what relieves the calculation power and allows the generation of images in a few seconds.

Google ensures that its AI offers results with a much more precise level of detail compared to other systems. To prove this, the company created a benchmark called DrawBench, which compares its AI model with similar AI models, such as VQ-GAN+CLIP, Latent Diffusion Models, or even DALL-E 2, and exposed the results “side by side” so that “human evaluators” can differentiate between them and choose the most realistic.

These evaluators, according to Google, concluded that the images generated by Imagen have a higher quality and a better “image-text alignment” compared to the rest of the models. However, OpenAI’s neural network is ahead of Google’s, as it is already a full-fledged, albeit closed beta, and people use it for everyday tasks and entertainment.

Unfortunately, Google is still concerned about the misuse of this AI, something that also happens with DALL-E 2, and for this reason, it has decided not to make it available to users for the time being. When Google will offer those who wish to use Imagen is not yet clear.

Sabarinath is the founder and chief-editor of Code and Hack. With an unwavering passion for all things futuristic tech, open source, and coding, he delves into the world of emerging technologies and shares his expertise through captivating articles and in-depth guides. Sabarinath's unique ability to simplify complex concepts makes his writing accessible and engaging for coding newbies, empowering them to embark on their coding journey with confidence. With a wealth of knowledge and experience, Sabarinath is dedicated to providing valuable insights, staying at the forefront of technological advancements, and inspiring readers to explore the limitless possibilities of the digital realm.

Related Posts:

Leave a Reply

Please enter your comment!
Please enter your name here