The productive AI race
By now even the most casual tech news follower is aware of productive AI tools like ChatGPT, Stable Diffusion, Midjourney, and DALL-E. The world’s top 5 companies compete to develop the best major language models and incorporate them into every software or web service we use. These tools can generate useful images or text using prompts. On the other hand, many of these tools are “trained” on works written by humans and require human oversight to bring their output to a meaningful level.
Revolutionizing visual editing with DragGAN
However, new artificial intelligence research reveals incredible progress, especially in the field of image manipulation. A group of scientists from Google, MIT, the University of Pennsylvania, and the Max Planck Informatics Institute in Germany have developed an experimental tool that could make image editing easier and more accessible to ordinary people.
It is enough to look at the examples in this news to understand what the new tool called DragGAN can do. With just a few clicks and a few seconds, it is possible to rotate the object in the image as if it were a 3D model, change facial expressions or make any other difficult adjustments you can think of. In the meantime, it should be noted that DragGAN is not a public model. Therefore, we did not have the opportunity to try the tool.
Your dream scene is just seconds away
The researchers note that DragGAN can change the content of an image in just a few seconds when using Nvidia’s GeForce RTX 3090 graphics card, because their application does not need to use multiple neural networks to achieve the desired results. The next step will be to develop a similar model for point-based editing of 3D models.