AI image generators attracted massive attention when the first powerful diffusion models were unveiled in 2022 and 2023. Now, AI video generation, buoyed by high-profile releases like OpenAI’s Sora, became the hot topic.
“I think a lot of tools are seeing the direction AI content creation is going, and just trying to stay up to date, trying to diversify [into video],” says
Ross Symons, cofounder of the creative AI company ZenRobot. “But I find the tools that are sticking to what they’re good at, like Midjourney, are dominating.”
So there’s still a lot of interest in AI image generation. Most modern AI image generators continue to use a diffusion model. This type of AI model is trained by teaching a neural network to predict random noise as it’s gradually added to images. Once training is complete, the process can be reversed to create unique images from random noise. The diffusion model is then paired with a text model, which converts the user’s text prompt into tokens that guide the diffusion model’s results.
Diffusion models have improved since 2022 but haven’t fundamentally changed, and in the past year, the spotlight shifted from the models themselves to the software tools they power. The best AI image generators currently are quick, easy to use, and offer ways to more finely control image generation.
https://www.midjourney.com/home
Midjourney is a clear favorite for professional image generation.Midjourney
While there’s literally dozens of image generators available,
Midjourney is the clear front-runner.
“Midjourney is a very important tool to us,” says
Chrissie Cremers, cofounder of Aigency Amsterdam. “It still excels in art direction.” Symons agrees, saying he uses “predominately” Midjourney.
Released in 2022, Midjourney quickly gained a reputation for detailed, appealing images with a strong artistic style. It tends to generate images with vivid, bold colors and a dramatic look that grabs a viewer’s attention. Midjourney’s diffusion model is proprietary and available only with a subscription to its titular online service.
Creative professionals like Midjourney’s user interface—which, given the tool’s origins, is a bit ironic. The model was initially available only through a Discord chatbot connected to a slapped-together user-authentication system. That rapidly changed, however, and Midjourney now offers a beautiful Web interface that stands apart from those of its peers.
Flux, the most significant new model released in 2024, can support image-editing tools.Flux
While Midjourney is still in favor among AI-powered creatives, 2024 saw the release of one major diffusion model that fills a different role:
Flux. The model was created by Black Forest Labs, a company founded by three former employees of Stability AI (the company behind Stable Diffusion).
Symons cited Flux as the most notable new image-generation model in 2024. “In terms of realism, it’s amazing,” he said. Cremers agreed, saying, “It’s very well suited for photorealism, much more so than Midjourney.” Flux also offers strong tools for
“in-paint and out-paint,” which allows the model to edit images by removing unwanted objects or expanding the image beyond its original scope.
In another contrast to Midjourney,
Flux is an open model, meaning it’s free for anyone to obtain, use, and fine-tune. It can be installed locally to generate images on powerful home and office computers.
https://leonardo.ai/
Leonardo.ai has carved out a niche by targeting creative professionals with specific needs.Leonardo.ai / Fyfe Boyce Design
Founded in 2022,
Leonardo.ai moved quickly to establish itself as a leading choice for individuals and companies looking for professional image generation.
It has largely focused on creating tools that use existing generative models in new ways, though the company did release its first in-house foundation model, Phoenix, in 2024. Leonardo.ai also offers Flux, and it ranks among the better options for those unable to run Flux on their own hardware.
“For character consistency, if you are generating characters that are illustrated or fall in the fantasy space, or in 3D, I think it’s good,” said Symons. This is achieved with support for
Low Rank Adaptation (LoRA), which lets users tweak a diffusion model to meet their needs.
https://openai.com/index/dall-e/
Dall-E 3, available through OpenAI’s ChatGPT, has an image-editing tool for adding or removing objects from generated images.OpenAI
OpenAI’s
Dall-E was among the first image generators to make headlines. In 2024, however, it was overshadowed by hype for OpenAI’s video-generation tool, Sora, which was finally released in December.
Despite that, Dall-E remains among the more prominent tools, especially for amateur use. Now a part of ChatGPT, Dall-E generates passable images quickly. An April 2024 update added image editing as well, to help users remove or change specific areas of an image generated by Dall-E.
https://labs.google/fx/tools/image-fx
Though only an “experiment,” Google’s ImageFX generates detailed images with excellent realism.Google
Google Labs, the company’s home for its “AI experiments,” includes an image generator called
ImageFX. Though not widely advertised, and a bit bare-bones in comparison with other tools on this list, ImageFX has earned a reputation among AI creatives in the know.
“It’s a bit of a dark horse. It’s not talked about, but it’s a good tool. It’s free, you can generate four images at a time, and the images are of seriously good quality,” said Symons.
ImageFX has a simplistic interface that encourages users to input short prompts. It also provides a list of keywords to guide the user’s prompt toward text that the model can easily understand. This approach makes it easy to generate detailed images with a specific sense of style. ImageFX is entirely free, too, which is a plus for more casual users.
The Best of the Rest
While the image generators mentioned above stole the show in 2024, they aren’t the only tools worth paying attention to.
Magnific has a reputation for excellent image upscaling. Krea.ai offers attractive interfaces for third-party models, including Flux.
Creative professionals seem less enthusiastic about image generators from established companies. Getty Images, Shutterstock, and Adobe have generative AI services, but they lack the buzz surrounding leaders like Midjourney and Flux. Cremers blames the images used for training: “They tend to be trained on a lot of stock imagery, and that reflects in the outcome.”
2025 seems likely to be a repeat of 2024. Video generation will continue to take the spotlight as several companies—most notably Google—have yet to release their highly anticipated models. Even so, Symons thinks creative professionals will continue to find new ways to use image generators as they become more familiar with the tools.
“I wish people understood the capabilities of AI more. There are certain things you can do, and things you can’t,” said Symons.
From Your Site Articles
Related Articles Around the Web