XaiJu
All Your Tech
All Your Tech

patreon


Aura Flow - flow-based text-to-image generation model

Today a new open source model was announced called Aura Flow. It's touted as a state-of-the-art flow-based text-to-image generation model, and this is more important than ever with the drama surrounding Stable Diffusion 3 and Stability AI. Naturally I was quick to get it up and running over at Pixeldojo.ai, but the real question is... How does it stack up to Stable Diffusion 3?

Let's find out:

prompt: Create an 8k, hyper-detailed image of an old wooden chair placed in the middle of a dense forest. The chair is surrounded by lush, green moss, blending seamlessly with the natural environment. Next to the chair stands an ancient, gnarled tree with thick bark and twisted branches. On the ground beside the chair, there is an antique gramophone, partially covered in moss but still distinguishable. The scene should feel serene and poetic, with soft sunlight filtering through the forest canopy, casting dappled shadows and highlighting the peaceful beauty of the setting. Butterflies flutter gently around, enhancing the enchanting atmosphere. Ensure the focus remains on the chair and the gramophone, capturing the tranquility and beauty of nature reclaiming these man-made objects.

SD3: https://pixeldojo.ai/community-gallery/1716406354089-lh6lg0b5i


Aura Flow: https://pixeldojo.ai/community-gallery/1720788400678-uid9uy0ms




Prompt: toy soldier as a 3d illustration with toy tank in the background on in a children's room the tank is firing candy and a giant marshmallow figure and the soldiers in the background have differentt expressions

Negative: (lowres, low quality, worst quality:1.2), (text:1.2), glitch, deformed, mutated, cross-eyed, ugly, disfigured (lowres, low quality, worst quality:1.2), (text:1.2), watermark, painting, drawing, illustration, glitch, deformed, mutated, cross-eyed, ugly, disfigured

SD3: https://pixeldojo.ai/community-gallery/1719279223512-5rhbr0q3x
 
Aura Flow: https://pixeldojo.ai/community-gallery/1720788492911-pbdstn53s
 
Prompt: The image features a woman with blonde hair, who appears to be an angel due to the large white wings attached to her back. She is wearing a white, strapless dress with a sweetheart neckline and a thin waistband. The dress flows down to the floor, and she is standing with her hands gently resting on her hips. The setting is a nighttime street scene with a dark, moody atmosphere. There are buildings in the background, and the sky is filled with stars, suggesting a clear night. The streetlights are on, casting a warm glow on the scene. There's a sense of motion in the image, as if the woman is walking or floating down the street. The interesting details in the image include the contrast between the ethereal figure of the woman with wings and the mundane, urban environment. The wings are large and realistic-looking, which adds to the surreal and dreamlike quality of the scene. The lighting and composition of the image create a sense of depth and mystery, inviting the viewer to wonder about the story behind the image.

SD3: https://pixeldojo.ai/community-gallery/1718902530648-q5p9evbgr

 
Aura Flow: https://pixeldojo.ai/community-gallery/1720788691337-illgq0ecy

 
I decided to run this one through the Creative Upscaler and like how it turned out: https://pixeldojo.ai/community-gallery/1720788778381-17ysz119v

 
What about text generation?

Prompt: a portrait photo of an anthropomorphic tortoise holding a sign reading "All Your Tech AI Loves Stable Diffusion 3!"

SD3: https://pixeldojo.ai/community-gallery/1716221194001-ferqiss8l



Aura Flow: https://pixeldojo.ai/community-gallery/1720789096854-i1owcrhr0


Complex Prompting:

This image depicts a surreal and futuristic sushi-making factory. The factory is filled with an intricate network of clear, cylindrical tubes that are stacked on top of each other, creating a towering structure. These tubes are filled with various sushi rolls, each one meticulously arranged with different ingredients such as salmon, tuna, avocado, and seaweed. At the base of the tubes, there are chefs in white uniforms and toques, who are intently observing the tubes. They appear to be monitoring the sushi-making process, ensuring that everything is going smoothly. The chefs are positioned on a walkway that runs horizontally across the tubes, allowing them to oversee the entire operation. The tubes are connected by a complex network of pipes and tubes that crisscross above and below the walkway. These pipes likely serve as conduits for the sushi ingredients and tools necessary for the sushi-making process. In the foreground, there are two distinct sushi rolls that stand out. One roll has a green frog perched on top, and the other has a leopard print frog. These whimsical additions add a touch of humor and surrealism to the scene. The overall atmosphere of the image is one of industrious efficiency mixed with playful imagination. The chefs' focused expressions and the intricate setup of the factory convey a sense of precision and dedication to their craft. However, the inclusion of the frogs introduces a fantastical element, as it is not common to see live animals in a sushi-making environment. The lighting in the image is bright and artificial, casting a sterile glow over the entire factory. The colors are vivid, with the orange of the salmon, the green of the avocado, and the white of the rice creating a visually appealing palette. Overall, this image is a creative and imaginative take on the sushi-making process, blending reality with fantasy to create a captivating and thought-provoking scene.


SD3: https://pixeldojo.ai/community-gallery/1719226042660-0o71m1g6i

 
Aura Flow: https://pixeldojo.ai/community-gallery/1720789190340-k41j8vcuu



Let me know what you think. It shows promise, but not quite there yet from my testing. Either way I'm sure it will evolve quickly, and it's fantastic to see another new entry in the image generation landscape. Try it out here: https://pixeldojo.ai/aura-flow

Aura Flow - flow-based text-to-image generation model Aura Flow - flow-based text-to-image generation model Aura Flow - flow-based text-to-image generation model Aura Flow - flow-based text-to-image generation model Aura Flow - flow-based text-to-image generation model Aura Flow - flow-based text-to-image generation model

More Creators