AI Generator Comparison — alethia.design

Texture and Style Experiment, AI Generator Comparison

I compared how various Firefly image generators handle texture and style using a visual example and a text prompt. I used a geometric pattern as the image and paired it with this prompt:

"A glamorous woman wearing a 1950s Marilyn Monroe-inspired halter dress, crafted with the texture of the selected image, poses elegantly in a dimly lit studio. The lighting enhances the texture of the dress and the pleats of the skirt, evoking classic Hollywood fashion photography."

This experiment shows how Firefly boards combine images and text to create new, creative designs while keeping the original style.

Balanced style and realism.
Translated the pattern into fabric form.
Managed lighting and folds to evoke the mood of 1950s fashion photography.

The Outcome & Observations

Firefly 4 Ultra

it created a striking image where the dress and background blended together, sharing the same blue geometric pattern. This gives the picture a smooth, dreamlike look, similar to modern camouflage or illusion photography.
This shows how Firefly 4 Ultra manages pattern copying and space difference. It is great at making rich, artistic images but often blends parts together if the subject and background have similar colors or textures. Future versions or comparisons should look for models that separate subject from background better, especially for fashion and texture-focused projects.
Firefly 4

It created a detailed and well-lit image that captured vintage glamour, showing clear facial features, highlights, and realistic fabric folds. The dress design beautifully reflected the reference pattern, using geometric shapes to add depth.
This version highlights Firefly 4’s skill in realistic lighting and style, but also shows it struggles to keep the subject separate from the background. While the look is bold and cinematic, projects that need clear textile contrast, fashion details, or material realism may need improvements or a tool better at isolating elements.
Firefly Image 3 (fast)

it created a visually pleasing and elegant dress with a strong blue color theme. The geometric pattern fit well on the fabric, adding a rich and formal feel. The look captures vintage Hollywood style with clear lighting and a simple shape.
This test shows Firefly Image 3 (fast) is good at making consistent textures and lighting, resulting in polished and well-matched images. However, it often focuses more on overall look than separating layers clearly. For better fabric realism and material detail, further adjustments or a different model might be needed to clearly separate the dress from the background.
Firefly Image 3

It created a highly stylized, almost cartoon-like result. It turned the texture into a vector-style design, giving the image a graphic-art feel. The dress echoes the background’s geometric pattern, making the colors and shapes work well together. Bold outlines and a consistent blue tone add a modern touch, similar to digital prints or patterns.
This test shows Firefly Image 3 prefers artistic style over realism. It turns patterns into a clear visual style, showing good design sense. However, for projects needing fabric texture, realistic details, or clear subjects, this model blends elements too much. While it doesn’t meet technical goals, the result is a strong example of pattern and style blending.
The GPT image generator

it created a polished, stylish image that reflects mid-century fashion photography. The lighting, pose, and fabric movement match classic Hollywood portrait style. The pattern fits well into the dress, making a balanced and pleasing design.
This image shows GPT’s ability to create clear mood and style, especially with cinematic lighting and composition. However, like other models, it favors overall harmony over clear spatial details. While the image looks good and is well done, it doesn’t highlight texture and fabric details as the experiment intended.
Runway Gen-4

it created a bright, high-fashion image with strong lighting and flowing fabric. The model’s pose and expression fit well with the vintage style from the prompt, and the geometric pattern enhances the skirt’s shape and movement. The shadows and arrangement give a studio fashion shoot feel, adding realism and elegance.
Runway Gen-4 excels at making detailed, stylish fashion images with good focus on pose, light, and texture. However, like other models in this test, it blends the subject and background too much, creating a smooth look but less clear separation. This makes the image elegant and cinematic but less distinct between the patterned dress and the background, which was the experiment’s goal.
Gemini 2.5 (Nanobanana)

it created a clear, realistic, and well-balanced image. It accurately applied the pattern evenly on the dress, keeping the 1950s halter style. The lighting and pose show the classic Hollywood glamour from the prompt.
This test shows Gemini 2.5’s skill in smooth textures, good lighting, and precise patterns, making the image look polished and professional. However, like other models tested, it focuses on overall visual unity rather than separating the dress’s texture from its background. The result is stylish and cohesive but doesn’t clearly isolate the dress material.
Flux 1.1 Ultra (Raw)

it created a very realistic and cinematic image with strong light and shadow contrasts and detailed textures. The halter dress folds are well shown, with light reflecting off the pleats and fabric movement. The pixel-like texture gives a digital fashion feel, blending old and new styles.
The model excels in realistic lighting, detailed clothing, and a cinematic look, making the image feel polished and connected. However, like other models tested, it blends elements that should stay separate, reducing clear separation between the fabric and background despite controlled lighting.
Flux 1.1 Ultra

it created a visually striking, cinematic image with strong lighting and realistic surfaces. The model’s pose and style match the classic Hollywood look in the prompt, while the fabric’s grid pattern adds a modern, futuristic feel. The dress’s folds and pleats stand out thanks to the lighting.
This shows Flux 1.1 Ultra’s skill in making polished fashion images, especially with fabric shine, poses, and cinematic mood. However, like earlier versions, it has trouble clearly separating the subject from the background when using similar colors or patterns. The image looks balanced and well-lit but doesn’t fully achieve texture contrast and realistic materials.

Conclusion

Each AI image generator created different results from the same prompt and pattern, showing some common trends and unique features. This study helps understand how AI handles texture, lighting, and material details when combining text and images for fashion design.

Similarities

Across all nine models — Firefly 3, Firefly 3 Fast, Firefly 4, Firefly 4 Ultra, Flux 1.1 Ultra, Flux 1.1 Ultra Raw, Gemini 2.5 (Nanobanana), Runway Gen-4, and GPT Image — several consistent traits emerged:

Aesthetic Cohesion Over Separation: Each model favored a unified visual language, merging the dress texture with the background rather than creating substantial depth or material distinction.
Faithful Interpretation of Style: All generators successfully captured the 1950s glamour aesthetic, reflecting the prompt’s reference to Marilyn Monroe–inspired fashion through elegant posing, lighting, and composition.
Consistent Color Harmony: The blue tonal palette was well-preserved throughout, with most models using it to create mood, cohesion, and a cinematic quality.
High Visual Appeal: Despite technical variations, all outputs maintained substantial artistic value — each result could stand alone as a stylized piece of digital fashion imagery.

Differences

The distinctions between models primarily reflect differences in rendering precision, material realism, and stylistic approach:

Firefly Series (3–4 Ultra): These models excelled in pattern clarity and compositional harmony but tended to over-integrate the texture into both dress and background. The result leaned toward painterly or textile-like blending rather than fabric realism.
Flux 1.1 (Ultra and Raw): These versions achieved cinematic lighting and depth, handling folds and sheen beautifully. However, they also showed a slight tendency toward environmental merging, producing editorial-level polish with minimal separation.
Gemini 2.5 (Nanobanana): This generator produced the most balanced composition, with clean lighting and defined form, while still maintaining the same texture continuity that subtly merged figure and background.
Runway Gen-4: Stood out for its dynamic posing and studio-light realism, with well-defined shadows and movement, though it too favored overall cohesion over contrast.
GPT Image: Delivered one of the most visually refined and photorealistic interpretations, emphasizing clarity and fabric flow, yet followed the same merging pattern — prioritizing stylistic unity over textural isolation.

Synthesis

The experiment shows a common problem with current AI image generators: they focus more on blending things nicely rather than clearly showing different materials. Each model is good at lighting, composition, and mood, but few can clearly separate a patterned garment from its background.

This flaw reveals an interesting creative idea. The models see realism as design harmony, mixing the subject and background to create a painterly look like fashion illustrations rather than clear images. While they don’t meet the original technical goal, the results reveal how AI balances realism, unity, and artistic style.

Texture and Style Experiment, AI Generator Comparison

Firefly 4 Ultra

Firefly 4

Firefly Image 3 (fast)

Firefly Image 3

The GPT image generator

Runway Gen-4

Gemini 2.5 (Nanobanana)

Flux 1.1 Ultra (Raw)

Flux 1.1 Ultra

Conclusion

Similarities

Differences

Synthesis

Visual Consistency

Object Continuity

Thank You