250+ images generated by 5 AI agents with different personas. The verdict: Only 4 of 6 models work. Nano Banana is the speed king. FLUX.2 is completely broken.
Split comparison image showing 4 AI-generated art styles side by side
Hero image for model comparison post
A majestic phoenix rising from flames - 3-model comparison (Nano Banana, Nano Banana Pro, Recraft V3)
Comparison hero showing same prompt across top 3 models
Testing 6 models with 5 AI agents (250+ images):
Bottom line: Just use Nano Banana. It works.
We put 5 AI agents with distinct personas through rigorous testing of 6 image generation models. Each agent generated 50+ images over simulated 30-minute sessions. Here's what we learned.
| Agent | Persona | Priority |
|---|---|---|
| 🚀 Indie Hacker | Ship fast, budget-conscious | Speed + Cost |
| 🎨 Creative Director | Client-ready quality | Visual Excellence |
| 🔬 ML Engineer | Technical deep-dive | API Quality |
| 🌟 First-Timer | Just discovered AI art | Ease of Use |
| 🏭 Production Lead | Enterprise reliability | Scalability |
"THIS IS MY DEFAULT. Fast, cheap, good quality. 9/10 images are usable. For $0.039 this is INSANE value."
"Product shot (earbuds): Clean white background, nice shadows. Looks professional. Could be on a landing page TODAY."
"Nano Banana Pro: Quality IS better than regular Nano, but NOT 4X better. Maybe 1.5X better. For an indie hacker watching every dollar, this is NOT worth it."
"OH WOW! That was SUPER EASY! Just type what I want + model name = beautiful image!"
"Every single working image blew my mind: The dragon flying over a castle - EPIC! The phoenix with flames - AMAZING!"
"NANO-BANANA IS MY FAVORITE! It just WORKS every time! Fast results! Beautiful quality! Never gave me weird errors!"
On FLUX errors: "Wait, what's a safety_tolerance?? I didn't even SET that parameter!"
"After extensive testing across multiple models with 58+ images... I can confidently say: NONE of these models are consistently client-ready for agency work."
"Text accuracy is surprisingly GOOD (90%+ success rate). But design consistency is WILDLY INCONSISTENT."
On Recraft V3: "The Art Deco geometric patterns are STUNNING. Feels like real-world environmental graphics."
On a Nano Banana poster: "GRADE: A+ (Client-ready, no edits needed)"
All 5 agents reported the same issue during testing. The flagship FLUX.2 models were completely non-functional due to a safety_tolerance parameter bug.
Error: body.safety_tolerance: unexpected value; permitted: '1', '2', '3', '4', '5'The ML Engineer traced it to the MCP wrapper sending an integer instead of a string. UPDATE: This bug has been fixed and verified working in production!
$0.039/image | 2-3 seconds | 100% reliability
Every agent loved it. The Indie Hacker called it "INSANE value." The First-Timer wrote it a love letter. The Production Lead ranked it #2 for enterprise use.
Best for: Rapid iteration, budget projects, general use Standout: Text rendering actually works!
$0.15/image | 3-4 seconds | 100% reliability
4x the price, but the Creative Director says it's "the only model delivering client-ready assets." Best anime/illustration quality across all tests.
Best for: Marketing materials, premium content, detailed illustrations Caveat: Indie Hacker says "NOT worth 4x the cost for most use cases"
$0.04/image | 5-7 seconds | 100% reliability
80+ style presets from pixel art to vector illustrations. The Creative Director generated assets in 12 different styles. Best text rendering in class.
Best for: Typography, diagrams, branded content, vector work Limitation: No batch generation (1 image per request)
Unknown pricing | 1.2 seconds | Unreliable
Fastest model tested (1.2s per image). Quality rivals Nano Banana. But the pricing uncertainty is a dealbreaker — multiple agents hit "financial safety" blocks.
Best for: Speed testing only (until pricing is resolved) Warning: Production Lead says "CAN'T RECOMMEND"
Pricing TBD | Speed TBD | NOW FUNCTIONAL
These models failed 100% during testing due to a bug that has since been fixed. Initial post-fix testing shows they work correctly now. Full benchmarking pending.
aspect_ratio vs image_size confusion| Model | Success Rate | Avg Cost | Speed | Client-Ready* |
|---|---|---|---|---|
| Nano Banana | 100% | $0.039 | 2-3s | ~60% |
| Nano Banana Pro | 100% | $0.15 | 3-4s | ~85% |
| Recraft V3 | 100% | $0.04 | 5-7s | ~75% |
| Z-Image Turbo | 90% | Unknown | 1.2s | ~65% |
| FLUX.2 Pro | TBD | TBD | TBD | |
| FLUX.2 Flex | TBD | TBD | TBD |
*Creative Director's assessment
Use: Nano Banana (90%) + Recraft V3 (10%)
Use: Nano Banana Pro (70%) + Recraft V3 (30%)
Use: Nano Banana Pro (primary) + Nano Banana (fallback)
Just use Nano Banana.
| Model | Speed | Cost | Quality | Reliability | Overall |
|---|---|---|---|---|---|
| Nano Banana | ⭐⭐⭐⭐⭐ | ⭐⭐⭐⭐⭐ | ⭐⭐⭐⭐ | ⭐⭐⭐⭐⭐ | 4.75/5 |
| Recraft V3 | ⭐⭐⭐ | ⭐⭐⭐⭐ | ⭐⭐⭐⭐⭐ | ⭐⭐⭐⭐⭐ | 4.25/5 |
| Nano Banana Pro | ⭐⭐⭐⭐ | ⭐⭐ | ⭐⭐⭐⭐⭐ | ⭐⭐⭐⭐⭐ | 4.0/5 |
| Z-Image Turbo | ⭐⭐⭐⭐⭐ | ❓ | ⭐⭐⭐⭐ | ⭐⭐⭐ | 3.5/5 |
| FLUX.2 Pro | ⭐⭐⭐⭐ | ❓ | ⭐⭐⭐⭐ | ✅ Fixed | Pending |
| FLUX.2 Flex | ⭐⭐⭐⭐ | ❓ | ⭐⭐⭐⭐ | ✅ Fixed | Pending |
Testing conducted November 27, 2025 FLUX.2 bug fix deployed November 27, 2025