Most solo operators experimenting with AI image generation start with a web interface—DALL·E’s playground, Midjourney’s Discord bot, or Stability AI’s DreamStudio. The pricing feels simple: buy credits, burn through them, top up when you run out.
Then you scale. You want to automate thumbnail generation for blog posts, create social assets in bulk, or build a tool that generates images on demand. Suddenly you’re looking at API pricing, and the math gets complicated fast.
Here’s what actually happens when you move from casual generation to programmatic use, with real numbers from three major platforms.
DALL·E 3: pay per resolution and quality tier
OpenAI’s DALL·E 3 API charges based on image resolution and quality setting. As of May 2026, standard quality at 1024×1024 costs $0.040 per image. HD quality at the same resolution jumps to $0.080. If you drop to 1024×1792 (portrait or landscape), HD pricing climbs to $0.120 per image.
That means 1,000 standard blog thumbnails cost $40. If you want HD quality for each, that’s $80. For a daily newsletter with five images per issue, you’re looking at $1.20 per send in HD, or $730 per month if you publish Monday through Friday.
DALL·E 3 doesn’t offer volume discounts. You pay the same rate whether you generate ten images or ten thousand. The API is fast—typically under ten seconds per generation—but there’s no batch pricing, no prepaid tiers, and no way to lock in a lower rate.
Midjourney: seat-based pricing, not per-image
Midjourney doesn’t sell API access the way OpenAI does. Instead, you subscribe to a plan that gives you a monthly GPU time allowance. The Basic plan costs $10/month for roughly 200 images (about 3.3 hours of GPU time). The Standard plan is $30/month for around 900 images (15 hours). Pro runs $60/month for 1,800 images (30 hours), with an option to buy additional GPU hours at $4 per hour.
If you’re automating image generation, Midjourney’s Discord-first architecture creates friction. There’s no official REST API yet. Third-party wrappers exist, but they scrape the Discord bot and risk rate limits or account suspension. For reliable programmatic use, Midjourney isn’t viable—even though the per-image cost on a Standard plan works out to about $0.033, cheaper than DALL·E 3.
Stable Diffusion: self-hosting vs. hosted APIs
Stable Diffusion is open-source, which changes the cost structure entirely. You can run it locally or on your own cloud instance, paying only for compute. A mid-tier GPU instance on AWS (g5.xlarge with an NVIDIA A10G) costs around $1.006 per hour on-demand. If you generate 100 images per hour, that’s roughly $0.01 per image—75% cheaper than DALL·E 3 standard quality.
But self-hosting requires setup: installing dependencies, managing model weights, handling queues, and monitoring uptime. For solo operators generating fewer than 500 images a month, the overhead usually isn’t worth it.
Hosted Stable Diffusion APIs solve this. Stability AI’s own API charges $0.01 per image for SDXL (1024×1024). Replicate offers SDXL at $0.0055 per image, billed per compute second. Both are significantly cheaper than DALL·E 3, but image quality and prompt adherence vary more widely. You’ll burn extra generations refining prompts.
Hidden costs: retries, storage, and moderation
Every AI image API occasionally returns unusable output—cropped faces, garbled text, or results that ignore your prompt entirely. DALL·E 3 is the most reliable, but you’ll still retry 5–10% of generations. Stable Diffusion can require three or four attempts to get a usable image, especially with complex prompts.
Factor retries into your budget. If your effective cost per usable image is 1.2× the API’s listed price, a $0.01 Stable Diffusion call becomes $0.012. A $0.04 DALL·E call becomes $0.048.
Storage adds up too. A single 1024×1024 PNG averages 1.5–2 MB. Generate 10,000 images and you’re storing 20 GB. At $0.023/GB/month on AWS S3, that’s $0.46/month—not huge, but it scales linearly. If you’re generating images for a public-facing tool, you’ll also need a CDN. Cloudflare’s free tier works for light use; beyond that, budget $0.01–0.02 per GB transferred.
Content moderation is another variable cost. DALL·E 3 includes built-in filtering, but Stable Diffusion doesn’t. If you’re accepting user prompts, you’ll need a moderation layer—either OpenAI’s moderation endpoint ($0.0001 per request) or a third-party service like Sightengine, which starts at $39/month for 5,000 images.
When self-hosting makes sense
Self-hosting Stable Diffusion pays off when you’re generating more than 2,000 images per month and can batch them efficiently. Spin up a GPU instance, queue 500 generations, process them in parallel, then shut the instance down. You’ll pay for an hour or two of compute instead of thousands of individual API calls.
For sporadic use—ten images one day, none for a week—stick with a hosted API. The convenience premium is worth it.
If you’re choosing between DALL·E 3 and Stable Diffusion APIs, run a quality test first. Generate twenty images with identical prompts on both platforms. If DALL·E 3 nails the prompt 90% of the time and Stable Diffusion needs three tries per usable image, DALL·E’s 4× higher price might still be cheaper per good output.
Want more breakdowns like this? Subscribe to One Two Three Send for weekly operator-focused analysis of tools, pricing, and infrastructure decisions.
