Cloud cost optimization vs. leaving: where each actually wins

TL;DR. There’s a large, fast-growing industry — “cloud cost optimization,” or FinOps (the discipline of managing cloud spend) — whose whole job is to cut your AWS, Azure or Google bill without moving you off it. It works: tools like ProsperOps, Vantage, CloudHealth, Spot, Cast AI and consultancies like the Duckbill Group routinely shave 20–40% off a bill through commitment discounts and waste cleanup. But there’s a structural ceiling: every one of these levers discounts the meter — it doesn’t remove it. When your bill is dominated by egress and steady, always-on compute, no amount of optimization changes the underlying math, and that’s when leaving wins. The honest answer for most teams: optimize first (a lot of it is free money), then leave the workloads where optimization runs out of road.

Two opposite ways to cut a cloud bill

When your AWS bill gets scary, you’ll be pitched two opposite solutions:

  1. Optimize and stay — keep everything on AWS, but pay less for it. This is the FinOps / cost-optimization industry.
  2. Leave (“cloud repatriation”) — move the expensive parts onto flat-rate or owned infrastructure.

They’re usually framed as enemies. They’re not — they’re steps in order. This page is the honest map of what the first one can and can’t do, so you know when it’s enough and when it isn’t.

What “optimize and stay” actually is (a real, big industry)

This isn’t a cottage trade. The cloud cost optimization / FinOps market was roughly $13–15 billion in 2024–25 and is growing ~11–13% a year (Grand View Research; MarketsandMarkets). It exists because cloud waste is enormous — the FinOps Foundation’s State of FinOps 2026 survey (1,192 organizations, $83 billion in combined cloud spend) still ranks cutting waste as the #1 priority.

It pulls two distinct levers:

Who does what:

TypeWhat they sellExamples
Visibility dashboardsShow where the money goesVantage, CloudHealth, Cloudability, CloudZero
Savings / commitment managersAuto-buy and manage your discountsProsperOps, Zesty, Spot, nOps
Managed service providers (MSPs)Run your cloud and optimize it for youRackspace, Bespin Global, NTT Data
Boutique consultanciesExpert audits and retainersThe Duckbill Group

They’re good at this. If you’ve never bought a Reserved Instance, they’ll find you real money — only about half of companies use AWS Reserved Instances at all, and fewer use them well (Flexera; NetApp). Do this first. It’s often the cheapest 20–30% you’ll ever save.

How they charge — and the incentive to watch

The pricing model tells you whose side a vendor is on. Four common ones:

Pricing modelExampleWhat it does to incentives
A cut of the savings (gainshare)ProsperOps takes ~20–35% of the savings it produces✅ Aligned — they only earn when you save. But it’s a permanent tax on the savings, capped at what discounts can do.
Flat monthly software feeMost dashboards➖ Neutral — you pay the same whether they save you a lot or a little.
A percentage of your cloud spendSome MSPs⚠️ Backwards — the vendor earns more when your bill is bigger. Read these contracts closely.
Fixed fee / retainerThe Duckbill Group (and my own assessment)✅ Predictable, no conflict — you know the cost up front.

The one to watch is percentage-of-spend: a provider paid a slice of your bill has a quiet reason not to shrink it too far. Gainshare is honest, but remember it’s a recurring cut of the savings — over a few years, paying someone 30% of your discount forever can cost more than fixing the problem once.

The ceiling: optimization discounts the meter — it can’t remove it

Here’s the part the optimization industry doesn’t put on its homepage — in its own governing body’s words. The FinOps Foundation describes commitment discounts (Reserved Instances, Savings Plans) as “coupons”: they lower the rate applied to your metered usage — they do not change the fact that you’re metered, and they don’t reserve or own anything.

So the whole optimize-and-stay model, by design, cannot:

And the easy wins are running out. That same State of FinOps 2026 survey reports practitioners have “hit the big rocks of waste” — diminishing returns, with the savings that remain smaller and more expensive to chase. Optimization is a one-time step down, not a gentler slope. Your bill still grows with your success; it just grows from a slightly lower starting point.

When optimizing and staying is genuinely the right call

This is not “AWS bad, leave always.” For plenty of workloads, staying and optimizing is correct — and a good advisor will tell you so:

If optimization gets your bill where it needs to be, you’re done — don’t move anything. That’s the honest answer, and it’s the one I give in an assessment whenever it’s true.

When leaving wins

Leaving beats optimizing in one specific case: a steady, always-on workload whose bill is dominated by egress and 24/7 compute. Steady workloads get no benefit from the cloud’s pay-as-you-go premium — you’re paying a rental markup for elasticity you don’t use. Optimization discounts that markup; it can’t delete it.

The clearest public example is 37signals (Basecamp / HEY), who left AWS and cut infrastructure spend from about $3.2 million to under $1 million a year — including replacing a ~$1.5M/year S3 bill with 18 petabytes of their own storage hardware, and saving ~$2M/year on compute after buying ~$700k of servers. (Honest caveat: these are the founder’s self-reported figures, comparing cloud bills to their on-prem operating cost; a full accounting must also count hardware refresh, added ops staff, and power/cooling. Treat it as a disclosed real-world case, not an audited universal number.)

The pattern: when the meter — not the waste — is the problem, the only fix is to stop being metered.

The honest decision

Your situationThe right move
Never bought Savings Plans; idle resources everywhere; no CDNOptimize first — free money on the table
Spiky / seasonal load, or a bill that’s mostly managed servicesOptimize and stay — the cloud’s model fits you
Already optimized, but the bill is steady and dominated by egress + 24/7 computeLeave that part — optimization has run out of road
Not sure which of these is actually youGet the numbers — that’s what a teardown is for

Optimize-and-stay and leaving aren’t rivals; they’re a sequence. Cut the waste, buy the discounts, put a cache up front — and then, if a steady, traffic-heavy core is still bleeding at metered rates, move that core onto flat-rate or owned infrastructure. Most teams need both, in that order.

Quick wins you can do this week

  1. Check your commitment coverage. If you’re not using Reserved Instances / Savings Plans on steady compute, that’s the fastest discount there is. (Or let a gainshare tool do it — just know it takes a permanent cut.)
  2. Find your idle and oversized resources. Turn off what’s unused; rightsize what’s overbuilt.
  3. Put a cache in front of your traffic to cut egress before you do anything structural.
  4. Then break out the two lines optimization can’t fix — egress and steady 24/7 compute — and price them on flat-rate infrastructure. If those two are most of your bill, optimization won’t be enough.

Not sure whether to optimize and stay or move the bleeding parts off — send me your cloud bill and I’ll show you which lines a FinOps tool can fix and which ones only leaving will, free, in 24 hours. Read by me, never shared. Or see the Cloud-Exit Assessment for the full decision with real numbers.

Sources

Don't miss new posts

I publish honest, sourced breakdowns of cloud-exit economics — egress, storage, monitoring, reliability — and the occasional announcement. Leave your email and I'll let you know when something new goes up.

Double opt-in — you'll get one email to confirm. No spam, unsubscribe anytime. Read by me, never shared.