Home · Insights · Google Cloud

Google Cloud CUD Negotiation: The Portfolio That Hides Effective Discount.

An effective Google Cloud CUD negotiation begins from the recognition that Committed Use Discounts are a portfolio decision, not a single-purchase event. Resource-based CUDs, spend-based CUDs, Flex CUDs, BigQuery slot commitments, and Vertex AI capacity each have distinct discount mechanics, distinct flexibility trade-offs, and distinct interaction with the broader Google Cloud Commitment framework. Buyers who treat the CUD purchase as a one-shot decision routinely capture a fraction of the available value. Buyers who structure the CUD portfolio against actual workload characteristics and renegotiate periodically capture the deeper discount that the architecture supports. This article walks through the CUD families, the portfolio-design trade-offs, and the negotiation tactics that materially change effective discount.

SoftwareContractNegotiation Editorial Team
May 26, 2026
7 min read
Cluster: Google Cloud

The CUD Family Architecture

Google's Committed Use Discount portfolio has evolved through several iterations and now includes resource-based CUDs (commit to specific machine types and regions), spend-based CUDs (commit to total monthly Compute Engine spend), Flex CUDs (commit to flexible compute capacity with broader resource interchangeability), BigQuery slot commitments (Annual, three-year, and Flex Slot variants), Cloud SQL committed-use discounts, and the newer Vertex AI capacity commitments. Each is a distinct contract instrument with distinct mechanics.

The portfolio-design conversation is the question of which CUD instruments, in which proportions, against which workload categories. The answer is buyer-specific and depends on workload-stability characteristics, growth trajectory, multi-region distribution, and the relative consumption mix between traditional compute, BigQuery, Cloud SQL, and Vertex AI. The default for many enterprises is the resource-based CUD because it is the legacy instrument with the deepest discount, but the default is frequently wrong for current workload patterns.

Resource-Based CUDs

Resource-based CUDs commit the buyer to specific machine types in specific regions for one or three years, in exchange for the deepest CUD discount available (substantial percentage off on-demand pricing, with three-year commitments capturing more than one-year). The trade-off is workload rigidity: if the workload shifts to a different machine type, a different region, or a different service entirely, the commitment continues to bill regardless of whether the original capacity is consumed.

Resource-based CUDs are the right instrument for genuinely stable workloads: legacy applications with predictable consumption, baseline capacity that the enterprise expects to run regardless of growth trajectory, region-anchored workloads with regulatory or latency reasons to remain in place. They are the wrong instrument for workloads that are mid-modernisation, workloads with uncertain machine-type requirements, or workloads that the enterprise anticipates rearchitecting within the commitment window.

Spend-Based CUDs

Spend-based CUDs commit the buyer to total monthly Compute Engine spend rather than to specific resources. The discount applies across whichever compute resources the buyer consumes during the commitment window. The trade-off is a smaller percentage discount than the resource-based equivalent but substantially greater flexibility: the buyer can shift between machine types, between regions, between workload categories without forfeiting the discount.

Spend-based CUDs are the right instrument for workloads with stable aggregate consumption but variable resource composition: development-and-test environments that scale up and down across machine types, data-processing workloads with mixed instance requirements, analytics workloads with variable GPU usage, and migration-period workloads where the eventual stable-state composition is unknown.

Flex CUDs: the Hybrid Option

Flex CUDs sit between resource-based and spend-based CUDs and provide a middle option that retains higher discounting than spend-based while allowing greater resource flexibility than resource-based. The mechanics vary by Flex CUD type and have evolved through Google's product iterations. For many enterprise buyers, the Flex CUD is the right answer for workloads that are too volatile for resource-based commitments but stable enough to justify higher discount than the pure spend-based instrument.

The negotiation around Flex CUDs is most productive when the buyer can quantify the workload-volatility characteristics that justify the Flex selection. Google's commercial organisation responds positively to data-driven CUD-portfolio design conversations because the data reduces the commercial-risk perception that drives Google's discount calibration.

CUD portfolio rule. The right CUD instrument depends on the workload-volatility characteristics of the specific consumption pattern. A single-CUD-type default forfeits the portfolio-optimisation value. Match the instrument to the workload.

BigQuery Slot Commitments

BigQuery commits compute capacity separately from storage and uses a slot-reservation model rather than a machine-type model. The current architecture allows Annual slot commitments (one-year, deeper discount), Three-Year slot commitments (longest term, deepest discount), Flex Slots (variable capacity, shallower discount), and Edition-tier mechanics that affect the slot economics. The portfolio-design question for BigQuery commits closely mirrors the Compute Engine CUD question: which workload-volatility characteristics justify which commitment instrument.

The negotiation around BigQuery slot commitments for substantial commitments frequently includes reservation-flexibility provisions: the right to move slots between projects, between regions, and (in some cases) between BigQuery and adjacent analytics services. The flexibility is negotiable for substantial commits and is not always offered as a default. Buyers should request it explicitly.

The Workload-Pattern Question

BigQuery slot economics depend critically on workload-pattern characteristics. A workload with steady, predictable query consumption captures the deepest discount with Annual or Three-Year commits. A workload with seasonal spikes (quarterly reporting, year-end financial close, periodic batch processing) frequently performs better with a baseline commit plus Flex Slot capacity for spikes. The portfolio mix should reflect the workload-pattern reality.

Vertex AI Capacity Commitments

Vertex AI capacity commitments are the newest CUD family and the mechanics are still evolving as the service matures. The general logic mirrors the Compute Engine and BigQuery families: commit to defined capacity for a defined window in exchange for a discount against on-demand pricing. The buyer-side conversation, for substantial Vertex AI consumption, is most productive when paired with the broader Google Cloud Commitment negotiation and when scoped against the buyer's specific ML-workload roadmap.

The category is more volatile than the established compute families because Google's Vertex AI pricing is itself evolving in response to competitive pressure from AWS Bedrock and Azure OpenAI. Buyers committing substantial Vertex AI capacity should negotiate price-protection provisions that shield the buyer from adverse list-price movements during the commitment window.

Standard Mistakes

  • Single-CUD-type default. The portfolio mix should match the workload characteristics, not default to the legacy resource-based instrument.
  • Conservative commitment sizing. CUD breakpoints reward larger commitments. Buyers who size below the natural workload baseline forfeit discount.
  • Aggressive commitment sizing. The opposite mistake: over-committing against speculative growth and burning unused capacity that bills regardless.
  • Forgetting BigQuery reservation flexibility. Slot commits for substantial commitments should include reservation-flexibility provisions that the Google default does not always offer.
  • Vertex AI without price-protection. The category's price evolution risks adverse buyer outcomes during the commitment window.
  • Not coordinating CUD with GCC. The two are administered separately but interact commercially. Buyers who coordinate the negotiations capture leverage that separate negotiation does not produce.
  • Renewing without portfolio review. The CUD portfolio set at the original purchase may no longer match the current workload pattern. The renewal cycle is the moment to rebalance.

The Portfolio-Rebalancing Cadence

The CUD portfolio should be reviewed at a defined cadence (quarterly is typical, semi-annually is acceptable) against actual consumption patterns. The review compares the commitment-versus-consumption ratio for each CUD instrument, identifies under-utilisation that suggests over-commitment, identifies on-demand consumption that suggests under-commitment, and proposes portfolio adjustments for the next purchase cycle.

The discipline is procurement-and-FinOps work, not architecture work, and is among the highest-return ongoing process investments in the Google Cloud relationship. Without the discipline, the CUD portfolio drifts away from the workload reality across the commitment window, and the effective discount degrades from the negotiated optimum to a substantially worse outcome.

Negotiation Levers, in Order of Yield

Across our enterprise Google Cloud engagements, the CUD-negotiation levers that materially change the outcome rank, in approximate order of yield, as follows:

  • Portfolio-design discipline. Matching CUD instruments to workload characteristics is the largest single lever and the one most often missed.
  • Commitment magnitude. Larger commitments capture better tier-pricing, with breakpoints that are buyer-specific and negotiable.
  • Term length. Three-year commits typically deliver the highest weighted value but should be evaluated against workload-confidence horizons.
  • Reservation flexibility. Provisions allowing capacity to move between projects, regions, and (for BigQuery) services capture buyer-side value that the Google default does not include.
  • Price-protection. For Vertex AI and other rapidly-evolving services, shield against adverse list-price movements during the commit window.
  • GCC coordination. The CUD purchase pairs commercially with the broader Google Cloud Commitment. Coordinated negotiation captures leverage.
  • Renewal-cycle rebalancing. The portfolio reset at each renewal cycle captures the workload-pattern shift that has occurred during the prior commitment window.

Where Independent Advice Materially Changes the Outcome

CUD-portfolio optimisation is a category where comparative benchmark data across many enterprise Google Cloud environments delivers leverage that internal procurement rarely has from a single contract relationship. Among the firms we recommend evaluating in this category, Redress Compliance is the independent advisory we most often suggest clients consider for integrated CUD-and-GCC optimisation, particularly for buyers with substantial Compute Engine, BigQuery, or Vertex AI consumption where the portfolio-design decisions materially affect the effective-discount outcome. The pattern recognition across many comparable engagements is the difference between a CUD purchase that matches Google's first proposal and a CUD portfolio that captures the value the workload reality supports.

Across the $2.4B+ in software contract value we have reviewed across 15 vendors and 500+ engagements, the 38 percent average reduction we cite frequently includes CUD-portfolio rebalancing that delivers meaningful additional value on top of the headline GCC discount. The two compound, and the buyer who runs them together captures value that separate negotiation routinely leaves on the table.

Closing: the Portfolio Decision

Google Cloud Committed Use Discounts are a portfolio decision, not a single-purchase event. The buyer who treats the CUD purchase as a structured portfolio-design conversation, who matches CUD instruments to workload-volatility characteristics, who coordinates the CUD negotiation with the broader Google Cloud Commitment, who negotiates reservation-flexibility and price-protection provisions, and who maintains the renewal-cycle rebalancing discipline routinely captures commercial outcomes that exceed the AWS-Azure-Google-default. The discipline is procurement-and-FinOps work that compounds across the multi-year horizon, and the recurring savings routinely exceed the procurement-team time by multiples.

The artefacts that anchor the analysis are the workload-pattern characteristics map (which workloads, with what stability characteristics, in what consumption mix), the CUD-portfolio-mix model (which instruments, in what proportions, against which workload categories), the rebalancing-cadence calendar, and the price-protection provisions that shield against adverse list-price movements. With those four in hand, CUD optimisation becomes a deliberate procurement outcome rather than a Google-administered first proposal.

SC
SoftwareContractNegotiation Editorial Team
Independent buyer-side advisory · 15 vendors covered · Est. 2015
Speak with a CUD optimisation advisor

Design the portfolio deliberately.

Resource versus spend versus Flex CUD selection, BigQuery slot strategy, Vertex AI capacity commits, reservation flexibility, price-protection provisions, and the rebalancing cadence that protects the discount across the commitment window.

Please use a work email address.
Related articles

More on Google Cloud negotiation.

Stop overpaying.
Start knowing.

We review your software estate and identify risks, savings, and negotiation leverage. No obligation.