BigQuery Adaptive Repartitioning

How BigQuery Adaptive Repartitioning Works

AdminFollow

5 min•Feb 28, 2026

Views - 13

BigQuery Adaptive Repartitioning

? The Core Problem

During shuffle:

Hash(key) → Partition → Slot

If key distribution is uneven:

Some partitions are huge
Some are tiny
Slowest partition determines stage runtime

This causes:

Memory pressure
Spill
Stragglers
Stage reattempts

? Adaptive Repartitioning (Conceptually)

BigQuery does dynamic repartitioning during execution when:

A partition grows too large
A worker becomes a straggler
Memory pressure crosses threshold

What Happens Internally

Runtime detects skew
Heavy partition is split into sub-partitions
Work is redistributed across additional slots
Slow stage rebalances

This is sometimes called dynamic fan-out.

? When It Triggers

Large GROUP BY cardinality
Hot join keys
Window functions on skewed keys
Large DISTINCT

? Tradeoffs

Adaptive repartitioning:

✅ Reduces worst-case skew
❌ Increases shuffle traffic
❌ Consumes more slots
❌ Increases slot-ms cost

So even when “fixed,” skew is still expensive.

? Important Insight

At PB scale:

Preventing skew is 10x cheaper than letting adaptive repartitioning fix it.

Because repartitioning multiplies network IO.

Comments (0)

No comments yet.

Learningdhara Community LLP provide expert teaching, guidance and consulting services. Over 20 years of experience we ensure you always getting the good guidance from the top people in the entire of IT industry.

Course

Service

Get In Touch

India Presence: Hadapsar, Pune, Maharashtra, 411028
Contact: +91-7541-942-682
Canada Presence: 47, Robert Parkinson Drive, Brampton ( Ontario ), L7A0Y2
US Presence: 1800 Silas Deane Hwy, Rocky Hill, CT 06067
support@learningdhara.com

© Copyright 2024. All Rights Reserved by Learningdhara Community LLP

Terms & Conditions FAQ Disclaimer Support