The Best Box Score
Blog/·8 min read

The ABS Challenge System: What It Means for Umpire Accountability

MLB's Automated Ball-Strike System went live in 2026. Here's how it works, why our methodology already aligns with it, and what five years of pre-ABS data reveals about which umpires need it most.

absumpiresmethodology

MLB's Automated Ball-Strike System went live on Opening Day 2026. After four years of minor league testing, every ball and strike call in Major League Baseball can now be challenged by the batter, pitcher, or catcher with a tap of the cap.

This changes everything about umpire accountability. And it changes nothing about how we grade them.

How the ABS Challenge System Works

The rules are straightforward:

  • Who can challenge: The batter, pitcher, or catcher — nobody else. No manager involvement, no dugout assistance.
  • How to challenge: Tap your helmet or cap immediately after the call.
  • Challenge budget: Each team starts with 2 challenges per game. You only lose a challenge when the original call is upheld. Successful challenges are free.
  • Extra innings: Each team resets to 1 challenge regardless of how many they had left.
  • Technology: 12 Hawk-Eye cameras per ballpark, resolving challenges in seconds.

The ABS strike zone uses fixed proportions of each batter's measured height: the top at 53.5% and the bottom at 27%, without cleats. The plate is 17 inches wide, same as always. The system's margin of error is approximately 1/6 of an inch.

On Opening Day, there were 23 challenges across the league. Thirteen were overturned — a 56.5% success rate. Batters, pitchers, and catchers are already learning when it's worth spending one.

What This Means for Our Umpire Analysis

Here's what might surprise you: nothing about our methodology changes.

We've always graded umpires against the rulebook strike zone — the rectangle defined by the plate width and the batter's height. That's the same zone ABS uses. We've always used Hawk-Eye data from Baseball Savant for pitch location. That's the same tracking system ABS uses.

The probabilistic zone models that other grading systems use — where a pitch near the corner gets partial credit because "most umpires call it that way" — those were always measuring something different from what ABS now enforces. We were already measuring what ABS measures: did the call match the rulebook?

So every scorecard we've published since 2021 is directly comparable to ABS outcomes. Our missed call counts represent, roughly, the pool of pitches that could have been successfully challenged if the system had existed.

Five Years of Pre-ABS Baseline Data

This is where it gets interesting. We have umpire accuracy data for 12,449 games across 2021-2025 — the entire pre-ABS era with Statcast tracking. That dataset is now the historical baseline for measuring whether ABS actually improves calling accuracy.

The questions we can answer that nobody else can:

Do umpires get better knowing challenges exist? We'll compare 2026 accuracy distributions against the 2021-2025 baseline. If umpires tighten up their zones to avoid being overturned, we'll see it in the numbers within weeks.

Which umpires needed ABS the most? We already know. Our umpire leaderboard ranks every umpire by accuracy across five seasons. The ones at the bottom of that list — the ones who consistently called the widest or tightest zones — are the umpires whose games will be most affected by the challenge system.

Does the challenge system fix the handedness gap? We've documented that left-handed batters face a systematically different called zone than right-handed batters. ABS doesn't care about handedness. If challenges correct this bias, we'll see the LHB/RHB accuracy gap narrow in 2026.

Do challenges cluster in high-leverage moments? Our WPA and leverage index data lets us measure whether players challenge strategically — burning challenges in the 9th inning of a tie game rather than the 3rd inning of a blowout. Smart challenge usage should correlate with leverage.

The Challenge Strategy Question

Two challenges per game isn't many. With roughly 60-70 called pitches per game and a historical miss rate around 6-7%, there are typically 4-5 missed calls available to challenge. You can't challenge them all.

This creates a game-theory problem: when is a challenge worth the risk?

Our leverage index data suggests the answer. A missed call at LI 3.0+ (VERY HIGH leverage) swings win probability 3x more than average. A missed call at LI 0.5 (LOW leverage) barely matters. If you have data on the umpire's tendencies — which calls they tend to miss, which parts of the zone they expand or compress — you can make better challenge decisions.

That's exactly what our umpire profiles provide. Before the game starts, you can look up tonight's home plate umpire and see their historical accuracy, their handedness splits, and their zone tendency. A pitcher who knows the umpire runs a wide zone outside to lefties might save a challenge for that specific situation.

What We're Tracking Going Forward

Starting with the 2026 season, we're adding ABS context to our analysis:

  • Challenge rate per umpire — which umpires get challenged most often?
  • Overturn rate per umpire — and how does it correlate with our pre-ABS accuracy grades?
  • Challenge leverage — are challenges being used in high-leverage spots or wasted early?
  • Pre- vs. post-ABS accuracy comparison — the definitive answer to whether the system changes umpire behavior

Baseball Savant's ABS Challenge Dashboard is already publishing challenge data in real time. We'll integrate it with our existing umpire profiles as the season progresses.

The Bigger Picture

The ABS challenge system doesn't replace umpires. It gives players a tool to correct the most consequential mistakes. That's a meaningful distinction — and it's one that makes umpire analysis more important, not less.

Every challenge that gets overturned is a data point about where human judgment fails. Every challenge that gets upheld is a data point about where it succeeds. Our five years of pre-ABS data provide the context for understanding both.

The human element isn't going away. It's finally being measured.


Explore our umpire leaderboard to see five seasons of accuracy data — the pre-ABS baseline — or read our methodology to understand exactly how we grade every call.