1/6 Don't crash on non-UTF-8 ApiError bodies by gareth-ellis · Pull Request #2134 · elastic/rally

gareth-ellis · 2026-05-29T14:10:07Z

Summary

When an Elasticsearch ApiError's body is non-UTF-8 (e.g. binary protobuf returned by OTLP endpoints under coordinator-bytes backpressure), e.body.decode("utf-8") in execute_single raises UnicodeDecodeError and crashes the worker.

This PR replaces the six .decode("utf-8") call sites in the driver's ApiError handler with .decode("utf-8", errors="replace"), so undecodable bytes are replaced with U+FFFD (the Unicode replacement character) instead of crashing the worker. No semantic change for valid UTF-8 (the common case).
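The difference between the two decode modes can be seen in isolation. This is a minimal sketch using only built-in bytes methods; the byte string `raw` is an arbitrary stand-in for a binary error body, not a real OTLP response.

```python
# Arbitrary bytes standing in for a binary error body (assumption, not real OTLP output).
raw = b"\x0a\x03abc\xff\xfe"

# Strict decoding (the default) raises on the first invalid byte.
try:
    raw.decode("utf-8")
    strict_ok = True
except UnicodeDecodeError:
    strict_ok = False

# Lenient decoding substitutes U+FFFD for each undecodable byte and never raises.
lenient = raw.decode("utf-8", errors="replace")

# Valid UTF-8 decodes identically in both modes.
valid = "index not found".encode("utf-8")
same = valid.decode("utf-8") == valid.decode("utf-8", errors="replace")
```

Note that the lenient mode is lossy: the original bytes cannot be recovered from the decoded string, which is acceptable for a human-readable error message but not for round-tripping data.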

Part 1 of 6 in the OTLP ingest series, but a standalone hardening fix that's useful on its own. Has no dependencies on the rest of the series.

Series

1/6 Don't crash on non-UTF-8 ApiError bodies (#2134, this PR)
2/6 Add OTLP binary protobuf core IO and track preparation (#2135)
3/6 Add OtlpParamSource for OTLP corpora (#2136; depends on #2135)
4/6 Add OtlpIngest runner with backpressure-aware retries (#2137; depends on #2135)
5/6 Parallelize OTLP corpus generation; download compressed .pb when available (#2138; depends on #2135, #2136, #2137)
6/6 Support gzipped OTLP ingest requests (#2139; depends on #2138)

Test plan

All existing driver tests pass
pre-commit clean

🤖 Generated with Claude Code

The ApiError handler in execute_single() decodes `e.body`, `e.error`, and `e.info` as UTF-8 to build a human-readable error message. When the body is binary (e.g., binary protobuf returned by ES OTLP endpoints on 4xx/5xx), the strict decode raises UnicodeDecodeError, which crashes the worker mid-task. Switch the six decode() calls to use errors="replace" so undecodable bytes become U+FFFD instead of aborting the worker. No semantic change for valid UTF-8 (the common case). This is a latent bug independent of OTLP — any operation that surfaces a binary error body would have hit it.

Copilot

Pull request overview

This PR hardens execute_single in the driver so Elasticsearch ApiError handling does not crash when error bodies contain non-UTF-8 bytes.

Changes:

Updates ApiError body/error/info decoding to use UTF-8 replacement for undecodable bytes.
Adds an inline comment explaining the binary response-body motivation.


        if isinstance(e.body, bytes):
            # could be an empty body
-            if error_body := e.body.decode("utf-8"):
+            if error_body := e.body.decode("utf-8", errors="replace"):
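To show how the changed line fits into an error handler, here is a hedged sketch. `FakeApiError`, `_to_text`, and `describe_error` are hypothetical names invented for illustration; they are not Rally's driver code or the Elasticsearch client's real exception class, and the message format is an assumption.

```python
class FakeApiError(Exception):
    """Hypothetical stand-in for the client's ApiError; not the real class."""

    def __init__(self, status, body=None, error=None):
        super().__init__(status)
        self.status = status
        self.body = body
        self.error = error


def _to_text(value):
    """Return value as str, decoding bytes leniently so binary payloads cannot raise."""
    if isinstance(value, bytes):
        return value.decode("utf-8", errors="replace")
    return str(value)


def describe_error(e):
    """Build a human-readable description from the error's status and payload."""
    parts = [f"HTTP {e.status}"]
    if e.error:
        parts.append(_to_text(e.error))
    # could be an empty body
    if e.body and (error_body := _to_text(e.body)):
        parts.append(error_body)
    return " | ".join(parts)


binary_description = describe_error(FakeApiError(400, body=b"\xff"))
empty_description = describe_error(FakeApiError(404, body=b"", error="not_found"))
```

Routing every payload through one lenient helper is a design choice of this sketch; the PR itself edits each decode() call in place.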


pquentin

It's wild that OTLP ingest does this. A bug IMHO. Still approving to unblock you.

gareth-ellis · 2026-06-01T11:56:48Z

+        body = io.BytesIO(b"\xff")
+        str_literal = str(body)
+        error_meta = elastic_transport.ApiResponseMeta(
+            status=499,


@copilot why 499 and not 400 like the test is called? A server is unlikely to return a 499, since that is typically used as a client timeout

Updated in 835a0e1: changed that test fixture and expectation to use HTTP 400 so it matches the test intent.
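For readers who want to see the shape of such a regression test, the following is a rough sketch only. It does not reproduce the test added in this PR: `render_error_body` and `make_error` are hypothetical helpers, and the error object is a plain `SimpleNamespace` rather than a real client exception.

```python
from types import SimpleNamespace


def render_error_body(e):
    """Hypothetical helper under test: lenient decode of a bytes body."""
    if isinstance(e.body, bytes):
        return e.body.decode("utf-8", errors="replace")
    return str(e.body)


def make_error(status, body):
    """Hypothetical fixture exposing only the attributes the helper reads."""
    return SimpleNamespace(status=status, body=body)


def test_bad_request_with_non_utf8_body_does_not_raise():
    # HTTP 400 matches the test's intent: a server-side rejection, not a client timeout.
    e = make_error(status=400, body=b"\xff")
    assert e.status == 400
    assert render_error_body(e) == "\ufffd"


def test_valid_utf8_body_is_unchanged():
    e = make_error(status=400, body='{"error":"bad request"}'.encode("utf-8"))
    assert render_error_body(e) == '{"error":"bad request"}'
```

Both functions follow the pytest naming convention, so they would be collected automatically if placed in a test module; they can also be called directly.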

Copilot AI review requested due to automatic review settings May 29, 2026 14:10

gareth-ellis requested a review from a team as a code owner May 29, 2026 14:10

gareth-ellis mentioned this pull request May 29, 2026

2/6 Add OTLP binary protobuf core IO and track preparation #2135

Open

2 tasks

Copilot started reviewing on behalf of gareth-ellis May 29, 2026 14:11

Copilot AI reviewed May 29, 2026


Comment thread esrally/driver/driver.py


gareth-ellis changed the title ~~Don't crash on non-UTF-8 ApiError bodies~~ 1/6 Don't crash on non-UTF-8 ApiError bodies May 29, 2026

pquentin approved these changes Jun 1, 2026


Copilot started work on behalf of gareth-ellis June 1, 2026 11:41

Add non-UTF8 ApiError body execute_single test

34ac42e

Copilot finished work on behalf of gareth-ellis June 1, 2026 11:48

gareth-ellis commented Jun 1, 2026


Copilot started work on behalf of gareth-ellis June 1, 2026 11:56

Align non-UTF8 test with HTTP 400 status

835a0e1

Copilot finished work on behalf of gareth-ellis June 1, 2026 12:00

gareth-ellis enabled auto-merge (squash) June 1, 2026 12:03

gareth-ellis merged commit 61526a4 into master Jun 1, 2026
26 checks passed

gareth-ellis merged 3 commits into master from otlp-pr1-driver-utf8

4 participants
