db: Smooth out IO from flushing L0 and compaction by andrewbaptist · Pull Request #2004 · cockroachdb/pebble

andrewbaptist · 2022-10-12T14:44:12Z

This PR adds a smoother which monitors the average time for flushing and compaction and paces future flush / compaction loops to attempt to have a consistent IO rate at all times rather than being spikey.

Spikey IO can result in saturating the underlying device which then slows down writes to the WAL. By having a consistent rate of flushing and compaction the P99 latency is greatly reduced.

cockroach-teamcity · 2022-10-12T14:44:21Z

This change is

sumeerbhola

Reviewed 1 of 5 files at r1, 4 of 5 files at r2, all commit messages.
Reviewable status: 5 of 6 files reviewed, 2 unresolved discussions (waiting on @andrewbaptist)

smoother.go line 55 at r2 (raw file):

				// Every 100 iterations, update the estimated utilization under lock
				if totalSamples == numSamples {
					utilRunning := float64(sampleRunning) / float64(totalSamples)

s.countRunning and s.sleepingCount can each be > 1, so I don't quite understand the logic behind dividing by totalSamples, which is only being incremented by 1 for each tick.

If both sampleRunning and sampleSleeping are 0, we would still tick and compute util=0 and slowly shrink s.mu.estimatedUtilization to 0.1, yes? What will cause it to increase above 0.1? Seems to me that the sleeps will keep it at 0.1.

It seems to me that this smoother is not aware of the work backlog or the resource availability. Am I missing something?

smoother.go line 58 at r2 (raw file):

					utilSleeping := float64(sampleSleeping) / float64(totalSamples)
					// Add all the running time and half the sleeping time.
					util := utilRunning + utilSleeping/2

why adding utilSleeping/2?

andrewbaptist · 2022-10-14T16:23:58Z

Data on the performance impact of this change on KV50

Unthrottled P99 (P90 of P99s over the window)
10ms -> 6ms

Throttled P99
130ms -> 84ms

During index creation P99
352ms -> 204ms

Throughput, CPU util, LSM health, and most other metrics are very similar.

prometheus-patched.tar.gz
prometheus-orig.tar.gz

patched_cockroach_workload_run_kv.log
orig_cockroach_workload_run_kv.log

Image showing the difference in P99

https://docs.google.com/spreadsheets/d/1GmUOc69d9r-4GpS_n176Pttw0JhCzgOpAcdfs1ks_WI/edit#gid=1299047713

smoother: integrate into code

andrewbaptist force-pushed the 20221012.io-smoother branch 15 times, most recently from c7480bb to 0c4fa6e Compare October 13, 2022 15:27

sumeerbhola requested changes Oct 13, 2022

View reviewed changes

andrewbaptist force-pushed the 20221012.io-smoother branch 5 times, most recently from 2a3cc9f to dedcb58 Compare October 14, 2022 14:55

andrewbaptist force-pushed the 20221012.io-smoother branch 6 times, most recently from 3852956 to c2f7610 Compare October 14, 2022 19:23

andrewbaptist force-pushed the 20221012.io-smoother branch from c2f7610 to f9d3c11 Compare April 13, 2023 15:10

andrewbaptist force-pushed the 20221012.io-smoother branch from f9d3c11 to 5730a26 Compare April 13, 2023 19:11

andrewbaptist force-pushed the 20221012.io-smoother branch from 5730a26 to 732f64e Compare April 21, 2023 21:00

andrewbaptist force-pushed the 20221012.io-smoother branch from 732f64e to 3ab575a Compare July 11, 2024 19:58

pebble: smoothing code

cb86af8

smoother: integrate into code

andrewbaptist force-pushed the 20221012.io-smoother branch from 3ab575a to cb86af8 Compare July 12, 2024 12:20

andrewbaptist mentioned this pull request Oct 31, 2024

storage: smooth pebble compaction cockroachdb/cockroach#133948

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

db: Smooth out IO from flushing L0 and compaction#2004

db: Smooth out IO from flushing L0 and compaction#2004
andrewbaptist wants to merge 1 commit into
cockroachdb:masterfrom
andrewbaptist:20221012.io-smoother

andrewbaptist commented Oct 12, 2022

Uh oh!

cockroach-teamcity commented Oct 12, 2022

Uh oh!

sumeerbhola left a comment

Uh oh!

andrewbaptist commented Oct 14, 2022 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

andrewbaptist commented Oct 12, 2022

Uh oh!

cockroach-teamcity commented Oct 12, 2022

Uh oh!

sumeerbhola left a comment

Choose a reason for hiding this comment

Uh oh!

andrewbaptist commented Oct 14, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

andrewbaptist commented Oct 14, 2022 •

edited

Loading