checker, server: support split-scatter for MCS keyspaces by lhy1024 · Pull Request #10648 · tikv/pd

lhy1024 · 2026-05-09T03:09:21Z

What problem does this PR solve?

Issue Number: ref #10592

This PR follows up #10621 and #10652. The max-keyspace region-bound fix has been split out and merged by #10665, so this PR now only keeps the remaining keyspace-aware split-scatter changes.

What is changed and how does it work?

Scope in this PR:

Add keyspace.ParseKeyspacePrefix so split-scatter can decode raw keyspace prefixes through the shared keyspace utility instead of duplicating prefix parsing logic.
Split-scatter range hints understand TiDB txn keyspace table/index keys and preserve the keyspace prefix in scatter ranges.
Split-scatter only decodes a txn keyspace prefix when the corresponding txn keyspace boundary regions are present, avoiding false positives for classic/raw keys that happen to start with keyspace-like bytes.
Scatter group names are keyspace-scoped, so table/index groups from different keyspaces do not share running-op accounting.
Raw keyspace keys are intentionally ignored by the table/index range-hint decoder to avoid mixing raw and txn group semantics.

Make split-scatter range hints keyspace-aware for TiDB txn keyspace table/index keys.

Preserve the keyspace prefix in scatter ranges, scope table/index scatter groups by keyspace, and ignore raw keyspace keys for table/index grouping.

Check List

Tests

Unit test
Manual Test

Environment

TCMS execution: https://tcms.pingcap.net/dashboard/executions/plan/8140464
PD: hash e3ab037ffe1f5b3dbf06691658c366da307b9a99, Kernel Type: Next Generation
TiKV CSE: hash f469ce15a327c9b77efe0f2cfab920b65142a6aa
Keyspaces: keyspace_b id 1, keyspace_a id 2, SYSTEM id 16777214
Schedulers: [] after removing balance and evict schedulers
Workload: index scan

Result

keyspace_a hot index regions: 1 -> 52

- Final `keyspace_a` distribution: - store `1001`: leaders `13`, peers `39` - store `1004`: leaders `13`, peers `39` - store `1005`: leaders `13`, peers `39` - store `1160`: leaders `13`, peers `39` - `keyspace_b` same hot index range stayed unchanged: `1` region before, `1` region after.

Check

All keyspace_a region start/end keys use keyspace prefix 0x78000002, matching keyspace_a id 2.
No keyspace_a split boundary uses keyspace id 1 or 16777214.
Sample keyspace_a boundaries

Region	Start key prefix	End key prefix	Approx keys
`2393`	`7800000274800000FF00`	`7800000274800000FF00`	`512`
`2398`	`7800000274800000FF00`	`7800000274800000FF00`	`512`
`2403`	`7800000274800000FF00`	`7800000274800000FF00`	`512`
`2631`	`7800000274800000FF00`	`7800000274800000FF00`	`109026`
`2373`	`7800000274800000FF00`	`7800000274800000FF00`	`109026`

Release note

Load-based split-scatter now uses keyspace-aware table/index grouping and range hints.

coderabbitai · 2026-05-09T03:09:33Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

📝 Walkthrough

Walkthrough

This PR forwards split reasons via gRPC metadata from PD to the scheduling service, the scheduler extracts the reason and records LOAD-driven split-scatter batches (collecting new region IDs), and split-scatter range hint resolution and scatter-group selection are extended to be txn-keyspace-aware; tests and keyspace utilities were added/updated.

Changes

Split-Scatter Metadata Forwarding and Load-Based Recording with Keyspace Support

Layer / File(s)	Summary
Metadata Key Contract `pkg/utils/grpcutil/grpcutil.go`	New exported constant `SplitReasonMetadataKey` with value `"pd-split-reason"` to propagate split reasons via gRPC metadata in internal forwarding.
Forwarding Path `server/grpc_service.go`, `server/...`	When forwarding `AskBatchSplit` to scheduling service, appends split-reason value to outgoing gRPC context under `SplitReasonMetadataKey`.
Scheduling Service Handler `pkg/mcs/scheduling/server/grpc_service.go`	Added `splitReasonFromContext` helper to extract split reason from gRPC metadata (defaults to `ADMIN`). Updated `AskBatchSplit` to collect newly allocated region IDs and conditionally call `RecordSplitScatterBatch` when split reason is `LOAD`.
Keyspace-Aware Range Hints `pkg/schedule/checker/split_scatter_group.go`, `pkg/schedule/checker/split_scatter.go`	Reworked range hint resolution to decode region keys via `decodeSplitScatterRegionKey`, detect optional txn keyspace prefixes, compute keyspace-prefixed start/end ranges via keyspace-aware helpers, and generate keyspace-aware scatter group names; controller dispatch now validates txn keyspace bounds before resolving hints.
Tests `pkg/mcs/scheduling/server/split_scatter_test.go`, `pkg/mcs/scheduling/server/main_test.go`, `pkg/schedule/checker/split_scatter_test.go`, `tests/server/split_scatter_forward_test.go`	Adds TestMain leak checking; PD client and scheduling-server test doubles; tests for `splitReasonFromContext`, AskBatchSplit recording/skipping behavior, forwarding behavior, and keyspace-aware range-hint test cases and helpers.
Keyspace Utilities & Tests `pkg/keyspace/util.go`, `pkg/keyspace/util_test.go`	Refactored MakeRegionBound to use keyspace-mode prefixes and next-prefix calculation; added TestMakeRegionBound covering normal and overflow cases.
Tools Test Update `tools/pd-ctl/pdctl/command/keyspace_command_test.go`	Updated test to use `keyspace.MakeRegionBound` for expected boundary values and removed unused imports.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

tikv/pd#10621: Implements forwarding and scheduling-service handling for load-based split-scatter (metadata propagation, batch recording, and keyspace-aware range/group logic) which directly relates to this change.

Suggested labels

lgtm, approved

Suggested reviewers

rleungx
bufferflies

Poem

🐰 I hop where split reasons softly stream,
I whisper "pd-split-reason" in the scheme,
LOAD marks batches and new region IDs,
keyspaces guide ranges where the scatter glides,
tests clap their paws — the changes gleam.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 2.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Title check	✅ Passed	The title clearly and concisely summarizes the main change: adding split-scatter support for MCS keyspaces, which is the primary objective of this PR.
Description check	✅ Passed	PR description follows the template structure with issue number, detailed changes, commit message, and test coverage information provided.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (2)

pkg/mcs/scheduling/server/grpc_service.go (1)
353-365: 💤 Low value

Consider validating the parsed split reason enum value.

While the current implementation correctly defaults to ADMIN for missing or malformed metadata, it doesn't validate that the parsed integer corresponds to a valid pdpb.SplitReason enum value. For example, metadata containing "999" would parse successfully but yield an invalid enum.

Since this is an internal PD-to-scheduling-service contract (not a public API), the risk is low. However, adding a simple range check would make the code more defensive:
func splitReasonFromContext(ctx context.Context) pdpb.SplitReason {
	values := metadata.ValueFromIncomingContext(ctx, grpcutil.SplitReasonMetadataKey)
	if len(values) == 0 {
		return pdpb.SplitReason_ADMIN
	}
	reason, err := strconv.ParseInt(values[len(values)-1], 10, 32)
	if err != nil || reason < 0 || reason > int64(pdpb.SplitReason_LOAD) {
		return pdpb.SplitReason_ADMIN
	}
	return pdpb.SplitReason(reason)
}
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@pkg/mcs/scheduling/server/grpc_service.go` around lines 353 - 365,
splitReasonFromContext currently parses an int from metadata but doesn't verify
it maps to a valid pdpb.SplitReason; update splitReasonFromContext to parse the
last metadata value (grpcutil.SplitReasonMetadataKey) as before, but after
strconv.ParseInt also check the parsed value is within the valid enum range
(e.g. >= 0 && <= int64(pdpb.SplitReason_LOAD)) and if not (or on parse error)
return pdpb.SplitReason_ADMIN; keep the existing fallback behavior for missing
metadata.
tests/server/split_scatter_forward_test.go (1)
104-106: 💤 Low value

Consider logging the gRPC server error for test debuggability.

While discarding the Serve error is acceptable for test code (since grpcServer.Stop() will cleanly terminate it), logging the error would help diagnose failures if the server fails to start:
go func() {
	if err := grpcServer.Serve(listener); err != nil {
		t.Logf("grpcServer.Serve returned: %v", err)
	}
}()
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/server/split_scatter_forward_test.go` around lines 104 - 106, Update
the anonymous goroutine that starts the gRPC server to log any Serve error
instead of discarding it: call grpcServer.Serve(listener) and if it returns a
non-nil error, use the test logger (t.Logf) to report the error (e.g.,
"grpcServer.Serve returned: %v") so test failures to start the server are
visible; modify the goroutine around grpcServer.Serve and listener to capture
and log the returned error via t.Logf.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@pkg/schedule/checker/split_scatter_group.go`:
- Around line 85-97: The branch in the decoder currently treats any decoded key
starting with 'x' as txn-keyspace and strips the first
splitScatterKeyspacePrefixLen bytes, which can mis-classify raw keys; modify the
logic in the function that returns splitScatterDecodedKey so that before
constructing keyspacePrefix/keyspaceID you fully validate the keyspace encoding
using the canonical parser (e.g., call the same parser used for TiDB txn
keyspace validation) and only strip the prefix when that parser confirms a valid
txn keyspace encoding; otherwise return splitScatterDecodedKey{rawKey: decoded}
unchanged. Also add a regression test targeting the decoder that feeds an
'x'-prefixed raw key (e.g., starts with x\x00\x00\x00t...) and asserts it
remains a rawKey, ensuring splitScatterKeyspacePrefixLen-based heuristics are
not used alone.

---

Nitpick comments:
In `@pkg/mcs/scheduling/server/grpc_service.go`:
- Around line 353-365: splitReasonFromContext currently parses an int from
metadata but doesn't verify it maps to a valid pdpb.SplitReason; update
splitReasonFromContext to parse the last metadata value
(grpcutil.SplitReasonMetadataKey) as before, but after strconv.ParseInt also
check the parsed value is within the valid enum range (e.g. >= 0 && <=
int64(pdpb.SplitReason_LOAD)) and if not (or on parse error) return
pdpb.SplitReason_ADMIN; keep the existing fallback behavior for missing
metadata.

In `@tests/server/split_scatter_forward_test.go`:
- Around line 104-106: Update the anonymous goroutine that starts the gRPC
server to log any Serve error instead of discarding it: call
grpcServer.Serve(listener) and if it returns a non-nil error, use the test
logger (t.Logf) to report the error (e.g., "grpcServer.Serve returned: %v") so
test failures to start the server are visible; modify the goroutine around
grpcServer.Serve and listener to capture and log the returned error via t.Logf.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 932111be-84de-4105-b36a-bf16bf0091e7

📥 Commits

Reviewing files that changed from the base of the PR and between 38a0f9a and fb3f55d9ca32620427e5065b0ad656354a2e8e0f.

📒 Files selected for processing (7)

pkg/mcs/scheduling/server/grpc_service.go
pkg/mcs/scheduling/server/split_scatter_test.go
pkg/schedule/checker/split_scatter_group.go
pkg/schedule/checker/split_scatter_test.go
pkg/utils/grpcutil/grpcutil.go
server/grpc_service.go
tests/server/split_scatter_forward_test.go

codecov · 2026-05-09T03:20:35Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 79.06%. Comparing base (f6653ed) to head (72c613d).
⚠️ Report is 4 commits behind head on master.

Additional details and impacted files

@@           Coverage Diff            @@
##           master   #10648    +/-   ##
========================================
  Coverage   79.06%   79.06%            
========================================
  Files         535      536     +1     
  Lines       73065    73231   +166     
========================================
+ Hits        57767    57901   +134     
- Misses      11211    11231    +20     
- Partials     4087     4099    +12

Flag	Coverage Δ
unittests	`79.06% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (2)

pkg/schedule/checker/split_scatter_test.go (1)
519-525: 💤 Low value

Nit: prefer the existing splitScatterTestTableID constant.

makeSplitScatterTableGroupForKeyspace(splitScatterTestKeyspaceID, 42, true) and the matching codec.GenerateTableKey(42) re-encode the literal 42, which is already exposed as splitScatterTestTableID. Using the constant keeps the test in lockstep if the test table ID is ever changed.
Proposed change
-	wantRange := splitScatterKeyspacePrefixRange(splitScatterTestKeyspaceID, codec.GenerateTableKey(42))
-	wantRange.scatterGroup = makeSplitScatterTableGroupForKeyspace(splitScatterTestKeyspaceID, 42, true)
+	wantRange := splitScatterKeyspacePrefixRange(splitScatterTestKeyspaceID, codec.GenerateTableKey(splitScatterTestTableID))
+	wantRange.scatterGroup = makeSplitScatterTableGroupForKeyspace(splitScatterTestKeyspaceID, splitScatterTestTableID, true)
Same applies to the analogous literals at lines 439-440.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@pkg/schedule/checker/split_scatter_test.go` around lines 519 - 525, Replace
the hard-coded table ID literal 42 with the existing splitScatterTestTableID
constant wherever used in this test; specifically update the
codec.GenerateTableKey(42) call and the
makeSplitScatterTableGroupForKeyspace(splitScatterTestKeyspaceID, 42, true) call
(both used when calling splitScatterKeyspacePrefixRange and when building
wantRange.scatterGroup) to use splitScatterTestTableID instead, and do the same
for the analogous occurrences earlier in the file (the other
codec.GenerateTableKey and makeSplitScatterTableGroupForKeyspace usages).
pkg/mcs/scheduling/server/split_scatter_test.go (1)
51-59: 💤 Low value

AllocID stub ignores the requested Count.

The fake hardcodes count := uint32(1) regardless of req.GetCount(), so when production code requests N IDs in one call (the standard PD AskBatchSplit path allocates a region ID plus peer IDs per split, often via a single AllocID with count > 1), the stub silently returns only one ID and a mismatched Count. If AskBatchSplit ever switches from one-by-one allocations to batched allocations, these tests would either start failing in obscure ways or — worse — pass while exercising a non-realistic allocation path.
Proposed change
 func (c *splitScatterPDClient) AllocID(context.Context, *pdpb.AllocIDRequest, ...grpc.CallOption) (*pdpb.AllocIDResponse, error) {
-	count := uint32(1)
-	c.next += uint64(count)
+	count := req.GetCount()
+	if count == 0 {
+		count = 1
+	}
+	c.next += uint64(count)
 	return &pdpb.AllocIDResponse{
 		Header: &pdpb.ResponseHeader{},
 		Id:     c.next - 1,
 		Count:  count,
 	}, nil
 }
(also requires naming the request parameter)
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@pkg/mcs/scheduling/server/split_scatter_test.go` around lines 51 - 59, The
AllocID stub in splitScatterPDClient should respect the requested count instead
of hardcoding 1: rename the unnamed request parameter to req
(*pdpb.AllocIDRequest), read count := req.GetCount(), set count = 1 if zero,
then advance c.next by uint64(count) and return the response using that count
(keep the existing pattern of returning Id: c.next-1 and Count: count); update
the function signature AllocID and the body to use req.GetCount() accordingly.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@pkg/mcs/scheduling/server/split_scatter_test.go`:
- Around line 138-147: mcsSplitScatterPendingIDs is reading
Controller.splitScatter.pending via reflection without holding
splitScatter.pendingMu; fix by honoring the lock: add a test-accessor method on
Controller (e.g., Controller.SplitScatterPendingRegionIDs or
Controller.SplitScatterPendingIDs) that acquires splitScatter.pendingMu, copies
the map keys to a []uint64 and returns them, then update
mcsSplitScatterPendingIDs to call that accessor (or alternatively, if you prefer
reflection, use reflection to obtain the splitScatter.pendingMu and Lock/Unlock
it around reading pending); reference symbols: mcsSplitScatterPendingIDs,
Controller, splitScatter, pending, pendingMu.

---

Nitpick comments:
In `@pkg/mcs/scheduling/server/split_scatter_test.go`:
- Around line 51-59: The AllocID stub in splitScatterPDClient should respect the
requested count instead of hardcoding 1: rename the unnamed request parameter to
req (*pdpb.AllocIDRequest), read count := req.GetCount(), set count = 1 if zero,
then advance c.next by uint64(count) and return the response using that count
(keep the existing pattern of returning Id: c.next-1 and Count: count); update
the function signature AllocID and the body to use req.GetCount() accordingly.

In `@pkg/schedule/checker/split_scatter_test.go`:
- Around line 519-525: Replace the hard-coded table ID literal 42 with the
existing splitScatterTestTableID constant wherever used in this test;
specifically update the codec.GenerateTableKey(42) call and the
makeSplitScatterTableGroupForKeyspace(splitScatterTestKeyspaceID, 42, true) call
(both used when calling splitScatterKeyspacePrefixRange and when building
wantRange.scatterGroup) to use splitScatterTestTableID instead, and do the same
for the analogous occurrences earlier in the file (the other
codec.GenerateTableKey and makeSplitScatterTableGroupForKeyspace usages).

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 2ce03ca2-1e93-4b5f-a822-3922629f3b8e

📥 Commits

Reviewing files that changed from the base of the PR and between fb3f55d9ca32620427e5065b0ad656354a2e8e0f and cb9e5c4c4183fd9f8d894fbce21585858461cf81.

📒 Files selected for processing (5)

pkg/mcs/scheduling/server/main_test.go
pkg/mcs/scheduling/server/split_scatter_test.go
pkg/schedule/checker/split_scatter.go
pkg/schedule/checker/split_scatter_group.go
pkg/schedule/checker/split_scatter_test.go

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

pkg/keyspace/util_test.go (1)

64-81: ⚡ Quick win

Add a partial-carry case for next prefix increment behavior.

Current coverage has no-carry and full-carry, but misses intermediate carry propagation (e.g. 0x0102ff -> 0x010300), which is core to the new boundary logic.

Proposed test extension

 func TestMakeRegionBound(t *testing.T) {
 	re := require.New(t)
 	encodeKey := func(key []byte) []byte {
 		return []byte(codec.EncodeBytes(key))
 	}

 	regionBound := MakeRegionBound(0x010203)
 	re.Equal(encodeKey([]byte{'r', 0x01, 0x02, 0x03}), regionBound.RawLeftBound)
 	re.Equal(encodeKey([]byte{'r', 0x01, 0x02, 0x04}), regionBound.RawRightBound)
 	re.Equal(encodeKey([]byte{'x', 0x01, 0x02, 0x03}), regionBound.TxnLeftBound)
 	re.Equal(encodeKey([]byte{'x', 0x01, 0x02, 0x04}), regionBound.TxnRightBound)

+	carryRegionBound := MakeRegionBound(0x0102ff)
+	re.Equal(encodeKey([]byte{'r', 0x01, 0x02, 0xff}), carryRegionBound.RawLeftBound)
+	re.Equal(encodeKey([]byte{'r', 0x01, 0x03, 0x00}), carryRegionBound.RawRightBound)
+	re.Equal(encodeKey([]byte{'x', 0x01, 0x02, 0xff}), carryRegionBound.TxnLeftBound)
+	re.Equal(encodeKey([]byte{'x', 0x01, 0x03, 0x00}), carryRegionBound.TxnRightBound)
+
 	maxRegionBound := MakeRegionBound(constant.MaxValidKeyspaceID)
 	re.Equal(encodeKey([]byte{'r', 0xff, 0xff, 0xff}), maxRegionBound.RawLeftBound)
 	re.Equal(encodeKey([]byte{'s', 0x00, 0x00, 0x00}), maxRegionBound.RawRightBound)
 	re.Equal(encodeKey([]byte{'x', 0xff, 0xff, 0xff}), maxRegionBound.TxnLeftBound)
 	re.Equal(encodeKey([]byte{'y', 0x00, 0x00, 0x00}), maxRegionBound.TxnRightBound)
 }

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@pkg/keyspace/util_test.go` around lines 64 - 81, Add a test case in
TestMakeRegionBound for a partial-carry next-prefix scenario (e.g. id 0x0102ff)
to cover intermediate carry propagation: call MakeRegionBound(0x0102ff) and
assert that regionBound.RawLeftBound equals encodeKey([]byte{'r', 0x01, 0x02,
0xff}), regionBound.RawRightBound equals encodeKey([]byte{'r', 0x01, 0x03,
0x00}), regionBound.TxnLeftBound equals encodeKey([]byte{'x', 0x01, 0x02,
0xff}), and regionBound.TxnRightBound equals encodeKey([]byte{'x', 0x01, 0x03,
0x00}) so the next-prefix increment logic in MakeRegionBound is validated for
partial carry.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@tests/server/split_scatter_forward_test.go`:
- Around line 81-94: The test currently blocks forever on reading from
fakeScheduling.reasons (in the loop that calls AskBatchSplit) if AskBatchSplit
stops forwarding; change the assertion to use a bounded receive so the test
fails fast: after calling grpcServer.AskBatchSplit, replace the direct receive
from fakeScheduling.reasons with a select that either reads the expected value
(compare to strconv.FormatInt(int64(reason), 10)) or times out after a short
duration (e.g., a few hundred milliseconds) and triggers a test failure;
reference the AskBatchSplit call and the fakeScheduling.reasons channel when
making the change.

---

Nitpick comments:
In `@pkg/keyspace/util_test.go`:
- Around line 64-81: Add a test case in TestMakeRegionBound for a partial-carry
next-prefix scenario (e.g. id 0x0102ff) to cover intermediate carry propagation:
call MakeRegionBound(0x0102ff) and assert that regionBound.RawLeftBound equals
encodeKey([]byte{'r', 0x01, 0x02, 0xff}), regionBound.RawRightBound equals
encodeKey([]byte{'r', 0x01, 0x03, 0x00}), regionBound.TxnLeftBound equals
encodeKey([]byte{'x', 0x01, 0x02, 0xff}), and regionBound.TxnRightBound equals
encodeKey([]byte{'x', 0x01, 0x03, 0x00}) so the next-prefix increment logic in
MakeRegionBound is validated for partial carry.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: d1cc3420-05dd-4fe1-8ec7-03d23c32ddee

📥 Commits

Reviewing files that changed from the base of the PR and between db47cd786b2106ac8582eb59f15ba74701cee49c and c07710f3834c33e932532834bd102a5f6d40476b.

📒 Files selected for processing (5)

pkg/keyspace/util.go
pkg/keyspace/util_test.go
pkg/mcs/scheduling/server/grpc_service.go
pkg/mcs/scheduling/server/split_scatter_test.go
tests/server/split_scatter_forward_test.go

🚧 Files skipped from review as they are similar to previous changes (1)

pkg/mcs/scheduling/server/grpc_service.go

coderabbitai

🧹 Nitpick comments (1)

tools/pd-ctl/pdctl/command/keyspace_command_test.go (1)
57-61: ⚡ Quick win

Avoid self-referential expected values in this test.

On Line 57, expected bounds are derived from keyspace.MakeRegionBound, which is closely related to the logic under test. This weakens regression detection if both implementations drift together. Please keep at least one independently hardcoded/golden boundary case (for one keyspace ID) in addition to these helper-based checks.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tools/pd-ctl/pdctl/command/keyspace_command_test.go` around lines 57 - 61,
The test currently asserts expected bounds by calling keyspace.MakeRegionBound
and comparing its outputs (regionBound.RawLeftBound/RawRightBound and
regionBound.TxnLeftBound/TxnRightBound), which is self-referential; change it so
at least one of the expected values is a hardcoded/golden hex string for a
concrete keyspace ID (e.g., pick a specific keyspaceID used in the test), and
keep the other assertions using keyspace.MakeRegionBound for coverage. Update
the assertion that compares hex.EncodeToString(regionBound.RawLeftBound) (or any
one of RawRightBound/TxnLeftBound/TxnRightBound) to use the independent
hardcoded expected hex value instead of deriving it from
keyspace.MakeRegionBound. Ensure the golden value matches the canonical encoding
for that chosen keyspaceID.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Nitpick comments:
In `@tools/pd-ctl/pdctl/command/keyspace_command_test.go`:
- Around line 57-61: The test currently asserts expected bounds by calling
keyspace.MakeRegionBound and comparing its outputs
(regionBound.RawLeftBound/RawRightBound and
regionBound.TxnLeftBound/TxnRightBound), which is self-referential; change it so
at least one of the expected values is a hardcoded/golden hex string for a
concrete keyspace ID (e.g., pick a specific keyspaceID used in the test), and
keep the other assertions using keyspace.MakeRegionBound for coverage. Update
the assertion that compares hex.EncodeToString(regionBound.RawLeftBound) (or any
one of RawRightBound/TxnLeftBound/TxnRightBound) to use the independent
hardcoded expected hex value instead of deriving it from
keyspace.MakeRegionBound. Ensure the golden value matches the canonical encoding
for that chosen keyspaceID.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 087b29f0-5446-4a7c-8635-d868d70dae9e

📥 Commits

Reviewing files that changed from the base of the PR and between c07710f3834c33e932532834bd102a5f6d40476b and cd7966f83949eade8f41034d7f7d104bb341397e.

📒 Files selected for processing (1)

tools/pd-ctl/pdctl/command/keyspace_command_test.go

ti-chi-bot · 2026-05-15T10:13:10Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: rleungx

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details

Needs approval from an approver in each of these files:

~~OWNERS~~ [rleungx]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

ti-chi-bot · 2026-05-15T10:13:11Z

[LGTM Timeline notifier]

Timeline:

2026-05-15 10:13:10.770960155 +0000 UTC m=+3663.256363816: ☑️ agreed by rleungx.

liyishuai · 2026-05-19T08:01:06Z

+				StartKey: startKey,
+				EndKey:   endKey,
+			}, nil)
+			re.Equal(splitScatterRangeHint{}, resolveSplitScatterRangeHint(region))


This test is the only remaining usage of resolveSplitScatterRangeHint. Maybe worth removing the function?

Signed-off-by: lhy1024 <admin@liudos.us>

ti-chi-bot · 2026-05-19T08:28:01Z

@lhy1024: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name	Commit	Details	Required	Rerun command
pull-unit-test-next-gen-3	`72c613d`	link	true	`/test pull-unit-test-next-gen-3`

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

lhy1024 · 2026-05-21T04:22:44Z

@bufferflies @okJiang PTAL

ti-chi-bot Bot added release-note Denotes a PR that will be considered when it comes time to generate release notes. dco-signoff: yes Indicates the PR's author has signed the dco. labels May 9, 2026

ti-chi-bot Bot added the size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. label May 9, 2026

coderabbitai Bot reviewed May 9, 2026

View reviewed changes

Comment thread pkg/schedule/checker/split_scatter_group.go Outdated

ti-chi-bot Bot added size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. and removed size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels May 9, 2026

coderabbitai Bot reviewed May 9, 2026

View reviewed changes

Comment thread pkg/mcs/scheduling/server/split_scatter_test.go Outdated

coderabbitai Bot reviewed May 9, 2026

View reviewed changes

Comment thread tests/server/split_scatter_forward_test.go Outdated

coderabbitai Bot reviewed May 9, 2026

View reviewed changes

This was referenced May 9, 2026

schedulingpb: add split reason to ask batch split request pingcap/kvproto#1459

Merged

server,mcs: pass split reason to scheduling service #10652

Merged

ti-chi-bot Bot added size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. and removed size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. labels May 11, 2026

lhy1024 force-pushed the split-scatter-3 branch from 0421855 to c80fbbc Compare May 12, 2026 06:12

ti-chi-bot Bot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels May 12, 2026

lhy1024 changed the title ~~checker, server: support split-scatter for MCS keyspaces~~ checker: support keyspace-aware split scatter May 12, 2026

This was referenced May 12, 2026

keyspace: max valid keyspace region bound wraps around #10664

Closed

Scatter recently split regions in PD #10592

Closed

okJiang mentioned this pull request May 12, 2026

TestAffinityListWithIDs is flaky #10565

Open

lhy1024 changed the title ~~checker: support keyspace-aware split scatter~~ checker, server: support split-scatter for MCS keyspaces May 13, 2026

lhy1024 force-pushed the split-scatter-3 branch from c8bde1b to 21ad27e Compare May 13, 2026 08:29

ti-chi-bot Bot added size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. and removed size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels May 13, 2026

lhy1024 force-pushed the split-scatter-3 branch 4 times, most recently from fba1da3 to cf12a91 Compare May 13, 2026 09:31

ti-chi-bot Bot added size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. and removed size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels May 13, 2026

lhy1024 force-pushed the split-scatter-3 branch from cf12a91 to e3ab037 Compare May 13, 2026 10:50

ti-chi-bot Bot added size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. and removed size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. labels May 13, 2026

lhy1024 requested review from bufferflies and okJiang May 13, 2026 11:07

rleungx approved these changes May 15, 2026

View reviewed changes

ti-chi-bot Bot added the needs-1-more-lgtm Indicates a PR needs 1 more LGTM. label May 15, 2026

ti-chi-bot Bot added the approved label May 15, 2026

liyishuai reviewed May 19, 2026

View reviewed changes

checker: support split scatter for keyspaces

72c613d

Signed-off-by: lhy1024 <admin@liudos.us>

lhy1024 force-pushed the split-scatter-3 branch from e3ab037 to 72c613d Compare May 19, 2026 08:12

Conversation

lhy1024 commented May 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What problem does this PR solve?

What is changed and how does it work?

Check List

Environment

Result

Check

Release note

Uh oh!

coderabbitai Bot commented May 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Poem

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

codecov Bot commented May 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

ti-chi-bot Bot commented May 15, 2026

Uh oh!

ti-chi-bot Bot commented May 15, 2026

[LGTM Timeline notifier]

Uh oh!

liyishuai May 19, 2026

Choose a reason for hiding this comment

Uh oh!

ti-chi-bot Bot commented May 19, 2026

Uh oh!

lhy1024 commented May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lhy1024 commented May 9, 2026 •

edited

Loading

coderabbitai Bot commented May 9, 2026 •

edited

Loading

codecov Bot commented May 9, 2026 •

edited

Loading