DAOS-18976 rebuild: bump rebuild gen when leader retries the rebuild #18326

Open
liuxuezhao wants to merge 1 commit into
release/2.6 from
lxz/rebuild_gen_26

@liuxuezhao
Contributor

If the same PS leader retries the rebuild/reclaim on the same version, it should bump the generation.

Steps for the author:

  • Commit message follows the guidelines.
  • Appropriate Features or Test-tag pragmas were used.
  • Appropriate Functional Test Stages were run.
  • At least two positive code reviews including at least one code owner from each category referenced in the PR.
  • Testing is complete. If necessary, forced-landing label added and a reason added in a comment.

After all prior steps are complete:

  • Gatekeeper requested (daos-gatekeeper added as a reviewer).

If the same PS leader retries the rebuild/reclaim on the same version, it should bump
the generation.

Signed-off-by: Xuezhao Liu <xuezhao.liu@hpe.com>
@liuxuezhao liuxuezhao requested review from a team as code owners May 22, 2026 08:48
@github-actions

Ticket title is 'Aurora rebuild failing with DER_HG / DER_SHUTDOWN'
Status is 'In Progress'
Labels: 'test_2.6.5rc1'
https://daosio.atlassian.net/browse/DAOS-18976

Contributor

@wangshilong wangshilong left a comment

So PR #18289 is not needed? Bumping to a new gen means it won't return EBUSY and will abort the previous rebuild scan.

@liuxuezhao
Contributor Author

liuxuezhao commented May 22, 2026

> So PR #18289 is not needed? Bumping to a new gen means it won't return EBUSY and will abort the previous rebuild scan.

I think PR #18289 is still needed. It makes SCAN exit earlier on finalize, and the race in rebuild_pool_tls_lookup() still needs to be fixed even if the old-gen rebuild is aborted.
