DAOS-18640 test: per_server_fault_domain.yaml - Increase space_rb to 10% #18283
shimizukko wants to merge 3 commits into
The tests in per_server_fault_domain.py are failing due to a rebuild timeout after stopping the ranks. test_rf2_unclean had been fixed with space_rb:5%, but apparently 5% isn't enough for the other tests in this file, so increase it.

Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-large: false
Test-tag: PerServerFaultDomainTest
Test-repeat: 3

Signed-off-by: Makito Kano <makito.kano@hpe.com>
Ticket title is 'container/per_server_fault_domain.py:PerServerFaultDomainTest.test_rf2_unclean - wait for rebuild test timeout - DER_NOSPACE'

Test stage Functional Hardware Large completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-18283/1/execution/node/824/log
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-large: false
Test-tag: PerServerFaultDomainTest
Test-repeat: 4

Signed-off-by: Makito Kano <makito.kano@hpe.com>
mchaarawi
left a comment
Why is it OK to change space_rb to 10?
The default is 5, and changing it to 10 suggests there is an underlying issue.
On Aurora we don't even set it to 10; we set it to 8. More information is needed on why 10 is required here.
After some more investigation here, I'm not sure increasing space_rb will even help:
daltonbohning
left a comment
I agree with Mohamad that we should not need to increase space_rb for this one test. It's probably better to adjust the data generation or size such that we have enough space to begin with. See my comment here: https://daosio.atlassian.net/browse/DAOS-18640?focusedCommentId=174182
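To make the trade-off in the two review comments concrete, here is a minimal Python sketch of the space arithmetic. It assumes space_rb is a percentage of pool capacity held back for rebuild; the helper names and the 100 GiB pool size are hypothetical and are not taken from the test or its yaml.

```python
def rebuild_reserve_bytes(pool_bytes: int, space_rb_pct: int) -> int:
    """Bytes held back for rebuild, assuming space_rb is a percentage of pool capacity."""
    if not 0 <= space_rb_pct <= 100:
        raise ValueError("space_rb must be between 0 and 100")
    return pool_bytes * space_rb_pct // 100


def max_data_bytes(pool_bytes: int, space_rb_pct: int) -> int:
    """Upper bound on test data that leaves the rebuild reserve untouched."""
    return pool_bytes - rebuild_reserve_bytes(pool_bytes, space_rb_pct)


if __name__ == "__main__":
    pool_bytes = 100 * 1024**3  # hypothetical 100 GiB pool, for illustration only
    for pct in (5, 8, 10):  # the values mentioned in the review comments
        reserve = rebuild_reserve_bytes(pool_bytes, pct)
        usable = max_data_bytes(pool_bytes, pct)
        print(f"space_rb={pct}%: reserve={reserve} bytes, usable={usable} bytes")
```

Under that assumption, raising space_rb shrinks the space available for test data, whereas sizing the written data below the usable figure keeps the default reserve intact, which is the direction suggested above.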
Skip-unit-tests: true
Skip-fault-injection-test: true
Skip-func-hw-test-large: false
Test-tag: PerServerFaultDomainTest
Test-repeat: 5

Signed-off-by: Makito Kano <makito.kano@hpe.com>