configuring KnativeServing to allow zero initial scale when installed#1774
Conversation
|
/test opendatahub-operator-e2e |
There was a problem hiding this comment.
The changes make sense and look good to me but it'd be good to have someone with more operator/kserve experience take a look as well.
CC: @zdtsw
|
/cc @Jooho |
i added other serving guys as reviewers here as well. |
|
btw, if this is not needed for rhoai 2.19, we can wait till next week to sync downstream, right? |
|
@zdtsw RHOAIENG-18965 is marked as fix version RHOAI_2.20.0 |
|
The changes looks good to me and I tested it on my end, it is working fine.
|
It should, ideally, there should be an e2e test to ensure it happens
since this changes the spec of the KnativeServing, I would expect the related controller would act upon the change. This can be tested independently of the odh operator. |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: carlkyrillos, spolti The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
@zdtsw the failed use case is not related to this change. Could you please review the e2e ci? |
|
/test opendatahub-operator-e2e |
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## main #1774 +/- ##
=======================================
Coverage 25.71% 25.71%
=======================================
Files 173 173
Lines 11892 11892
=======================================
Hits 3058 3058
Misses 8545 8545
Partials 289 289 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
|
@brettmthompson can you please fix the conflict? |
c77e3c8 to
f82c35a
Compare
Signed-off-by: Brett Thompson <[email protected]>
f82c35a to
4c3d3d8
Compare
…cturing Signed-off-by: Brett Thompson <[email protected]>
israel-hdez
left a comment
There was a problem hiding this comment.
@brettmthompson Please, make sure that docs for manual setup are updated.
|
/test opendatahub-operator-e2e |
…0-causes-new-pod-creation
|
/lgtm |
1fcd734
into
opendatahub-io:main
|
/cherry-pick rhoai |
|
@grdryn: new pull request created: #1877 DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
Description
This PR is required to resolve RHOAIENG-18965.
The cause of "Changing 0 replicas to 0 causes a new pod creation" is the default KnativeServing configuration, which does not allow for this functionality. By default KnativeServing is configured to not allow zero initial scale and has the default initial scale value as 0. This means that for any knative revision to be considered ready at least one replica must first be created. In the case of changing an inference service's replicas to 0, a new revision is created with initial scale as the default value of 1, which causes a single pod to be spun up. Once the initial scale is reached this pod is then destroyed to achieve the desired state of 0 replicas.
Per the Knative Configuring Scale Bounds documentation, to prevent pods from being created when an inference service is configured to have 0 replicas, the following is required:
How Has This Been Tested?
Test Strategy:
Inference Service:
Inference Service:
Merge criteria