This website uses Cookies. Click Accept to agree to our website's cookie use as described in our Privacy Policy. Click Preferences to customize your cookie settings.
I have set my Vertex AI Deployed Model min_replicas to 3 and max to 10,
its still getting descaled to 2, the min_replicas are automatically
getting descaled to 2. Why is this happening? Its causing errors on
production for the service.
I have a deployed endpoint on Vertex AI with auto-scaling being enabled.
But I want to manually adjust the min-replicas and max-replicas for the
deployed endpoint. How to do so?
The requirement was I wanted to manually increase min & max-replicas of
a already deployed model in vertex ai. I got this api Method:
projects.locations.endpoints.mutateDeployedModel - But this api is only
allowing me to scale nodes till a limit of m...