Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unable to update lease lock in fabric client with kubernetes version : v1.25.11-eks-a5565ad #5635

Closed
premkumarramasamy opened this issue Dec 5, 2023 · 10 comments · Fixed by #5747
Assignees
Labels
Waiting on feedback Issues that require feedback from User/Other community members

Comments

@premkumarramasamy
Copy link

premkumarramasamy commented Dec 5, 2023

Describe the bug

We are facing continous leadership change due the below issue and same was working fine before the upgrade to latest version. We have also tried deleting the object and deployed the app and the issue appears again.

There is no other/manual edits to lease object.

io.fabric8.kubernetes.client.KubernetesClientException: Failure executing: PATCH at: https://172.20.0.1:443/apis/coordination.k8s.io/v1/namespaces/xyz/leases/abc. Message: Operation cannot be fulfilled on leases.coordination.k8s.io "abc": the object has been modified; please apply your changes to the latest version and try again. Received status: Status(apiVersion=v1, code=409, details=StatusDetails(causes=[], group=coordination.k8s.io, kind=leases, name=abc, retryAfterSeconds=null, uid=null, additionalProperties={}), kind=Status, message=Operation cannot be fulfilled on leases.coordination.k8s.io "abc": the object has been modified; please apply your changes to the latest version and try again, metadata=ListMeta(_continue=null, remainingItemCount=null, resourceVersion=null, selfLink=null, additionalProperties={}), reason=Conflict, status=Failure, additionalProperties={}).
	at io.fabric8.kubernetes.client.KubernetesClientException.copyAsCause(KubernetesClientException.java:238)
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.waitForResult(OperationSupport.java:507)
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.handleResponse(OperationSupport.java:524)
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.handlePatch(OperationSupport.java:419)
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.handlePatch(OperationSupport.java:397)
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.handlePatch(BaseOperation.java:763)
	at io.fabric8.kubernetes.client.dsl.internal.HasMetadataOperation.lambda$patch$2(HasMetadataOperation.java:232)
	at io.fabric8.kubernetes.client.dsl.internal.HasMetadataOperation.patch(HasMetadataOperation.java:237)
	at io.fabric8.kubernetes.client.dsl.internal.HasMetadataOperation.patch(HasMetadataOperation.java:252)
	at io.fabric8.kubernetes.client.dsl.internal.HasMetadataOperation.patch(HasMetadataOperation.java:45)
	at io.fabric8.kubernetes.client.extension.ResourceAdapter.patch(ResourceAdapter.java:305)
	at io.fabric8.kubernetes.client.extended.leaderelection.resourcelock.ResourceLock.update(ResourceLock.java:65)
	at io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.tryAcquireOrRenew(LeaderElector.java:236)
	at io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.lambda$acquire$4(LeaderElector.java:172)
	at io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.lambda$loop$8(LeaderElector.java:282)
	at java.base/java.util.concurrent.CompletableFuture$AsyncRun.run(CompletableFuture.java:1804)
	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
	at java.base/java.lang.Thread.run(Thread.java:840)
Caused by: io.fabric8.kubernetes.client.KubernetesClientException: Failure executing: PATCH at: https://172.20.0.1:443/apis/coordination.k8s.io/v1/namespaces/xyz/leases/abc. Message: Operation cannot be fulfilled on leases.coordination.k8s.io "abc": the object has been modified; please apply your changes to the latest version and try again. Received status: Status(apiVersion=v1, code=409, details=StatusDetails(causes=[], group=coordination.k8s.io, kind=leases, name=abc, retryAfterSeconds=null, uid=null, additionalProperties={}), kind=Status, message=Operation cannot be fulfilled on leases.coordination.k8s.io "abc": the object has been modified; please apply your changes to the latest version and try again, metadata=ListMeta(_continue=null, remainingItemCount=null, resourceVersion=null, selfLink=null, additionalProperties={}), reason=Conflict, status=Failure, additionalProperties={}).
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.requestFailure(OperationSupport.java:660)
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.requestFailure(OperationSupport.java:640)
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.assertResponseCode(OperationSupport.java:589)
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.lambda$handleResponse$0(OperationSupport.java:549)
	at java.base/java.util.concurrent.CompletableFuture$UniApply.tryFire(CompletableFuture.java:646)
	at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510)
	at java.base/java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2147)
	at io.fabric8.kubernetes.client.http.StandardHttpClient.lambda$completeOrCancel$10(StandardHttpClient.java:140)
	at java.base/java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:863)
	at java.base/java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:841)
	at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510)
	at java.base/java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2147)
	at io.fabric8.kubernetes.client.http.ByteArrayBodyHandler.onBodyDone(ByteArrayBodyHandler.java:52)
	at java.base/java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:863)
	at java.base/java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:841)
	at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510)
	at java.base/java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2147)
	at io.fabric8.kubernetes.client.okhttp.OkHttpClientImpl$OkHttpAsyncBody.doConsume(OkHttpClientImpl.java:136)
	... 3 more

Fabric8 Kubernetes Client version

6.9.2

Steps to reproduce

Pods are hitting frequent leadership change due to this issue

Expected behavior

Error shouldn't happen and object should be modified

Runtime

Kubernetes (vanilla)

Kubernetes API Server version

other (please specify in additional context)

Environment

Amazon

Fabric8 Kubernetes Client Logs

io.fabric8.kubernetes.client.KubernetesClientException: Failure executing: PATCH at: https://172.20.0.1:443/apis/coordination.k8s.io/v1/namespaces/xyz/leases/abc. Message: Operation cannot be fulfilled on leases.coordination.k8s.io "abc": the object has been modified; please apply your changes to the latest version and try again. Received status: Status(apiVersion=v1, code=409, details=StatusDetails(causes=[], group=coordination.k8s.io, kind=leases, name=abc, retryAfterSeconds=null, uid=null, additionalProperties={}), kind=Status, message=Operation cannot be fulfilled on leases.coordination.k8s.io "abc": the object has been modified; please apply your changes to the latest version and try again, metadata=ListMeta(_continue=null, remainingItemCount=null, resourceVersion=null, selfLink=null, additionalProperties={}), reason=Conflict, status=Failure, additionalProperties={}).
	at io.fabric8.kubernetes.client.KubernetesClientException.copyAsCause(KubernetesClientException.java:238)
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.waitForResult(OperationSupport.java:507)
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.handleResponse(OperationSupport.java:524)
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.handlePatch(OperationSupport.java:419)
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.handlePatch(OperationSupport.java:397)
	at io.fabric8.kubernetes.client.dsl.internal.BaseOperation.handlePatch(BaseOperation.java:763)
	at io.fabric8.kubernetes.client.dsl.internal.HasMetadataOperation.lambda$patch$2(HasMetadataOperation.java:232)
	at io.fabric8.kubernetes.client.dsl.internal.HasMetadataOperation.patch(HasMetadataOperation.java:237)
	at io.fabric8.kubernetes.client.dsl.internal.HasMetadataOperation.patch(HasMetadataOperation.java:252)
	at io.fabric8.kubernetes.client.dsl.internal.HasMetadataOperation.patch(HasMetadataOperation.java:45)
	at io.fabric8.kubernetes.client.extension.ResourceAdapter.patch(ResourceAdapter.java:305)
	at io.fabric8.kubernetes.client.extended.leaderelection.resourcelock.ResourceLock.update(ResourceLock.java:65)
	at io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.tryAcquireOrRenew(LeaderElector.java:236)
	at io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.lambda$acquire$4(LeaderElector.java:172)
	at io.fabric8.kubernetes.client.extended.leaderelection.LeaderElector.lambda$loop$8(LeaderElector.java:282)
	at java.base/java.util.concurrent.CompletableFuture$AsyncRun.run(CompletableFuture.java:1804)
	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)
	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)
	at java.base/java.lang.Thread.run(Thread.java:840)
Caused by: io.fabric8.kubernetes.client.KubernetesClientException: Failure executing: PATCH at: https://172.20.0.1:443/apis/coordination.k8s.io/v1/namespaces/xyz/leases/abc. Message: Operation cannot be fulfilled on leases.coordination.k8s.io "abc": the object has been modified; please apply your changes to the latest version and try again. Received status: Status(apiVersion=v1, code=409, details=StatusDetails(causes=[], group=coordination.k8s.io, kind=leases, name=abc, retryAfterSeconds=null, uid=null, additionalProperties={}), kind=Status, message=Operation cannot be fulfilled on leases.coordination.k8s.io "abc": the object has been modified; please apply your changes to the latest version and try again, metadata=ListMeta(_continue=null, remainingItemCount=null, resourceVersion=null, selfLink=null, additionalProperties={}), reason=Conflict, status=Failure, additionalProperties={}).
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.requestFailure(OperationSupport.java:660)
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.requestFailure(OperationSupport.java:640)
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.assertResponseCode(OperationSupport.java:589)
	at io.fabric8.kubernetes.client.dsl.internal.OperationSupport.lambda$handleResponse$0(OperationSupport.java:549)
	at java.base/java.util.concurrent.CompletableFuture$UniApply.tryFire(CompletableFuture.java:646)
	at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510)
	at java.base/java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2147)
	at io.fabric8.kubernetes.client.http.StandardHttpClient.lambda$completeOrCancel$10(StandardHttpClient.java:140)
	at java.base/java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:863)
	at java.base/java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:841)
	at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510)
	at java.base/java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2147)
	at io.fabric8.kubernetes.client.http.ByteArrayBodyHandler.onBodyDone(ByteArrayBodyHandler.java:52)
	at java.base/java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:863)
	at java.base/java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:841)
	at java.base/java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:510)
	at java.base/java.util.concurrent.CompletableFuture.complete(CompletableFuture.java:2147)
	at io.fabric8.kubernetes.client.okhttp.OkHttpClientImpl$OkHttpAsyncBody.doConsume(OkHttpClientImpl.java:136)
	... 3 more

Additional context

No response

@shawkins
Copy link
Contributor

shawkins commented Dec 6, 2023

@premkumarramasamy there isn't enough information here to determine what you are seeing. Presumably the exception you are showing is from a debug log with the message Exception occurred while renewing lock. This is at a debug level because it is possible for multiple electors to attempt to become leaders at the same time.

We are facing continous leadership change due the below issue and same was working fine before the upgrade to latest version.

What version specifically are you referring to? There have only been a couple of small changes in the leader elector in the last 6 months, so that could help narrow in.

@manusa manusa added the Waiting on feedback Issues that require feedback from User/Other community members label Dec 19, 2023
@XComp
Copy link

XComp commented Jan 16, 2024

I'm not 100% sure whether it's exactly the same issue. But we're experiencing the same stacktrace in FLINK-34007. I found something (see my comment) that sounds like a bug when the LeaderElector is not able to renew its lease:

The timeout of the renew operation leads to a stopLeading call in LeaderElector:96 which results in LeaderElector being stopped (the stopped flag is set to true in LeaderElector:119 and is never disabled again). Any later call of tryAcquireOrRenew will not perform any action because of the stopped flag being true (see LeaderElector#tryAcquireOrRenew:211).

We tracked the issue down to a change in this PR #4125 which ended up in v6.0.0. We've experiencing this in Apache Flink 1.18 which uses fabric8io:kubernetes-client v6.6.2

The scenario in our case is that the client is usually not re-instantiated after a leadership loss. Therefore, it is stuck in its stopped state.

@shawkins
Copy link
Contributor

@XComp @csviri @premkumarramasamy it might to be good to review expected behavior:

When the elector starts, it will attempt to repeatively acquire the lock. If it fails to do so there should be a debug log of "Failed to acquire lease ...", or an error log of "Exception occurred while acquiring lock ..." - ideally that shoud not always be presented as an error as it's expected to fail when there is competition. I mistakenly above thought that this would be logged at the debug level.

So possible issue number 1 - is @premkumarramasamy seeing a situation where the lock seems like it should be obtainable, but is not - there's not enough information here to determine that.

Once aquired, the elector will repetitively renew. If a renewal attempt ends in an exception, we'll log that at a debug level and try again - arguably that shouldn't always be logged that low. If the elector sees that some other elector has taken over, then it will stop trying to renew and send out the appropriate notifications. It will restart.

Possible issue number 2 - what @XComp is mentioning sounds like there is an expectation that the the elector will resume after leadership loss. However I don't believe this was ever how it was implemented - cc @manusa

@XComp
Copy link

XComp commented Jan 18, 2024

thanks for your response @shawkins

Possible issue number 2 - what @XComp is mentioning sounds like there is an expectation that the the elector will resume after leadership loss. However I don't believe this was ever how it was implemented

Sorry if I didn't make myself clear enough here. Flink doesn't assume that the client resumes after the leadership is lost. One LeaderElector#run/#start call would handle a single "leader election session" (i.e. trying to acquire the leadership, renewing the lease and stopping the leadership). A subsequent call of LeaderElector#run/#start would happen as part of the leadership loss handling on the Flink side. This behavior is on par with what was offered by fabric8io:kubernetes-client:5.12.4

The problem is that the subsequent call of LeaderElector#run/#start would not be able to acquire a new leadership because of the LeaderElector#stopped field being set to true (which happened in the previous LeaderElector#stopLeading call).

As far as I understand it, #stopped would need to be switched to false again when another leader election cycle starts (within LeaderElector#start). WDYT?

@shawkins
Copy link
Contributor

As far as I understand it, #stopped would need to be switched to false again when another leader election cycle starts (within LeaderElector#start). WDYT?

Ah, so you want to keep using the same instance. We haven't been explicit about whether that's expected to work, nor have we explicitly preventing things like simultaneous start calls.

I think the intent behind the stopped change was to make it more like informers, which also cannot be restarted. Since there's state held on the elector and on the lock we then don't need to worry about any kind of reset logic - even if if seems unncessary or trivial in this case we still need to ensure for example that the wrong listerner event won't be called on another usage.

So let's separate the ability to reuse elector instances as an enhancement request. At least for now you should be able to workaround around it by creating a new elector.

@XComp
Copy link

XComp commented Jan 18, 2024

So let's separate the ability to reuse elector instances as an enhancement request.

Fair enough. We could interpret this as desired behavior. My initial perspective was that it's more like a bug because the old behavior supported the reuse of the instance. But the change happened as part of a major version update, which justifies this difference in semantics, I guess.

Thanks for your clarifications.

@XComp
Copy link

XComp commented Jan 18, 2024

One other question, though: What is the semantic of the holder identity in that case?

The issue with the current approach is that one could re-instantiate the LeaderElector to handle another leadership cycle but pass in the same holder identity. If the old identity is still persisted in the ConfigMap (e.g. due to network outage), the leadership cannot be acquired if the new LeaderElector instance uses the same holder identity because a leadership change is identified by a change in the holder identity (see LeaderElector:247).

This sounds like a flaw in the current implementation, doesn't it? The holder identity should be owned by the LeaderElector if the instance shouldn't be reusable? ...to prevent scenarios where the identity is reused. 🤔

to be honest, that's the scenario we have in Flink right now. Changing this would require a bigger refactoring on our side. 😇

@shawkins
Copy link
Contributor

@XComp I may not be following. By re-instanciate I'm assuming that you mean create a new instace of the LeaderElector after the previous one has stopped. There are two cases:

  1. Use a same identity

The previous elector will leave things either in a state where the lock is still held (cancelled - but not released, jvm crashed), or it's no longer holding the lock.

If the lock is still held by same identity, the new elector will still make the initial calls for updateObserved and associated notifications as the observedRecord starts as null. The isLeader check will succeed and it will move from the aquire loop to the renew loop. At worst the lifecycle events such as onStopLeading and onStartLeading don't differentiate the corner cases - that is onStopLeading is always called even if releaseOnCancel is false, etc.

If the lock is held by a different identity, the new elector will enter the aquire loop and just keep trying as per usual.

  1. Use the different identity

The only difference here is in the lock for some reason is still held by the old elector. You'll have to wait for the lease renewal duration to expire before a new leader (including the new elector) can take over.

If you think there's some other case, please open a new issue for that.

@wangyang0918
Copy link
Contributor

wangyang0918 commented Jan 19, 2024

I think @shawkins is right. LeaderElector could work as expected when creating a new instance with same lock identity.

But I still believe that LeaderElector needs to be run more than once just like the behavior in v5.12.4. If it is designed for only run once, then we need the sanity check in the run() to throw exception and make it explicit in the java doc.

@shawkins
Copy link
Contributor

If it is designed for only run once, then we need the sanity check in the run() to throw exception and make it explicit in the java doc.

Agreed - that needs to happen whether we pursue making it restartable or not.

shawkins added a commit to shawkins/kubernetes-client that referenced this issue Feb 6, 2024
manusa pushed a commit that referenced this issue Feb 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Waiting on feedback Issues that require feedback from User/Other community members
Projects
None yet
Development

Successfully merging a pull request may close this issue.

5 participants