SSH occasional disconnection after vMotion of a containerVM finishes #4484

Open
keramix opened this issue Mar 30, 2017 · 7 comments
Labels
area/ux, component/tether, kind/defect, priority/p4

Comments

@keramix
Contributor

keramix commented Mar 30, 2017

Details

If a user is attached to a containerVM that gets vMotioned, the SSH connection may be dropped after vMotion completes. The containerVM remains powered on, but the user is detached. This happens if the user is typing on stdin right before the containerVM enters the downtime state (usually between 58% and 72% of the vMotion progress).

Further investigation shows that the SSH server on the tether side closes the connection because of a MAC (Message Authentication Code) failure. This possibly means that a few bytes sent by the SSH client are not read by the SSH server while vMotion is happening, which leads to an incorrect SSH MAC. The logs are attached (crypto/ssh logging was enabled as well). The tether logs are the most relevant.
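To illustrate why a few unread bytes would be enough to trigger a MAC failure, here is a minimal sketch of the SSH integrity check as RFC 4253 describes it: the MAC is computed over the packet sequence number followed by the unencrypted packet. This is not the tether or crypto/ssh code; the key, the sequence number, the payload, and the helper names `ssh_mac` and `verify` are hypothetical, and HMAC-SHA256 stands in for whatever MAC the session actually negotiated.

```python
import hashlib
import hmac
import struct


def ssh_mac(key: bytes, seq: int, packet: bytes) -> bytes:
    """MAC over (uint32 sequence number || packet), as in RFC 4253 section 6.4."""
    return hmac.new(key, struct.pack(">I", seq) + packet, hashlib.sha256).digest()


def verify(key: bytes, seq: int, packet: bytes, tag: bytes) -> bool:
    """Recompute the MAC on the receiving side and compare in constant time."""
    return hmac.compare_digest(ssh_mac(key, seq, packet), tag)


# Hypothetical session values, for illustration only.
key = b"k" * 32
seq = 7
packet = b"keystrokes typed during vMotion"

# The client computes the tag over the full packet it sent.
tag = ssh_mac(key, seq, packet)

# Server reads every byte: the check passes.
intact_ok = verify(key, seq, packet, tag)

# Server misses the last three bytes: the check fails and the
# connection would be closed.
truncated_ok = verify(key, seq, packet[:-3], tag)

print(intact_ok, truncated_ok)
```

Because the MAC covers every byte of the packet, the server cannot tolerate even a small loss, which is consistent with the disconnect seen in the tether logs if the hypothesis above holds.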

Steps to Reproduce

1- Run a container and attach to it
2- Start vMotion of the containerVM
3- Type random keys on the containerVM stdin while vMotion progress is between 58% and 72%

log10.zip

@keramix added the kind/defect label Mar 30, 2017
@mdubya66 added the impact/doc/note label Mar 31, 2017
@mdubya66
Contributor

This edge case needs a release note.

@mhagen-vmware
Contributor

please prioritize, estimate and move to backlog

@stuclem
Contributor

stuclem commented Apr 17, 2017

Proposed release note:


  • Occasional disconnection during vMotion. #4484
    If you are attached to a container VM that is migrated by vMotion, the SSH connection to the container VM might drop when vMotion completes.

@mhagen-vmware do we need to say anything more?

@mhagen-vmware
Contributor

lgtm

@mhagen-vmware
Contributor

Could also add repair steps: issue another docker attach after vMotion completes to reattach.
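A rough sketch of that repair step as a script, for anyone who wants to automate it. The helper name `attach_with_retry`, the retry count, and the delay are hypothetical, and treating a non-zero exit from `docker attach` as a dropped session is an assumption (it can also mean the container itself exited with that code).

```python
import subprocess
import time


def attach_with_retry(container, attempts=3, delay=2.0,
                      run=subprocess.call, sleep=time.sleep):
    """Re-run `docker attach` until it exits cleanly or attempts run out.

    Returns the attempt number that succeeded, or None if all failed.
    ASSUMPTION: a non-zero exit code means the session was dropped.
    """
    for attempt in range(1, attempts + 1):
        code = run(["docker", "attach", container])
        if code == 0:
            return attempt
        if attempt < attempts:
            # Give vMotion time to finish before reattaching.
            sleep(delay)
    return None
```

Calling `attach_with_retry("mycontainer")` would attach and, if the session drops, reattach up to two more times. The `run` and `sleep` parameters exist only so the loop can be exercised without a Docker endpoint.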

@stuclem
Contributor

stuclem commented Apr 17, 2017

Done. Thanks @mhagen-vmware.

@stuclem removed the impact/doc/note label Apr 17, 2017
@corrieb
Contributor

corrieb commented Jul 19, 2017

@stuclem The fact that SSH is being used as the transport is completely hidden from the client - it's purely an implementation detail. So we should phrase the note purely in terms of docker attach.
