[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [BUG] Linux pvh vm not getting destroyed on shutdown


  • To: Elliott Mitchell <ehem+xen@xxxxxxx>
  • From: Roger Pau Monné <roger.pau@xxxxxxxxxx>
  • Date: Mon, 15 Feb 2021 10:00:23 +0100
  • Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=citrix.com; dmarc=pass action=none header.from=citrix.com; dkim=pass header.d=citrix.com; arc=none
  • Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=7A+SDUPTUJPa7Mxd4brTfMtyHyQijV0QKoLVGPD+R+M=; b=ZU+kUphZlQtr58n5De6uPtqMCjT7QauY/qJ7V2F9dHGkjyQl1Ely7k0EUy0VGHGhVZ83vRvjoqx+tr+rzFJxOSzsmIjPAXNHZap0XeOY4nibtDw+0M1qHvzJk0yeuKaGWsaQYE+vNkWlIWhyK1WaSOO8cmBY8AV47DcaNeeLBELO0AXTr7cv/g4yn/mmPyJLQoSz1wBqTnyTtKT9KrujLRkelaGAIVpj/fuIj+nw1XRIoBB69F/Gu0XKV4+v9nycltDbhqQ4mRbvsot3wAA9IDcIe2pw8eGT4kl1//8PoqC3COxVoVH7PV2tcvknOV9x+ErULv89XZz86XuszLckFQ==
  • Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=Iv1oEYxpb3vD8PujlyCM50RQz2p1n1AxWvg+iOCbgI9XzzDv9KjEmwJdRnAT6mG8RgNHpXkHpaA6GJOjgCpE4Tea5CEfjs7Q84T5ypc4zEzBv1sdWzx0DcWtvZQ5PmkQbzI3FlvBXYVPGwEs7ZqRJApj2EtyKt+YiT2FbM/wmkmwHsuQCubw7Y+odKCzlHlsfjvdzYDRZeLEMkPL6zllRu3/HBQi1v+lD2eXqv3O+EqtvoHkmmGjyRRrgueQywt0Tj0/qAFK9CW+8DayrnrCG0CXI1Tb7MB6EfzM1MmhlNsCl9ZH3uGiNdRUZCV35Pu0+CwF4ta/7HROVk+ACfYN+g==
  • Authentication-results: esa3.hc3370-68.iphmx.com; dkim=pass (signature verified) header.i=@citrix.onmicrosoft.com
  • Cc: Maximilian Engelhardt <maxi@xxxxxxxxxxxxx>, <xen-devel@xxxxxxxxxxxxxxxxxxxx>, <pkg-xen-devel@xxxxxxxxxxxxxxxxxxxxxxx>
  • Delivery-date: Mon, 15 Feb 2021 09:00:41 +0000
  • Ironport-sdr: Kz0FDnN74ipmbbRLVWAdsyGKWrbcL13JdKO9l895DUv5f/HKTdEPf/zXrjGpxrRgda5bEZiJUp UxpMZjInSjKYLMD/pY69lYs/ZH2hKJvw0jK0yrCjprJ7b6uXZIm5J4DbDa4xIcQmEyuJRaEjFr X4ftgsdLDaR/N7QiiJXVCWNlTH9Ok0EpmCHU11VCXFiZqwmU9xdRgOxwLUNtyiEpQqIjoUov4K 31fACMFXP2tXNsqtvs+5aKBGeEW9rwPMlkS0iVhA0mjM1PmNopw0L7queliu5TVU27uSiIuo1v 0v0=
  • List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

On Sun, Feb 14, 2021 at 07:27:46PM -0800, Elliott Mitchell wrote:
> On Sun, Feb 14, 2021 at 11:45:47PM +0100, Maximilian Engelhardt wrote:
> > On Samstag, 13. Februar 2021 19:21:56 CET Elliott Mitchell wrote:
> > > On Sat, Feb 13, 2021 at 04:36:24PM +0100, Maximilian Engelhardt wrote:
> > > > * The issue started with Debian kernel 5.8.3+1~exp1 running in the vm,
> > > > Debian kernel 5.7.17-1 does not show the issue.
> > > 
> > > I think the first kernel update during which I saw the issue was around
> > > linux-image-4.19.0-12-amd64 or linux-image-4.19.0-13-amd64.  I think
> > > the last security update to the Xen packages was in a similar timeframe
> > > though.  Rate this portion as unreliable though.  I can definitely state
> > > this occurs with Debian's linux-image-4.19.0-13-amd64 and kernels built
> > > from corresponding source, this may have shown earlier.
> > 
> > We don't see any issues with the current Debian buster (Debian stable) 
> > kernel 
> > (4.19.0-14-amd64 #1 SMP Debian 4.19.171-2 (2021-01-30) x86_64 GNU/Linux) 
> > and 
> > also did not notice any issues with the older kernel packages in buster. 
> > Also 
> > the security update of xen in buster did not cause any behavior change for 
> > us. 
> > In our case everything in buster is working as we expect it to work (using 
> > latest updates and security updates).
> 
> I can't really say much here.  I keep up to date and I cannot point to a
> key ingredient as the one which caused this breakage.
> 
> 
> > > Fresh observation.  During a similar timeframe I started noticing VM
> > > creation leaving a `xl create` process behind.  I had discovered this
> > > process could be freely killed without appearing to effect the VM and had
> > > thus been doing so (memory in a lean Dom0 is precious).
> > > 
> > > While typing this I realized there was another scenario I needed to try.
> > > Turns out if I boot PV GRUB and get to its command-line (press 'c'), then
> > > get away from the VM console, kill the `xl create` process, return to
> > > the console and type "halt".  This results in a hung VM.
> > > 
> > > Are you perhaps either killing the `xl create` process for effected VMs,
> > > or migrating the VM and thus splitting the `xl create` process from the
> > > effected VMs?
> > > 
> > > This seems more a Debian issue than a Xen Project issue right now.
> > 
> > We don't migrate the vms, we don't kill any processes running on the dom0 
> > and 
> > I don't see anything in our logs indicating something gets killed on the 
> > dom0. 
> > On our systems the running 'xl create' processes only use very little 
> > memory.
> > 
> > Have you tried if you still observer your hangs if you don't kill the xl 
> > processes?
> 
> That is exactly what I pointed to above.  On stable killing the
> mysterious left behind `xl create` process causes the problem to
> manifest, while leaving it undisturbed appears to makes the problem not
> manifest.

You cannot kill the 'xl create' process, or else events for the domain
(like shutdown) won't be handled by the toolstack, and thus the domain
won't be destroyed when the guest shuts down. The same would happen if
the guest ties to reboot, it won't work properly because the reboot
request won't be handled by the toolstack as you have just killed the
xl process that's in charge of doing it.

Roger.



 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.