[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: x86/HVM: Linux'es apic_pending_intr_clear() warns about stale IRR


  • To: Andrew Cooper <Andrew.Cooper3@xxxxxxxxxx>
  • From: Jan Beulich <jbeulich@xxxxxxxx>
  • Date: Thu, 3 Nov 2022 09:48:20 +0100
  • Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=suse.com; dmarc=pass action=none header.from=suse.com; dkim=pass header.d=suse.com; arc=none
  • Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=xi1pBCMH3iFyLLmMgDBJQHipe45LdC7rPoIOb0Jln7Q=; b=LfVIUS9bqcp+Rm0jtVH/DHgrJs++yyL8Gf4cDBVNpMeWLDeTzPfeJqJKH3tTeEkNNPRL+AhQW5moe1nCga46IeUsCCuZoVpGhPVkmx54Qe378vu29dwud4/87uOKlYHRKkP1OCS+bkUULODwW19lz4QRm/q5XTr06oJ748Z2x1e7L4RyoZE72C7YFyc8Uze66eNwr4fWqAyDtYGM7YNbHwa15x3Ss0wALQ1WgHvTRD5E5L+0w5ZrakXKoZkulRJu+V8bhbskIJWY9uMZha5fPNO5Qcoxo8bDqcmPqWTSlezwyVVhADPJkVS9mno9jU3uHIjVrggBeCVHkmUMvYcHGA==
  • Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=PjE8DBIj4LR+RT1euuh+lU+z422amGx86uauBCdOPfxaGIE9+ADF2HY++QUn2fn860xO/fbQ/qjHeTaIdTQpNtNooPGOzrdIMjRMwOPJhC5c7JaIrf4q3fv7dE6rVRVOTS7MsuTXGpxocSOu0HMKlmLoHQr5x+VMUOtbHKv5L5RpEheJBmtGFxG45nqrl5ALfjfkzwMYKpGF+x0rQweGKDPsPHgWX5KmVeospdzTLWFegVddGAyLZIM1OG/HDF2s858J52Ah1WqdAML3q8qBUAM2GVFQ5JnZhRigbEqqCejOrjpjpDupT/hqJV3K5eM9H5LWCdoVVrNMVW2H5QmP0g==
  • Authentication-results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=suse.com;
  • Cc: Roger Pau Monne <roger.pau@xxxxxxxxxx>, "xen-devel@xxxxxxxxxxxxxxxxxxxx" <xen-devel@xxxxxxxxxxxxxxxxxxxx>
  • Delivery-date: Thu, 03 Nov 2022 08:48:37 +0000
  • List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

On 31.10.2022 19:37, Andrew Cooper wrote:
> On 31/10/2022 15:55, Jan Beulich wrote:
>> Hello,
>>
>> quite likely this isn't new, but I've ended up noticing it only recently:
>> On an oldish system where I hand a HVM guest an SR-IOV NIC (not sure yet
>> whether that actually matters) all APs have that warning issued, with all
>> reported values zero except for the very first IRR one - that's 00080000.
>> Which is suspicious by itself, for naming vector 0x13, i.e. below 0x20
>> and hence within CPU exception range.
> 
> To be clear, these are the VM's APs ?

Yes. I'm now also pretty sure this is a Linux side issue, as I've verified
it to be new in 6.0. Debugging is complicated some by the host not being
very reliable anymore - the SR-IOV card has been producing floods of
corrected errors for a long time, but recently apparently uncorrected
errors have been occurring every now and then, resulting in a host reset.
But I'll keep trying to see if I can spot where this behavior is coming
from.

Jan



 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.