[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [PATCH 0/2] xen/mm: limit in-place scrubbing
- To: Andrew Cooper <andrew.cooper3@xxxxxxxxxx>
- From: Roger Pau Monné <roger.pau@xxxxxxxxxx>
- Date: Fri, 9 Jan 2026 13:31:14 +0100
- Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=citrix.com; dmarc=pass action=none header.from=citrix.com; dkim=pass header.d=citrix.com; arc=none
- Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector10001; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=f4Cf8CU1ruHvAsJmqchLSN1yCpGtBp9l387ayotkjqc=; b=UldFIDfC2HDJQFFmu0wXb9pXMQJXuyXLHsZ7fwe4R+0pBtdzTD/K+FTlL/yp4njF5poUH80kthvOAQS2FywXmvrIhOrWivH1UY6qVkOgM8DKiCa1F5/oiAowmoCb0CNDf29vwhzp1jABe0Raz4KT7p/32tNkL9r9lCw691V9kauDby36bbpovTIla9HGOouD65xuC0TP87YTMZbb9hoyuEWuxsClUBwEhpnEpXUCY/crTH43RWTURFcKJwl68ynZYlVCbvAkwRz3GCv7wd/iK5xInVRpURRpUWDhFynIET7qLuE8p1CwF0YdcXTA5Hqk+QAkkdlmWhZyGigezVFpKQ==
- Arc-seal: i=1; a=rsa-sha256; s=arcselector10001; d=microsoft.com; cv=none; b=Jmuz4G+qcEy/thCqChmPr+6SCb9AnzYTbXz5cPxU+tDTlTkvf1NKw9Vz0OQwb5NfzuCeS+cD+e9LEv6hmlN++9rMMdwGInD43ESRCWgCRZyCsaP3WUKw5F/MS5/NdtLfewC6VUrx/m8IuiTvHBL7iixc5AnyU7xAOXjgr7ZoRO7HBfD/E+K6o5q2A1D3z/inCBkKftSa1K8CJYBFlPjU7nMeZFWSlRl0Z+X28TwvbhIZte3bYII14eMzi0tPy6NSBffqESuIGjT/ClvDo7fkJltFXtn3MqHm+R/cmW2+pECTDDexeMBqdkcWZhHpD0aPSSQgCdcCjSt0iknyiF5s3A==
- Authentication-results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=citrix.com;
- Cc: Jan Beulich <jbeulich@xxxxxxxx>, Stefano Stabellini <sstabellini@xxxxxxxxxx>, Julien Grall <julien@xxxxxxx>, Bertrand Marquis <bertrand.marquis@xxxxxxx>, Michal Orzel <michal.orzel@xxxxxxx>, Volodymyr Babchuk <Volodymyr_Babchuk@xxxxxxxx>, Anthony PERARD <anthony.perard@xxxxxxxxxx>, xen-devel@xxxxxxxxxxxxxxxxxxxx
- Delivery-date: Fri, 09 Jan 2026 12:31:30 +0000
- List-id: Xen developer discussion <xen-devel.lists.xenproject.org>
On Fri, Jan 09, 2026 at 10:29:20AM +0000, Andrew Cooper wrote:
> On 09/01/2026 10:15 am, Jan Beulich wrote:
> > On 08.01.2026 18:55, Roger Pau Monne wrote:
> >> In XenServer we have seen the watchdog occasionally triggering during
> >> domain creation if 1GB pages are scrubbed in-place during physmap
> >> population.
> > That's pretty extreme - writing to 1Gb of memory can't really take over 5s,
> > can it?
>
> Sure it can.
>
> > Is there lock contention involved?
>
> Almost certainly, and it's probably the more relevant aspect in this case.
Possibly. I can tell Edwin to give me his reproduction. There's also
the map_domain_page() page aspect of this operation. On big enough
systems this will cause a fair amount of stress to the map cache,
since each page is mapped, scrubbed and unmapped. I don't think
however the systems on which we have seen this to be using the map
cache (it was on debug=n builds with less than 5TB of memory).
> > Or is this when very many CPUs
> > try to do the same in parallel?
>
> The scenario is reboot of a VM when Xapi is doing NUMA placement using
> per-node claims.
Not exclusively. We have reports of this also happening without any
claims or NUMA placements being used.
AFAICT it's possibly triggered when doing reboots of multiple VMs in
parallel, and all reports of it I've seen it's on multi-node NUMA
systems. I wonder if scrubbing a 1G remote page in 4K chunks is
killing the intra-node bandwidth.
Thanks, Roger.
|