
Re: [PATCH] x86/CPUID: surface suitable value in EBX of XSTATE subleaf 1


  • To: Andrew Cooper <Andrew.Cooper3@xxxxxxxxxx>
  • From: Jan Beulich <jbeulich@xxxxxxxx>
  • Date: Tue, 23 Aug 2022 14:01:18 +0200
  • Cc: Wei Liu <wl@xxxxxxx>, Roger Pau Monne <roger.pau@xxxxxxxxxx>, Marek Marczykowski-Górecki <marmarek@xxxxxxxxxxxxxxxxxxxxxx>, Thomas Gleixner <tglx@xxxxxxxxxxxxx>, "xen-devel@xxxxxxxxxxxxxxxxxxxx" <xen-devel@xxxxxxxxxxxxxxxxxxxx>
  • Delivery-date: Tue, 23 Aug 2022 12:01:34 +0000
  • List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

On 23.08.2022 12:48, Andrew Cooper wrote:
> On 23/08/2022 10:27, Jan Beulich wrote:
>> On 23.08.2022 10:59, Andrew Cooper wrote:
>>> On 23/08/2022 07:42, Jan Beulich wrote:
>>>> exposed to PV domains.
>>>>
>>>> Considering that the size reported is that of the compacted save area,
>>>> I view Linux'es assumption as appropriate (short of the SDM properly
>>>> considering the case). Therefore we need to populate the field also when
>>>> only XSAVEC is supported for a guest.
>>> This is a mess.  The SDM is fairly clear (but only in Vol1) that this
>>> leaf is specific to XSAVES.
>> The way it's written, my assumption is that they simply didn't care about
>> XSAVEC when writing this, or they were assuming that both features would
>> always be supported together (yet even if they are in Intel's hardware,
>> the architecture should spell out things as if both were entirely
>> independent, or it should specify that one takes the other as a prereq).
> 
> Real hardware has XSAVEC == XSAVES on Intel (Skylake) and AMD (Zen1). 
> Despite an attempt to separate the parts of the ISA, they are
> inextricably linked.
> 
> It is only under virt that we get XSAVEC without XSAVES.
> 
>>>> Fixes: 460b9a4b3630 ("x86/xsaves: enable xsaves/xrstors for hvm guest")
>>>> Fixes: 8d050ed1097c ("x86: don't expose XSAVES capability to PV guests")
>>>> Signed-off-by: Jan Beulich <jbeulich@xxxxxxxx>
>>> CC Marek.  Looks like Jan has found the issue you reported on IRC.
>>>
>>> Jan: Be aware that I submitted
>>> https://lore.kernel.org/lkml/20220810221909.12768-1-andrew.cooper3@xxxxxxxxxx/
>>> to Linux to correct some of the diagnostics.
>>>> --- a/xen/arch/x86/cpuid.c
>>>> +++ b/xen/arch/x86/cpuid.c
>>>> @@ -1142,7 +1142,7 @@ void guest_cpuid(const struct vcpu *v, u
>>>>          switch ( subleaf )
>>>>          {
>>>>          case 1:
>>>> -            if ( p->xstate.xsaves )
>>>> +            if ( p->xstate.xsavec || p->xstate.xsaves )
>>> If we're doing this, then it wants to be xsavec only, with the comment
>>> being extended to explain why.
>> Why would that be? Both insns use compacted format, and neither is
>> dependent upon the other in terms of being supported. IOW XSAVES alone
>> and XSAVEC alone enabled for a domain should still lead through this
>> path.
> 
> Hmm.  Because my fixes to compaction handling haven't been committed
> yet, and in particular the one which makes XSAVES strictly depend on
> XSAVEC.
> 
> In which case this hunk is correct for Xen as it currently is, and will
> need to be adjusted when I rebase the compaction series.

May I translate this to an Ack then? Iirc there were no other change
requests.
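
For onlookers: a standalone sketch of the compacted-size calculation (not
the verbatim Xen code; the helper and array names are made up). CPUID leaf
0xD subleaf 1 EBX is meant to report the size of the compacted layout,
which both XSAVEC and XSAVES produce:

#include <stdint.h>
#include <stdbool.h>

#define XSTATE_FP_SSE_SIZE 512u   /* legacy x87/SSE region */
#define XSAVE_HDR_SIZE      64u   /* XSAVE header */

/*
 * "enabled" is the guest's XCR0 (ORed with XSS once supervisor states
 * are offered); comp_size[i] / comp_align64[i] stand in for the
 * per-component values normally read from CPUID.0xD[i].EAX and the
 * alignment bit in .ECX.
 */
static uint32_t compacted_size(uint64_t enabled,
                               const uint32_t comp_size[],
                               const bool comp_align64[],
                               unsigned int nr_comp)
{
    uint32_t size = XSTATE_FP_SSE_SIZE + XSAVE_HDR_SIZE;
    unsigned int i;

    /* Components 0 and 1 (x87/SSE) live in the legacy region above. */
    for ( i = 2; i < nr_comp; i++ )
    {
        if ( !(enabled & (1ull << i)) )
            continue;
        if ( comp_align64[i] )
            size = (size + 63) & ~63u;   /* 64-byte alignment, if requested */
        size += comp_size[i];
    }

    return size;
}

Nothing in this calculation depends on XSAVES specifically, which is why
the field wants populating for XSAVEC-only guests as well.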

>>> But this is going to further complicate my several-year-old series
>>> trying to get Xen's XSTATE handling into a position where we can start
>>> to offer supervisor states.
>> Where do you see further complication? The necessary fiddling with XSS
>> here would of course be dependent upon p->xstate.xsaves alone (or,
>> maybe better, on the set of enabled features in XSS being non-empty),
>> but that's simply another (inner) if().
>>
>> As an aside, I actually wonder what use the supplied size is to user
>> mode code when any XSS-controlled feature is enabled: They'd allocate
>> a needlessly large block of memory, as they would only be able to use
>> XSAVEC.
> 
> This field is an already known kernel=>user infoleak.  There are threads
> about it on LKML.
> 
> But it does highlight another problem.  This change does not fix Linux
> on AMD Zen3 hardware, where the kernel will find the CPUID value larger
> than it can calculate the size to be, because Xen's use of CET-SS will
> show up in the CPUID value.
> 
> Linux needs an adjustment from != to <= for this check.

I was wondering about that too, but if I'm not mistaken the change you
suggest goes in the opposite direction of what would appear to be safe
there (against overrunning buffers). Hence it may take more than just
flipping the comparison operator.
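
To make the concern concrete (purely illustrative, not Linux's actual
code; the names are made up): the kernel derives one size from the
features it intends to enable, and compares it against CPUID.0xD[1].EBX,
which under Xen can be larger because the hypervisor's own XSS bits
(CET-SS in the Zen3 case) are reflected in it.

#include <stdbool.h>
#include <stdint.h>

static bool xstate_size_ok(uint32_t kernel_calc, uint32_t cpuid_ebx)
{
    /*
     * Strict equality (the current check) trips over an over-reporting
     * hypervisor.  Relaxing it to <= accepts that case, but a buffer
     * sized from kernel_calc is then smaller than what CPUID claims the
     * save area may need, which is only safe if the CPU never writes
     * state the kernel didn't itself enable.
     */
    return kernel_calc <= cpuid_ebx;
}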

Jan
