Xen project Mailing List

Re: [PATCH 3/4] xen/version: Drop bogus return values for XENVER_platform_parameters

From: Andrew Cooper <Andrew.Cooper3@xxxxxxxxxx>

Date: Fri, 6 Jan 2023 12:14:31 +0000

Accept-language: en-GB, en-US

Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=citrix.com; dmarc=pass action=none header.from=citrix.com; dkim=pass header.d=citrix.com; arc=none

Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=fCGwVpkqYvY+ebNdIlosFz/zHpTs09CKo5DcQO4LugE=; b=mnvCVmDHDXZWfDmHXxQLe3PXF63ajX/wS+b6uEiWIC4QdXcRP9i2EYcSffdZL+KNgG6TCeFjRoYDB0uCFtGNkuwKv1lC8EMxP/evgwTqn1llQHd3oQKct5J9QodSNI9lP2ex6y+qsHyZb8G6M7awI031ojFWqhYB0WtAaRQ2J0yav6bmT5XKNABKSQuT488mq/Y5BXpAE7Axbuwf9kk0/APE8OknSkl7huz+BUb+QWG/OssJvOJXW4J3F7Q2rCKPL+viXuaX62HZ0YSCjDZOIxDFU4Ge5FcVWWESNt4hMbOSSwIpwvcirDAlXcvVu/CfB3mU8b5+kqxPwwwTuLpxrQ==

Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=Gnzv7MAvU48SvKl6ofvDE2CqNYjtMDmtwPNEprPFcX5B/cUkYeO/3pxDwDUOxOXB/uX9zTzD4ssPckYMPcXHMaZ83aTgV+dZg6BJSSZqkrPJIYt8XMYoVu814f9iQYlHiutSvjeKxxtfBFh3FOlcNc4HW+z+vLCMmFZ50D6ABO9+0Lx6G27jDcTOpy2fUC2MU6tLB8GVNuE2PrG0EE3QSxilTXkTQ+Agv4nFmlM6sjczHRxJb8Bw0hEWY+MGxxOKgLngiax8EyA0eGEgE57EIkmZehUKseY7N5a8dY35i5yetf2rgr6oIqvicYeI9hLi2X+++AkTEvTAV76roGgbpw==

Authentication-results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=citrix.com;

Cc: George Dunlap <George.Dunlap@xxxxxxxxxx>, Stefano Stabellini <sstabellini@xxxxxxxxxx>, Wei Liu <wl@xxxxxxx>, Julien Grall <julien@xxxxxxx>, Xen-devel <xen-devel@xxxxxxxxxxxxxxxxxxxx>

Delivery-date: Fri, 06 Jan 2023 12:14:49 +0000

Ironport-data: A9a23:n7hKqqnsttR2muTDhFU+qeno5gxKJ0RdPkR7XQ2eYbSJt1+Wr1Gzt xJJC2HVO/aKNmT9c9p/aISy9EJSvJ7UndNgSApp/ig8ECMWpZLJC+rCIxarNUt+DCFhoGFPt JxCN4aafKjYaleG+39B55C49SEUOZmgH+a6U6icf3grHmeIcQ954Tp7gek1n4V0ttawBgKJq LvartbWfVSowFaYCEpNg064gE4p7aqaVA8w5ARkPqgS5AGGzRH5MbpETU2PByqgKmVrNrbSq 9brlNmR4m7f9hExPdKp+p6TnpoiG+O60aCm0xK6aoD66vRwjnVaPpUTbZLwXXx/mTSR9+2d/ f0W3XCGpaXFCYWX8AgVe0Ew/yiTpsSq8pefSZS0mZT7I0Er7xIAahihZa07FdRwxwp5PY1B3 aYEDg0dPlOGu/CRw4y9evtP3906F+C+aevzulk4pd3YJdAPZMmZBoD1v5pf1jp2gd1SF/HDY cZfcSBocBnLfxxIPBEQFY46m+CrwHL4dlW0qnrM/fZxvzeVklI3jOaF3Nn9I7RmQe18mEqCq 32A1GP+GhwAb/SUyCaf82LqjejK9c/+cNNLRO3iqKc76LGV7k0cSwQnZXSQmvKSm3yGQe4FL VdMoBN7+MDe82TuFLERRSaQonSJoxodUNp4CPAh5UeGza+8yx2CGmEOQzpFadonnMw7Xzon0 hmOhdyBLSNrmK2YTzSa7Lj8hTGvPSkYK0cSaClCShEKi/HzrYd2gh/RQ9JLFK+uksazCTz22 yqNriU1m/MUl8Fj6kmg1VXOgjbpo4eTSAcwv1/TRjj9sl0/Y5O5bYu171Sd9exHMIuSUliGu j4DhtSa6+cNS5qKkURhXdkwIV1g3N7dWBW0vLKlN8BJG+iFk5J7Qb1t3Q==

Ironport-hdrordr: A9a23:y5JW5a7IPuV9pSEKKQPXwHrXdLJyesId70hD6qm+c20wTiX4rb HcoB1/73SbtN9/YhEdcK+7SdO9qB/nlKKdgrNhTItKIjOW2ldARbsKheHfKlbbak7DH4BmpM Jdm6MXMqyMMbAT5/yX3OHSeexO/DFJmprEuc7ui05ICSVWQ+VY6QF9YzzrYnGfhmN9dOYE/F 733Ls4m9JkE05nEfhTfUN1ONTrlpnwjZf7ZhxDLAIm7QTmt0LS1JfKVyKA2wsYUXd04ZpKyx miryXJop+7tu29yFvn23TN449wkN/so+EzffCku4wuMzDxjQTtW4h7Qb2Fu1kO0ZmS1Go=

List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

Thread-index: AQHZH69XYhd2rMGoWE+asSBKOPPG4q6OdxyAgAA2ioCAAMm+gIAA8B2AgAChQoCAAEi6gA==

Thread-topic: [PATCH 3/4] xen/version: Drop bogus return values for XENVER_platform_parameters

On 06/01/2023 7:54 am, Jan Beulich wrote: > On 05.01.2023 23:17, Andrew Cooper wrote: >> On 05/01/2023 7:57 am, Jan Beulich wrote: >>> On 04.01.2023 20:55, Andrew Cooper wrote: >>>> On 04/01/2023 4:40 pm, Jan Beulich wrote: >>>>> On 03.01.2023 21:09, Andrew Cooper wrote: >>>>>> A split in virtual address space is only applicable for x86 PV guests. >>>>>> Furthermore, the information returned for x86 64bit PV guests is wrong. >>>>>> >>>>>> Explain the problem in version.h, stating the other information that PV >>>>>> guests >>>>>> need to know. >>>>>> >>>>>> For 64bit PV guests, and all non-x86-PV guests, return 0, which is >>>>>> strictly >>>>>> less wrong than the values currently returned. >>>>> I disagree for the 64-bit part of this. Seeing Linux'es exposure of the >>>>> value in sysfs I even wonder whether we can change this like you do for >>>>> HVM. Who knows what is being inferred from the value, and by whom. >>>> Linux's sysfs ABI isn't relevant to us here. The sysfs ABI says it >>>> reports what the hypervisor presents, not that it will be a nonzero number. >>> It effectively reports the hypervisor (virtual) base address there. How >>> can we not care if something (kexec would come to mind) would be using >>> it for whatever purpose. >> What about kexec do you think would care? > I didn't think about anything specific, but I could see why it may want to > know where in VA space Xen sits. The kexec image doesn't run "under" Xen; it replaces Xen in memory, and transition into the new image is via no paging (32bit) or identity paging (64bit) in the reserved region. We don't really support kexec load (it's there, but I don't expect anyone has exercised it in anger), but if we were to load a new Xen+dom0, then kexec-tools still has nothing relevant to do with new-Xen+dom0's split. >>>>>> + * For all guest types using hardware virt extentions, Xen is not >>>>>> mapped into >>>>>> + * the guest kernel virtual address space. This now return 0, where it >>>>>> + * previously returned unrelated data. >>>>>> + */ >>>>>> #define XENVER_platform_parameters 5 >>>>>> struct xen_platform_parameters { >>>>>> xen_ulong_t virt_start; >>>>> ... the field name tells me that all that is being conveyed is the virtual >>>>> address of where the hypervisor area starts. >>>> IMO, it doesn't matter what the name of the field is. It dates from the >>>> days when 32bit PV was the only type of guest. >>>> >>>> 32bit PV guests really do have a variable split, so the guest kernel >>>> really does need to get this value from Xen. >>>> >>>> The split for 64bit PV guests is compile-time constant, hence why 64bit >>>> PV kernels don't care. >>> ... once we get to run Xen in 5-level mode, 4-level PV guests could also >>> gain a variable split: Like for 32-bit guests now, only the r/o M2P would >>> need to live in that area, and this may well occupy less than the full >>> range presently reserved for the hypervisor. >> ... you can't do this, because it only works for guests which have >> chosen to find the M2P using XENMEM_machphys_mapping (e.g. Linux), and >> doesn't for e.g. MiniOS which does: >> >> #define machine_to_phys_mapping ((unsigned long *)HYPERVISOR_VIRT_START) > Hmm, looks like a misunderstanding? I certainly wasn't thinking about > making the start of that region variable, but rather the end (i.e. not > exactly like for 32-bit compat). But for what purpose? You can't make 4-level guests have a variable range. The best you can do is make a 5-level-aware guest running on 4-level Xen have some new semantics, but a 4-level PV guest already owns the overwhelming majority of virtual address space, so being able to hand back a few extra TB is not interesting. And for making that happen... >>>> For compat HVM, it happens to pick up the -1 from: >>>> >>>> #ifdef CONFIG_PV32 >>>> HYPERVISOR_COMPAT_VIRT_START(d) = >>>> is_pv_domain(d) ? __HYPERVISOR_COMPAT_VIRT_START : ~0u; >>>> #endif >>>> >>>> in arch_domain_create(), whereas for non-compat HVM, it gets a number in >>>> an address space it has no connection to in the slightest. ARM guests >>>> end up getting XEN_VIRT_START (== 2M) handed back, but this absolutely >>>> an internal detail that guests have no business knowing. >>> Well, okay, this looks to be good enough an argument to make the adjustment >>> you propose for !PV guests. >> Right, HVM (on all architectures) is very cut and dry. >> >> But it feels wrong to not address the PV64 issue at the same time >> because it is similar level of broken, despite there being (in theory) a >> legitimate need for a PV guest kernel to know it. > To me it feels wrong to address the 64-bit PV issue by removing information, > when - as you also say - it is actually _missing_ information. To me the > proper course of action would be to expose the upper bound as well (such > that, down the road, it could become dynamic). There's also no info leak > there, as the two (static) bounds are part of the PV ABI anyway. ... the absolute best you could do here is introduce a new XENVER_platform_parameters2 that has a different structure. Which still leaves us with XENVER_platform_parameters providing data which is somewhere between useless and actively unhelpful. ~Andrew

©2013 Xen Project, A Linux Foundation Collaborative Project. All Rights Reserved.
Linux Foundation is a registered trademark of The Linux Foundation.
Xen Project is a trademark of The Linux Foundation.