Xen project Mailing List

Re: [PATCH v3 07/14 RESEND] cpufreq: Export HWP parameters to userspace

Date: Thu, 11 May 2023 16:10:51 +0200

Cc: Andrew Cooper <andrew.cooper3@xxxxxxxxxx>, Roger Pau Monné <roger.pau@xxxxxxxxxx>, Wei Liu <wl@xxxxxxx>, George Dunlap <george.dunlap@xxxxxxxxxx>, Julien Grall <julien@xxxxxxx>, Stefano Stabellini <sstabellini@xxxxxxxxxx>, xen-devel@xxxxxxxxxxxxxxxxxxxx

Delivery-date: Thu, 11 May 2023 14:11:15 +0000

List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

On 11.05.2023 15:49, Jason Andryuk wrote:
> On Thu, May 11, 2023 at 2:21 AM Jan Beulich <jbeulich@xxxxxxxx> wrote:
>>
>> On 10.05.2023 19:49, Jason Andryuk wrote:
>>> On Mon, May 8, 2023 at 6:26 AM Jan Beulich <jbeulich@xxxxxxxx> wrote:
>>>>
>>>> On 01.05.2023 21:30, Jason Andryuk wrote:
>>>>> Extend xen_get_cpufreq_para to return hwp parameters.  These match the
>>>>> hardware rather closely.
>>>>>
>>>>> We need the features bitmask to indicated fields supported by the actual
>>>>> hardware.
>>>>>
>>>>> The use of uint8_t parameters matches the hardware size.  uint32_t
>>>>> entries grows the sysctl_t past the build assertion in setup.c.  The
>>>>> uint8_t ranges are supported across multiple generations, so hopefully
>>>>> they won't change.
>>>>
>>>> Still it feels a little odd for values to be this narrow. Aiui the
>>>> scaling_governor[] and scaling_{max,min}_freq fields aren't (really)
>>>> used by HWP. So you could widen the union in struct
>>>> xen_get_cpufreq_para (in a binary but not necessarily source compatible
>>>> manner), gaining you 6 more uint32_t slots. Possibly the somewhat oddly
>>>> placed scaling_cur_freq could be included as well ...
>>>
>>> The values are narrow, but they match the hardware.  It works for HWP,
>>> so there is no need to change at this time AFAICT.
>>>
>>> Do you want me to make this change?
>>
>> Well, much depends on what these 8-bit values actually express (I did
>> raise this question in one of the replies to your patches, as I wasn't
>> able to find anything in the SDM). That'll then hopefully allow to
>> make some educated prediction on on how likely it is that a future
>> variant of hwp would want to widen them.
>
> Sorry for not providing a reference earlier.  In the SDM,
> HARDWARE-CONTROLLED PERFORMANCE STATES (HWP) section, there is this
> second paragraph:
> """
> In contrast, HWP is an implementation of the ACPI-defined
> Collaborative Processor Performance Control (CPPC), which specifies
> that the platform enumerates a continuous, abstract unit-less,
> performance value scale that is not tied to a specific performance
> state / frequency by definition. While the enumerated scale is roughly
> linear in terms of a delivered integer workload performance result,
> the OS is required to characterize the performance value range to
> comprehend the delivered performance for an applied workload.
> """
>
> The numbers are "continuous, abstract unit-less, performance value."
> So there isn't much to go on there, but generally, smaller numbers
> mean slower and bigger numbers mean faster.
>
> Cross referencing the ACPI spec here:
> https://uefi.org/specs/ACPI/6.5/08_Processor_Configuration_and_Control.html#collaborative-processor-performance-control
>
> Scrolling down you can find the register entries such as
>
> Highest Performance
> Register or DWORD Attribute:  Read
> Size:                         8-32 bits
>
> AMD has its own pstate implementation that is similar to HWP.  Looking
> at the Linux support, the AMD hardware also use 8 bit values for the
> comparable fields:
> https://elixir.bootlin.com/linux/latest/source/arch/x86/include/asm/msr-index.h#L612
>
> So Intel and AMD are 8bit for now at least.  Something could do 32bits
> according to the ACPI spec.
>
> 8 bits of granularity for slow to fast seems like plenty to me.  I'm
> not sure what one would gain from 16 or 32 bits, but I'm not designing
> the hardware.  From the earlier xenpm output, "highest" was 49, so
> still a decent amount of room in an 8 bit range.

Hmm, thanks for the pointers. I'm still somewhat undecided. I guess I'm
okay with you keeping things as you have them. If and when needed we can
still rework the structure - it is possible to change it as it's (for the
time being at least) still an unstable interface.

Jan
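
For readers following the size argument above, here is a minimal sketch of the trade-off being discussed. It is not the patch's code: the struct names, field names, and the 16-byte budget are all hypothetical, chosen only to show how a build-time size assertion accepts 8-bit fields and would reject the same fields widened to 32 bits.

```c
#include <assert.h>   /* static_assert (C11) */
#include <stdint.h>

/*
 * Hypothetical layouts for illustration only. These are NOT the actual
 * Xen structures, and the field names are assumptions.
 */
struct hwp_para_narrow {
    uint16_t features;      /* bitmask of fields the hardware supports */
    uint8_t  lowest;
    uint8_t  most_efficient;
    uint8_t  guaranteed;
    uint8_t  highest;
    uint8_t  minimum;
    uint8_t  maximum;
    uint8_t  desired;
    uint8_t  energy_perf;
};

/* The same fields, each widened to 32 bits. */
struct hwp_para_wide {
    uint32_t features;
    uint32_t lowest;
    uint32_t most_efficient;
    uint32_t guaranteed;
    uint32_t highest;
    uint32_t minimum;
    uint32_t maximum;
    uint32_t desired;
    uint32_t energy_perf;
};

/* Hypothetical space budget, standing in for the real build assertion. */
#define HYPOTHETICAL_BUDGET 16

/* The narrow layout passes the build-time check. */
static_assert(sizeof(struct hwp_para_narrow) <= HYPOTHETICAL_BUDGET,
              "narrow layout must fit the budget");

/* Run-time view of the same comparisons: 1 if the layout fits, else 0. */
static inline int narrow_fits(void)
{
    return sizeof(struct hwp_para_narrow) <= HYPOTHETICAL_BUDGET;
}

static inline int wide_fits(void)
{
    return sizeof(struct hwp_para_wide) <= HYPOTHETICAL_BUDGET;
}
```

Under these assumptions narrow_fits() returns 1 and wide_fits() returns 0, because nine uint32_t fields occupy at least 36 bytes. Swapping the wide struct into the static_assert would therefore stop the build, which is the kind of failure the patch description attributes to uint32_t entries.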
