Xen project Mailing List

Re: [PATCH 1/8] x86/pdx: simplify calculation of domain struct allocation boundary

To: Roger Pau Monné <roger.pau@xxxxxxxxxx>

Date: Thu, 12 Jun 2025 14:00:55 +0200

Autocrypt: addr=jbeulich@xxxxxxxx; keydata= xsDiBFk3nEQRBADAEaSw6zC/EJkiwGPXbWtPxl2xCdSoeepS07jW8UgcHNurfHvUzogEq5xk hu507c3BarVjyWCJOylMNR98Yd8VqD9UfmX0Hb8/BrA+Hl6/DB/eqGptrf4BSRwcZQM32aZK 7Pj2XbGWIUrZrd70x1eAP9QE3P79Y2oLrsCgbZJfEwCgvz9JjGmQqQkRiTVzlZVCJYcyGGsD /0tbFCzD2h20ahe8rC1gbb3K3qk+LpBtvjBu1RY9drYk0NymiGbJWZgab6t1jM7sk2vuf0Py O9Hf9XBmK0uE9IgMaiCpc32XV9oASz6UJebwkX+zF2jG5I1BfnO9g7KlotcA/v5ClMjgo6Gl MDY4HxoSRu3i1cqqSDtVlt+AOVBJBACrZcnHAUSuCXBPy0jOlBhxPqRWv6ND4c9PH1xjQ3NP nxJuMBS8rnNg22uyfAgmBKNLpLgAGVRMZGaGoJObGf72s6TeIqKJo/LtggAS9qAUiuKVnygo 3wjfkS9A3DRO+SpU7JqWdsveeIQyeyEJ/8PTowmSQLakF+3fote9ybzd880fSmFuIEJldWxp Y2ggPGpiZXVsaWNoQHN1c2UuY29tPsJgBBMRAgAgBQJZN5xEAhsDBgsJCAcDAgQVAggDBBYC AwECHgECF4AACgkQoDSui/t3IH4J+wCfQ5jHdEjCRHj23O/5ttg9r9OIruwAn3103WUITZee e7Sbg12UgcQ5lv7SzsFNBFk3nEQQCACCuTjCjFOUdi5Nm244F+78kLghRcin/awv+IrTcIWF hUpSs1Y91iQQ7KItirz5uwCPlwejSJDQJLIS+QtJHaXDXeV6NI0Uef1hP20+y8qydDiVkv6l IreXjTb7DvksRgJNvCkWtYnlS3mYvQ9NzS9PhyALWbXnH6sIJd2O9lKS1Mrfq+y0IXCP10eS FFGg+Av3IQeFatkJAyju0PPthyTqxSI4lZYuJVPknzgaeuJv/2NccrPvmeDg6Coe7ZIeQ8Yj t0ARxu2xytAkkLCel1Lz1WLmwLstV30g80nkgZf/wr+/BXJW/oIvRlonUkxv+IbBM3dX2OV8 AmRv1ySWPTP7AAMFB/9PQK/VtlNUJvg8GXj9ootzrteGfVZVVT4XBJkfwBcpC/XcPzldjv+3 HYudvpdNK3lLujXeA5fLOH+Z/G9WBc5pFVSMocI71I8bT8lIAzreg0WvkWg5V2WZsUMlnDL9 mpwIGFhlbM3gfDMs7MPMu8YQRFVdUvtSpaAs8OFfGQ0ia3LGZcjA6Ik2+xcqscEJzNH+qh8V m5jjp28yZgaqTaRbg3M/+MTbMpicpZuqF4rnB0AQD12/3BNWDR6bmh+EkYSMcEIpQmBM51qM EKYTQGybRCjpnKHGOxG0rfFY1085mBDZCH5Kx0cl0HVJuQKC+dV2ZY5AqjcKwAxpE75MLFkr wkkEGBECAAkFAlk3nEQCGwwACgkQoDSui/t3IH7nnwCfcJWUDUFKdCsBH/E5d+0ZnMQi+G0A nAuWpQkjM1ASeQwSHEeAWPgskBQL

Cc: Andrew Cooper <andrew.cooper3@xxxxxxxxxx>, xen-devel@xxxxxxxxxxxxxxxxxxxx

Delivery-date: Thu, 12 Jun 2025 12:01:13 +0000

List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

On 12.06.2025 12:46, Roger Pau Monné wrote: > On Thu, Jun 12, 2025 at 11:03:21AM +0200, Jan Beulich wrote: >> On 11.06.2025 19:16, Roger Pau Monne wrote: >>> @@ -498,14 +474,15 @@ struct domain *alloc_domain_struct(void) >>> * On systems with CONFIG_BIGMEM there's no packing, and so there's no >>> * such restriction. >>> */ >>> -#if defined(CONFIG_BIGMEM) || !defined(CONFIG_PDX_COMPRESSION) >>> - const unsigned int bits = IS_ENABLED(CONFIG_BIGMEM) ? 0 : >>> - 32 + PAGE_SHIFT; >>> +#if defined(CONFIG_BIGMEM) >>> + const unsigned int bits = 0; >>> #else >>> - static unsigned int __read_mostly bits; >>> + static unsigned int __ro_after_init bits; >>> >>> if ( unlikely(!bits) ) >>> - bits = _domain_struct_bits(); >>> + bits = flsl(pfn_to_paddr(pdx_to_pfn( >>> + 1UL << (sizeof(((struct page_info *)NULL)->v.inuse._domain) * >>> 8)))) >>> + - 1; >> >> While Andrew did point you at sizeof_field(), we can have this even less >> verbose >> by utilizing that frame_table is of the right type and (almost) globally in >> scope. >> >> Further, why use pfn_to_paddr()? >> >> bits = flsl(pdx_to_pfn(1UL << >> (sizeof(frame_table->v.inuse._domain) * 8))) >> + >> PAGE_SHIFT - 1; > > I've introduced and used pdx_to_paddr(), which I think it's more > natural. We already had a paddr_to_pdx() which was missing it's > bidirectional equivalent. It's now: > > bits = flsl(pdx_to_paddr(1UL << (sizeof_field(struct page_info, > v.inuse._domain) * 8))) > - 1; Textually this is better, yes. I won't insist on the other variant, while still noting that your way there's an extra shift whereas my way there's merely an extra add. >> However, it further feels like this was off by one; we had similar issues >> over >> time in several places. There potentially being a gap between one less than >> the PDX used here and that very PDX, don't we need to calculate based on the >> "one less" value here? Hmm, there being a gap means no allocation would >> succeed for the precise value of "bits" (in the mask-compression scheme), so >> functionally all would be fine. Yet just to avoid setting a bad precedent I >> think we'd still be better off using >> >> bits = flsl(pdx_to_pfn((1UL << >> (sizeof(frame_table->v.inuse._domain) * 8)) >> - >> 1)) + PAGE_SHIFT; >> >> If one would log the value of bits, the result would then also be less >> confusing in (at least) the mask-compression scheme. > > > Is the above correct tough? > > Take for example the hypothetical case where pdx_to_pfn() returns > 0x10. Hmm, yes - while impossible in the mask-compression scheme, it is in principle possible with other schemes (like the offset one). > Then flsl() will return 5 (let's leave the PAGE_SHIFT > adjustment out for the example here). The allocation bit width would > be off-by-one, because allocating using a bit width of 5 would also > allow 0x11 to be allocated, and that won't be correct. > > I think we need to get the bit width of the next pdx (so the > non-inclusive end of the range), and then subtract 1 from it, > otherwise the allocation bit width is possibly off-by-one. I think you're right, and I can't really see how to (easily) get the more precise value for the mask-compression scheme then. I would therefore like to ask that you attach a comment clarifying the slight oddity. Jan

©2013 Xen Project, A Linux Foundation Collaborative Project. All Rights Reserved.
Linux Foundation is a registered trademark of The Linux Foundation.
Xen Project is a trademark of The Linux Foundation.