Xen project Mailing List

Re: [PATCH v2] x86/HVM: restrict use of pinned cache attributes as well as associated flushing

To: Roger Pau Monné <roger.pau@xxxxxxxxxx>

Date: Tue, 10 Jun 2025 13:59:52 +0200

Autocrypt: addr=jbeulich@xxxxxxxx; keydata= xsDiBFk3nEQRBADAEaSw6zC/EJkiwGPXbWtPxl2xCdSoeepS07jW8UgcHNurfHvUzogEq5xk hu507c3BarVjyWCJOylMNR98Yd8VqD9UfmX0Hb8/BrA+Hl6/DB/eqGptrf4BSRwcZQM32aZK 7Pj2XbGWIUrZrd70x1eAP9QE3P79Y2oLrsCgbZJfEwCgvz9JjGmQqQkRiTVzlZVCJYcyGGsD /0tbFCzD2h20ahe8rC1gbb3K3qk+LpBtvjBu1RY9drYk0NymiGbJWZgab6t1jM7sk2vuf0Py O9Hf9XBmK0uE9IgMaiCpc32XV9oASz6UJebwkX+zF2jG5I1BfnO9g7KlotcA/v5ClMjgo6Gl MDY4HxoSRu3i1cqqSDtVlt+AOVBJBACrZcnHAUSuCXBPy0jOlBhxPqRWv6ND4c9PH1xjQ3NP nxJuMBS8rnNg22uyfAgmBKNLpLgAGVRMZGaGoJObGf72s6TeIqKJo/LtggAS9qAUiuKVnygo 3wjfkS9A3DRO+SpU7JqWdsveeIQyeyEJ/8PTowmSQLakF+3fote9ybzd880fSmFuIEJldWxp Y2ggPGpiZXVsaWNoQHN1c2UuY29tPsJgBBMRAgAgBQJZN5xEAhsDBgsJCAcDAgQVAggDBBYC AwECHgECF4AACgkQoDSui/t3IH4J+wCfQ5jHdEjCRHj23O/5ttg9r9OIruwAn3103WUITZee e7Sbg12UgcQ5lv7SzsFNBFk3nEQQCACCuTjCjFOUdi5Nm244F+78kLghRcin/awv+IrTcIWF hUpSs1Y91iQQ7KItirz5uwCPlwejSJDQJLIS+QtJHaXDXeV6NI0Uef1hP20+y8qydDiVkv6l IreXjTb7DvksRgJNvCkWtYnlS3mYvQ9NzS9PhyALWbXnH6sIJd2O9lKS1Mrfq+y0IXCP10eS FFGg+Av3IQeFatkJAyju0PPthyTqxSI4lZYuJVPknzgaeuJv/2NccrPvmeDg6Coe7ZIeQ8Yj t0ARxu2xytAkkLCel1Lz1WLmwLstV30g80nkgZf/wr+/BXJW/oIvRlonUkxv+IbBM3dX2OV8 AmRv1ySWPTP7AAMFB/9PQK/VtlNUJvg8GXj9ootzrteGfVZVVT4XBJkfwBcpC/XcPzldjv+3 HYudvpdNK3lLujXeA5fLOH+Z/G9WBc5pFVSMocI71I8bT8lIAzreg0WvkWg5V2WZsUMlnDL9 mpwIGFhlbM3gfDMs7MPMu8YQRFVdUvtSpaAs8OFfGQ0ia3LGZcjA6Ik2+xcqscEJzNH+qh8V m5jjp28yZgaqTaRbg3M/+MTbMpicpZuqF4rnB0AQD12/3BNWDR6bmh+EkYSMcEIpQmBM51qM EKYTQGybRCjpnKHGOxG0rfFY1085mBDZCH5Kx0cl0HVJuQKC+dV2ZY5AqjcKwAxpE75MLFkr wkkEGBECAAkFAlk3nEQCGwwACgkQoDSui/t3IH7nnwCfcJWUDUFKdCsBH/E5d+0ZnMQi+G0A nAuWpQkjM1ASeQwSHEeAWPgskBQL

Cc: "xen-devel@xxxxxxxxxxxxxxxxxxxx" <xen-devel@xxxxxxxxxxxxxxxxxxxx>, Andrew Cooper <andrew.cooper3@xxxxxxxxxx>

Delivery-date: Tue, 10 Jun 2025 12:00:04 +0000

List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

On 10.06.2025 12:44, Roger Pau Monné wrote: > On Tue, Jun 10, 2025 at 09:40:38AM +0200, Jan Beulich wrote: >> On 09.06.2025 12:36, Roger Pau Monné wrote: >>> On Wed, Jun 04, 2025 at 11:48:00AM +0200, Jan Beulich wrote: >>>> @@ -605,31 +606,35 @@ int hvm_set_mem_pinned_cacheattr(struct >>>> >>>> type = range->type; >>>> call_rcu(&range->rcu, free_pinned_cacheattr_entry); >>>> - p2m_memory_type_changed(d); >>>> switch ( type ) >>>> { >>>> - case X86_MT_UCM: >>>> + case X86_MT_WB: >>>> + case X86_MT_WP: >>>> + case X86_MT_WT: >>>> /* >>>> - * For EPT we can also avoid the flush in this case; >>>> - * see epte_get_entry_emt(). >>>> + * Flush since we don't know what the cachability is >>>> going >>>> + * to be. >>>> */ >>>> - if ( hap_enabled(d) && cpu_has_vmx ) >>>> - case X86_MT_UC: >>>> - break; >>>> - /* fall through */ >>>> - default: >>>> - flush_all(FLUSH_CACHE); >>>> + if ( is_iommu_enabled(d) || cache_flush_permitted(d) ) >>>> + flush = true; >>> >>> Is the check here required? memory_type_changed() will already check >>> for is_iommu_enabled() and cache_flush_permitted(), and hence you >>> could just set flush to true unconditionally here IMO. >> >> The behavioral difference is when both predicates are false: The way I have >> it now, p2m_memory_type_changed() will then still be called (conditionally), >> better matching prior behavior. > > I see. Yes, p2m_memory_type_changed() needs to be called. > >> >>>> break; >>>> } >>>> - return 0; >>>> + rc = 0; >>>> + goto finish; >>>> } >>>> domain_unlock(d); >>>> return -ENOENT; >>>> >>>> case X86_MT_UCM: >>>> case X86_MT_UC: >>>> - case X86_MT_WB: >>>> case X86_MT_WC: >>>> + /* Flush since we don't know what the cachability was. */ >>>> + if ( !is_iommu_enabled(d) && !cache_flush_permitted(d) ) >>>> + return -EPERM; > > When assigning IO resources without an IOMMU enabled we likely need > to allow the pinned cache attributes to be set, but there's no need to > propagate the changes to the p2m, as the EMT calculation won't take > into account the pinned attributes. Why would it not do so? Am I overlooking a conditional there that would cause hvm_get_mem_pinned_cacheattr() to not be called? The only related one I see is if ( type != p2m_mmio_direct && !is_iommu_enabled(d) && !cache_flush_permitted(d) ) covering the without-IOMMU case just the same as the "with" one. (The "without" case looks dubious to me, as I don't think we arrange for any identity mapping, but that's a separate topic.) > IOW: I don't think we can safely short-circuit and return -EPERM here > without agreeing that it's a behavioral difference form the previous > implementation. There's no question there is a behavioral change here. Without I/O resources (and without IOMMU) we simply don't accept cache attributes other then WB elsewhere; the change is to avoid doing so here as well, to get things to be consistent. Hence the -EPERM return. >>>> @@ -682,9 +687,11 @@ int hvm_set_mem_pinned_cacheattr(struct >>>> >>>> xfree(newr); >>>> >>>> - p2m_memory_type_changed(d); >>>> - if ( type != X86_MT_WB ) >>>> - flush_all(FLUSH_CACHE); >>>> + finish: >>>> + if ( flush ) >>>> + memory_type_changed(d); >>>> + else if ( d->vcpu && d->vcpu[0] ) >>>> + p2m_memory_type_changed(d); >>> >>> FWIW, I would just call memory_type_changed() unconditionally >>> regardless of the change. >> >> In which case the need for the "flush" local var would go away, if I >> understand your suggestion correctly. Like above, there'll then be >> more of a behavioral change than intended. In particular ... > > There will be a behavioral change, but not one that the guest would > notice IMO. > >>> We suspect the hypercall is only used at >>> domain creation time (where memory_type_changed() won't do a cache >>> flush anyway). >> >> ... "suspect" is not enough for my taste. The only alternative there >> that I see (as mentioned in a post-commit-message remark) is to >> refuse such "late" changes altogether. Yet for that we need to be >> sure, which it looks like no-one of us is. > > Why do you say only alternative? Oh, sorry, I meant "only" just in regard to options keeping the main code structure of the change. I agree ... > Calling memory_type_changed() unconditionally (without taking into > account the previous or new cache attributes) would also be an > acceptable solution, that might wide the cache flushing a bit, but > would still be correct and much simpler IMO. ... that this, too, is a possibility. It would, however, go against the stated purpose of the change (in the subject "... as well as associated flushing"), which - after all - was the main goal here, seeing the series this was originally part of. Jan

©2013 Xen Project, A Linux Foundation Collaborative Project. All Rights Reserved.
Linux Foundation is a registered trademark of The Linux Foundation.
Xen Project is a trademark of The Linux Foundation.