
Re: [Xen-devel] [PATCH] x86: fix paging_log_dirty_op to work with paging guests



On Thu, Dec 13, 2018 at 05:51:51AM -0700, Jan Beulich wrote:
> >>> On 13.12.18 at 12:39, <roger.pau@xxxxxxxxxx> wrote:
> > On Wed, Dec 12, 2018 at 09:07:14AM -0700, Jan Beulich wrote:
> >> >>> On 12.12.18 at 15:54, <roger.pau@xxxxxxxxxx> wrote:
> >> > @@ -488,6 +490,16 @@ static int paging_log_dirty_op(struct domain *d,
> >> >                      bytes = (unsigned int)((sc->pages - pages + 7) >> 3);
> >> >                  if ( likely(peek) )
> >> >                  {
> >> > +                    if ( paging_mode_enabled(current->domain) )
> >> > +                        /*
> >> > +                         * Drop the target p2m lock, or else Xen will panic
> >> > +                         * when trying to acquire the p2m lock of the caller
> >> > +                         * due to invalid lock order. Note that there are no
> >> > +                         * lock ordering issues here, and the panic is due to
> >> > +                         * the fact that the lock level tracking doesn't record
> >> > +                         * the domain the lock belongs to.
> >> > +                         */
> >> > +                        paging_unlock(d);
> >> 
> >> This makes it sound as if tracking the domain would help. It doesn't,
> >> at least not as long as there is not also some sort of ordering or
> >> hierarchy between domains. IOW I'd prefer if the "doesn't" became
> >> "can't".
> > 
> > Well, just keeping the correct order between each domain's locks should
> > be enough?
> > 
> > Ie: exactly the same that Xen currently does but on a per-domain
> > basis. This is feasible, but each CPU would need to store the lock
> > order of each possible domain:
> > 
> > DECLARE_PER_CPU(uint8_t, mm_lock_level[DOMID_FIRST_RESERVED]);
> > 
> > This would consume ~32KB per CPU, which is not that much but seems a
> > waste when most of the time a single entry will be used.
> 
> Well, tracking by domain ID wouldn't help you - the controlling
> domain may well have a higher ID than the one being controlled,
> i.e. the nesting you want needs to be independent of domain ID.

It's not tracking the domain ID, but rather tracking the lock level of
each different domain, hence the need for the array in the pcpu
structure. The lock checker would take a domain id and a level, and
perform the check as:

if ( mm_lock_level[domid] > level )
    panic
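
For illustration, a rough and untested sketch of what such a checker
could look like, reusing the per-CPU array from above (the helper
names below are invented):

DECLARE_PER_CPU(uint8_t, mm_lock_level[DOMID_FIRST_RESERVED]);

/*
 * Check that taking a lock of order 'level' belonging to domain 'domid'
 * respects the ordering recorded on this CPU for that domain.
 */
static inline void _check_lock_level(domid_t domid, unsigned int level)
{
    uint8_t *levels = this_cpu(mm_lock_level);

    if ( levels[domid] > level )
        panic("mm lock order violation: d%d %d > %u\n",
              domid, levels[domid], level);
}

/* Record the new level once the lock has been taken. */
static inline void _set_lock_level(domid_t domid, unsigned int level)
{
    this_cpu(mm_lock_level)[domid] = level;
}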

> >> Now before we go this route I think we need to consider whether
> >> this really is the only place where the two locks get into one
> >> another's way. That's because I don't think we want to introduce
> >> more of such, well, hackery, and hence we'd otherwise need a
> >> better solution. For example the locking model could be adjusted
> >> to allow such nesting in the general case: Dom0 and any domain
> >> whose ->target matches the subject domain here could be given
> >> a slightly different "weight" in the lock order violation check logic.
> > 
> > So locks from domains != current would be given a lower order, let's
> > say:
> > 
> > #define MM_LOCK_ORDER_nestedp2m               8
> > #define MM_LOCK_ORDER_p2m                    16
> > #define MM_LOCK_ORDER_per_page_sharing       24
> > #define MM_LOCK_ORDER_altp2mlist             32
> > #define MM_LOCK_ORDER_altp2m                 40
> > #define MM_LOCK_ORDER_pod                    48
> > #define MM_LOCK_ORDER_page_alloc             56
> > #define MM_LOCK_ORDER_paging                 64
> > #define MM_LOCK_ORDER_MAX                    MM_LOCK_ORDER_paging
> > 
> > If domain != current, the above values are used. If domain == current,
> > the above values plus MM_LOCK_ORDER_MAX are used? So in that case the
> > order of MM_LOCK_ORDER_p2m against the current domain would be 16 + 64
> > = 80?
> > 
> > This has the slight inconvenience that not all mm lock call sites have
> > the target domain available, but that can be solved.
> 
> No, I wasn't envisioning a really generic approach, i.e.
> permitting this for all of the locks. Do we need this? Till now
> I was rather considering doing this for just the paging lock,
> with Dom0 and the controlling domains using just a small
> (+/- 2 and +/- 1) offset from the normal paging one.
> However, I now realize that indeed there may be more of
> such nesting, and you've merely hit a specific case of it.

So far that's the only case I've hit, but I'm quite sure there's a
non-trivial number of hypercalls that were only used by a PV Dom0, so
I wouldn't rule out finding other such instances.

> In any event it seems to me that you've got the ordering
> inverted, i.e. domain == current (or really: the general case,
> i.e. after excluding the two special cases) would use base
> values, domain == currd->target would use some offset, and
> Dom0 would use double the offset.
> 
> With it extended to all locks I'm no longer sure though that
> the end result would be safe. I think it would be helpful to
> rope in whoever originally designed this locking model.
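
To make the discussion a bit more concrete (the helper below is made
up, just to illustrate the idea), the variant I was describing above
would boil down to something like:

static inline int mm_lock_order(const struct domain *d, int order)
{
    /*
     * Locks belonging to the current domain get a higher order, so that
     * they can still be acquired while holding locks of a foreign
     * (target) domain.  Your suggestion would be the other way around:
     * base values for the general case, plus a small offset for
     * ->target and double that offset for Dom0.
     */
    return order + (d == current->domain ? MM_LOCK_ORDER_MAX : 0);
}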

I'm adding Tim, who I think is the original author of the mm lock
infrastructure.

Thanks, Roger.

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxxx
https://lists.xenproject.org/mailman/listinfo/xen-devel

 

