Re: [Xen-devel] long latency of domain shutdown
On 14/5/08 16:54, "Jan Beulich" <jbeulich@xxxxxxxxxx> wrote:

> I'm afraid that's not the case - after they are now using the domain
> shutdown fix successfully, they upgraded the machine to 64G and
> the system fails to boot. Sounds exactly like other reports we had on
> the list regarding boot failures with lots of memory that can be avoided
> using dom0_mem=<much smaller value>. As I understand it, this is
> due to the way the kernel creates its 1:1 mapping - the hypervisor has
> to validate the whole tree from each L4 entry being installed in a single
> step - for a 4G machine I measured half a second for this operation, so
> obviously anything beyond 32G is open for problems when the PM timer
> is in use.

Hmm, yes that makes sense. 32GB is 8M ptes, so I could imagine that taking a
while to validate. Anyhow this obviously needs fixing regardless of the
specific details of this specific failure case.

> Unless you tell me that this is on your very short term agenda to work on,
> I'll make an attempt at finding a reasonable solution starting tomorrow.

Yes, I'll sort this one out hopefully by next week. I think this can be
solved pretty straightforwardly. It's the encoding of the continuation into
the page_info structure, and synchronisation of that, that needs some
back-of-envelope thought. As long as there are not too many callers of
{get,put}_page_type(L{2,3,4}_pagetable), and I don't think we have that
many, then the changes should be pretty localised. Only those callers have
to deal with 'EAGAIN' (or equivalent).

 -- Keir

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel
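
For illustration only, the 'EAGAIN' idea discussed above might look roughly
like the C sketch below. This is not Xen's code and not the eventual fix:
struct table_state, validate_entry(), validate_table_preemptible(),
validate_with_retries(), ptes_needed() and PREEMPT_BUDGET are all
hypothetical names invented for this example, and the sketch ignores the
synchronisation question entirely. It only shows a progress cursor being
stored next to the table so that a long validation can stop early and be
resumed by the caller, plus the 32GB / 4KB arithmetic behind the 8M figure.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>

#define PAGE_SIZE_BYTES   4096ULL /* 4KB pages, one PTE per mapped page */
#define ENTRIES_PER_TABLE 512     /* x86-64: 512 entries per page-table page */
#define PREEMPT_BUDGET    64      /* hypothetical: entries checked per call */

/* Hypothetical stand-in for per-page bookkeeping; NOT Xen's page_info. */
struct table_state {
    unsigned int nr_validated;    /* continuation cursor: entries done so far */
    int validated;                /* set once the whole table has been checked */
};

/* Number of PTEs needed to map 'bytes' of memory 1:1 with 4KB pages. */
static unsigned long long ptes_needed(unsigned long long bytes)
{
    return bytes / PAGE_SIZE_BYTES;
}

/* Hypothetical per-entry check; a real one would validate the lower level. */
static int validate_entry(unsigned long entry)
{
    (void)entry;
    return 0;
}

/*
 * Validate a table in bounded chunks.  Returns 0 when the whole table is
 * done, -EAGAIN when the budget ran out (progress is kept in *st), or
 * whatever negative error validate_entry() reported.
 */
static int validate_table_preemptible(const unsigned long *table,
                                      struct table_state *st)
{
    unsigned int budget = PREEMPT_BUDGET;

    if (st->validated)
        return 0;

    while (st->nr_validated < ENTRIES_PER_TABLE) {
        int rc;

        if (budget == 0)
            return -EAGAIN;       /* caller must retry; cursor is saved */
        budget--;

        rc = validate_entry(table[st->nr_validated]);
        if (rc)
            return rc;
        st->nr_validated++;
    }

    st->validated = 1;
    return 0;
}

/*
 * What a caller that "deals with EAGAIN" could do: retry until the
 * operation completes or fails for another reason.  *calls counts the
 * number of attempts that were needed.
 */
static int validate_with_retries(const unsigned long *table,
                                 struct table_state *st,
                                 unsigned int *calls)
{
    int rc;

    *calls = 0;
    do {
        rc = validate_table_preemptible(table, st);
        (*calls)++;
    } while (rc == -EAGAIN);

    return rc;
}
```

In this sketch a caller would zero-initialise a struct table_state and call
validate_with_retries(); a real caller would presumably return to a
preemption point between attempts rather than spin in a loop.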