[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] Problems with merlot* AMD Opteron 6376 systems (Was Re: stable trees (was: [xen-4.2-testing test] 58584: regressions))



>>> On 26.06.15 at 16:44, <ian.campbell@xxxxxxxxxx> wrote:
> On Fri, 2015-06-26 at 13:59 +0100, Jan Beulich wrote:
>> >>> On 26.06.15 at 14:37, <ian.campbell@xxxxxxxxxx> wrote:
>> > On Fri, 2015-06-26 at 12:16 +0100, Ian Campbell wrote:
>> >> On Fri, 2015-06-26 at 11:49 +0100, Jan Beulich wrote:
>> >> > >>> On 26.06.15 at 12:37, <ian.campbell@xxxxxxxxxx> wrote:
>> >> > > At Andy Cooper's request I ran a quick job with mtrr.show=true
>> >> > > http://logs.test-lab.xenproject.org/osstest/logs/58909/ 
>> >> > > 
>> >> > > I think the relevant serial output is:
>> >> > > Jun 26 09:57:42.325077 (XEN) MTRR default type: uncachable
>> >> > > Jun 26 09:57:42.325111 (XEN) MTRR fixed ranges enabled:
>> >> > > Jun 26 09:57:42.333068 (XEN)   00000-9ffff write-back
>> >> > > Jun 26 09:57:42.333101 (XEN)   a0000-bffff uncachable
>> >> > > Jun 26 09:57:42.333128 (XEN)   c0000-fffff write-back
>> >> > > Jun 26 09:57:42.341077 (XEN) MTRR variable ranges enabled:
>> >> > > Jun 26 09:57:42.341110 (XEN)   0 base 000000000000 mask ffff80000000 
>> > write-back
>> >> > > Jun 26 09:57:42.349088 (XEN)   1 base 000080000000 mask ffffc0000000 
>> > write-back
>> >> > > Jun 26 09:57:42.349124 (XEN)   2 disabled
>> >> > > Jun 26 09:57:42.357068 (XEN)   3 disabled
>> >> > > Jun 26 09:57:42.357098 (XEN)   4 disabled
>> >> > > Jun 26 09:57:42.357122 (XEN)   5 disabled
>> >> > > Jun 26 09:57:42.357147 (XEN)   6 disabled
>> >> > > Jun 26 09:57:42.365063 (XEN)   7 disabled
>> >> > 
>> >> > This alone would mean UC for all memory above 4G. But I seem to
>> >> > recall AMD having some mechanism to avoid using MTRRs for this
>> >> > case. Let me try to dig this out once back from lunch.
>> >> 
>> >> While you do that it seems like I may as well try a run with
>> >> "e820-mtrr-clip" given to Xen.
>> > 
>> > According to http://logs.test-lab.xenproject.org/osstest/logs/58914/ it
>> > didn't make any difference to the end result.
>> > 
>> > It did seems to cause a huge number of
>> > Jun 26 11:51:29.933067 (XEN) AMD-Vi: IO_PAGE_FAULT: domain = 0, device id 
>> > = 
> 
>> > 0x92, fault address = 0xbdfe7000, flags = 0
>> > messages which weren't there before, not sure if that is a clue or not.
>> 
>> I think that's a result of amd_iommu_hwdom_init() now stopping
>> below the reserved ranges right below 3Gb. I.e. these ought to
>> go away if you had the system use minimally more than 4Gb. I
>> also think that you'd see those too without limiting memory if the
>> reserved range was large enough to not share a PDX with the
>> highest RAM page below 4Gb (due to the way mfn_valid() works),
>> or if we indeed only mapped RAM pages there.
> 
> I think you are probably speaking hypothetically, but just in case: Do
> you actually want me to do any of that? I'm not sure how easy it would
> be.

No, that was indeed meant only as an explanation, not as something
to test.

Jan


_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel


 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.