[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] Dom0 losing interrupts???


  • To: Jan Beulich <JBeulich@xxxxxxxxxx>
  • From: Juergen Gross <juergen.gross@xxxxxxxxxxxxxx>
  • Date: Mon, 14 Feb 2011 10:38:40 +0100
  • Cc: "xen-devel@xxxxxxxxxxxxxxxxxxx" <xen-devel@xxxxxxxxxxxxxxxxxxx>
  • Delivery-date: Mon, 14 Feb 2011 01:39:35 -0800
  • Domainkey-signature: s=s1536a; d=ts.fujitsu.com; c=nofws; q=dns; h=X-SBRSScore:X-IronPort-AV:Received:X-IronPort-AV: Received:Received:Message-ID:Date:From:Organization: User-Agent:MIME-Version:To:CC:Subject:References: In-Reply-To:Content-Type:Content-Transfer-Encoding; b=Af/KGd7kHbosl1GLzQbf5XD4dl/axw0xtw0kGeIVFqHoZMtXap3Izx++ 4JjHMAbdRf6S+tTTMfpgY6MJclUKpzhA/S+xu48rQkkupXtvu++akfImQ m7gwHiEU6MGwTPtyubpULu96C96wcBKolmrNfwOxkIAaqYGJjR5AzookH R+jYauhk9qUsnpP954TFLw6kSqMzKRP88YlE3av9FWoyshq1p7at+FXPC bHMztWTR8hkJ7gX4UBdLFITiaF565;
  • List-id: Xen developer discussion <xen-devel.lists.xensource.com>

On 02/14/11 10:26, Jan Beulich wrote:
On 14.02.11 at 07:59, Juergen Gross<juergen.gross@xxxxxxxxxxxxxx>  wrote:
I used xen-unstable, kernel 2.6.32.24 from SLES11 SP1 on a 12 core INTEL
nehalem machine. I pinned all 12 Dom0 vcpus to pcpu 1-2 and started a
parallel
build. After about 2 minutes the first missing interrupts were reported, a
little bit later the next one, no xen messages are printed:

That's certainly not too surprising, somewhat depending on the
maximally tolerated latencies. It seems unlikely to me for a 6-fold
CPU over-commit to promise stable operation, yet certain
adjustments could probably be done to make it work better (like
temporarily boosting the priority of a hardware interrupt's target
vCPU).

I would understand timeouts. But shouldn't the interrupt come in sooner or
later? At least the megasas driver seems not to be able to recover from this
problem, as a result my root filesystem is set to read-only...

This would mean there is a problem in the megasas driver, correct?
And Andre reports stability problems of his machine in similar cases, but
in his case the network driver seems to be the reason.

Are you planning to prepare a patch for boosting the priority of vcpus being
the target for a hardware interrupt? I think I would have to search some time
to find the correct places to change...


Juergen

--
Juergen Gross                 Principal Developer Operating Systems
TSP ES&S SWE OS6                       Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@xxxxxxxxxxxxxx
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel


 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.