[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] [PATCH v2] Fix scheduler crash after s3 resume

To: Jan Beulich <JBeulich@xxxxxxxx>
From: Juergen Gross <juergen.gross@xxxxxxxxxxxxxx>
Date: Fri, 25 Jan 2013 12:05:52 +0100
Cc: George Dunlap <George.Dunlap@xxxxxxxxxxxxx>, Tomasz Wroblewski <tomasz.wroblewski@xxxxxxxxxx>, "Keir \(Xen.org\)" <keir@xxxxxxx>, "xen-devel@xxxxxxxxxxxxx" <xen-devel@xxxxxxxxxxxxx>
Delivery-date: Fri, 25 Jan 2013 11:06:13 +0000
Domainkey-signature: s=s1536a; d=ts.fujitsu.com; c=nofws; q=dns; h=X-SBRSScore:X-IronPort-AV:Received:X-IronPort-AV: Received:Received:Message-ID:Date:From:Organization: User-Agent:MIME-Version:To:CC:Subject:References: In-Reply-To:Content-Type:Content-Transfer-Encoding; b=vDYt+lfDVboj9IE8g+kFrT0eK+XAERpprXHUxIIdLyUtAAKz+4fwlHy8 ibcuVrooQ8SpbBwYRQW9ahm8VquFt7/I/xJsX6xxt9v8RvlC7u7t1eWy1 FPLCAMOe6RLA4ttJEKfSbtbbbgglEnUZD1vxVW+RqXoJLPm+AJuvSPDWZ QZQYkmy6S15pQ18V4qZ7enEq5tpSq/SYysl8wyB2bxtioJO1Il5wY3ynf aB9c85jFX1tQjwrjG/3Q18cVyly/z;
List-id: Xen developer discussion <xen-devel.lists.xen.org>

Am 25.01.2013 11:40, schrieb Jan Beulich:

On 25.01.13 at 11:35, Juergen Gross<juergen.gross@xxxxxxxxxxxxxx>  wrote:

Am 25.01.2013 11:31, schrieb Jan Beulich:

On 25.01.13 at 11:23, Juergen Gross<juergen.gross@xxxxxxxxxxxxxx>   wrote:

Am 25.01.2013 11:15, schrieb Jan Beulich:

On 25.01.13 at 10:45, Tomasz Wroblewski<tomasz.wroblewski@xxxxxxxxxx>    wrote:

I think I had already raised the question of the placement of
this rcu_barrier() here, and the lack of a counterpart in the
suspend portion of the path. Keir? Or should
rcu_barrier_action() avoid calling process_pending_softirqs()
while still resuming, and instead call __do_softirq() with all but
RCU_SOFTIRQ masked (perhaps through a suitable wrapper,
or alternatively by open-coding its effect)?

Though I recall these vcpu_wake crashes happen also from other entry
points in enter_state but rcu_barrier, so I dont think removing that
helps much. Just was unable to get a proper log of them today due to
most of them being cut in half. Will try bit more.


In which case making __do_softirq() itself honor being in the
suspend/resume path might still be an option.

My belief is that as long as vcpu_migrate is not called in
cpu_disable_scheduler, the vcpu->processor shall continue to point to
offline cpu. Which will crash if the vcpu_wake is called for that vcpu.
If vcpu_migrate is called, then vcpu_wake will still be called with some
frequency but since vcpu->processor shall point to online cpu, and it
won't crash. So likely avoiding the wakes here completely is not the
goal, just the offline ones.


But you neglect the fact that waking vCPU-s at this point is
unnecessary anyway (they have nowhere to run on).


What about adding a global scheduler_disable() in freeze_domains() and a
scheduler_enable() in thaw_domains() which will switch scheduler locking to
a global lock (or disable it at all?). This should solve all problems
without
any complex changes of current behavior.


I don't see how this would address the so far described
shortcomings.


The crash happens due to an access to the scheduler percpu area which isn't
allocated at the moment. The accessed element is the address of the
scheduler
lock for this cpu. Disabling the percpu locking scheme of the scheduler
while
the non-boot cpus are offline will avoid the crash.


Ah, okay. But that wouldn't prevent other bad effects that could
result from vCPU-s pointing to offline pCPU-s. Hence I think such
a solution, even if sufficient for now, would set us up for future
similar (and similarly hard to debug) issues.


To avoid this problem you would have to change the suspend logic, I think.

We could take the cpus just physically offline without removing all the
state information from the system. During resume the old state and percpu areas
would just be reused.

While this would be a clean solution, it's not a really small task to do...


Juergen

--
Juergen Gross                 Principal Developer Operating Systems
PBG PDG ES&S SWE OS6                   Telephone: +49 (0) 89 3222 2967
Fujitsu Technology Solutions              e-mail: juergen.gross@xxxxxxxxxxxxxx
Domagkstr. 28                           Internet: ts.fujitsu.com
D-80807 Muenchen                 Company details: ts.fujitsu.com/imprint.html

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel

References:
- [Xen-devel] [PATCH] Fix scheduler crash after s3 resume
  - From: Tomasz Wroblewski
- Re: [Xen-devel] [PATCH] Fix scheduler crash after s3 resume
  - From: Juergen Gross
- [Xen-devel] [PATCH v2] Fix scheduler crash after s3 resume
  - From: Tomasz Wroblewski
- Re: [Xen-devel] [PATCH v2] Fix scheduler crash after s3 resume
  - From: Jan Beulich
- Re: [Xen-devel] [PATCH v2] Fix scheduler crash after s3 resume
  - From: Tomasz Wroblewski
- Re: [Xen-devel] [PATCH v2] Fix scheduler crash after s3 resume
  - From: Jan Beulich
- Re: [Xen-devel] [PATCH v2] Fix scheduler crash after s3 resume
  - From: Tomasz Wroblewski
- Re: [Xen-devel] [PATCH v2] Fix scheduler crash after s3 resume
  - From: Jan Beulich
- Re: [Xen-devel] [PATCH v2] Fix scheduler crash after s3 resume
  - From: Tomasz Wroblewski
- Re: [Xen-devel] [PATCH v2] Fix scheduler crash after s3 resume
  - From: Jan Beulich
- Re: [Xen-devel] [PATCH v2] Fix scheduler crash after s3 resume
  - From: Juergen Gross
- Re: [Xen-devel] [PATCH v2] Fix scheduler crash after s3 resume
  - From: Jan Beulich
- Re: [Xen-devel] [PATCH v2] Fix scheduler crash after s3 resume
  - From: Juergen Gross
- Re: [Xen-devel] [PATCH v2] Fix scheduler crash after s3 resume
  - From: Jan Beulich

Prev by Date: Re: [Xen-devel] kernel log flooded with: xen_balloon: reserve_additional_memory: add_memory() failed: -17
Next by Date: Re: [Xen-devel] Fatal crash on xen4.2 HVM + qemu-xen dm + NFS
Previous by thread: Re: [Xen-devel] [PATCH v2] Fix scheduler crash after s3 resume
Next by thread: Re: [Xen-devel] [PATCH v2] Fix scheduler crash after s3 resume
Index(es):
- Date
- Thread

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.