[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] hanging tapdisk2 processes and improper udev rules



On Fri, 2011-07-22 at 12:36 +0100, Andreas Olsowski wrote:
> On 07/22/2011 11:28 AM, Ian Campbell wrote:
> 
> > This is because udev and forward/backward compatibility are strangers
> > passing in the night. I presume if you make the recommended change to
> > SYMLINK+= instead of NAME= in your udev script this goes away?
> You assume correctly.
> 
> > I posted a patch to fix this "libxl: attempt to cleanup tapdisk
> > processes on disk backend destroy" a couple of times, most recently at
> > http://marc.info/?l=xen-devel&m=131066210526755 but it hasn't been
> > applied yet. Can you try it?
> 
> I tried it:
> 
> make -j7 tools:
> ...
> libxl_device.c: In function âlibxl__device_destroyâ:
> libxl_device.c:253: error: incompatible type for argument 1 of 
> âlibxl__device_destroy_tapdiskâ
> libxl_internal.h:321: note: expected âstruct libxl__gc *â but argument 
> is of type âlibxl__gcâ
> libxl_device.c:274: error: incompatible type for argument 1 of 
> âlibxl__device_destroy_tapdiskâ
> libxl_internal.h:321: note: expected âstruct libxl__gc *â but argument 
> is of type âlibxl__gcâ
> 
> My expertise with C is barely existant, but i took a look at 
> tools/libxl/libxl_device.c
> 
> and changed your
> libxl__device_destroy_tapdisk(gc, be_path);
> into
> libxl__device_destroy_tapdisk(&gc, be_path);
> 
> as i have seen some &gc on other lines of code.

That looks right. I think this is just a difference between current
xen-unstable and xen-4.1 (due to 23045:c426a7140c99 FWIW).

> And it compiled.
> 
> I then created a guest, shut it down.
> First it kept beeing in a -ps--- state, i wanted to take a look at the 
> runing processes with "ps auxww" but the ps process hung itself.
> I could no longer run "ps" successfully after this point.

Uh. That really shouldn't happen :-/ In fact baring a bug in the host OS
itself I'm not sure how ps can ever get into that state...

> syslog showed:
> ul 22 13:00:07 xenturio1 xl: tap-err:tap_ctl_read_message: failure 
> reading message
> Jul 22 13:00:07 xenturio1 xl: tap-err:tap_ctl_send_and_receive: failed 
> to receive 'unknown' message
> 
> Either my hack to get your code to compile was no good or your patch has 
> some unforseen side effects.

It's possible that it relies on something in xen-unstable that I'm not
aware of. Would it be possible for you to try and repro this issue with
xen-unstable.hg and this patch?

Daniel, have you got any idea what might be going on here?

Ian.
> 
> 
> 
> I have now rebooted the server.
> 
> 
> As i went on to check if multipath had any effect on it i added
> devnode "^td" to the blacklist.
> 
> Now when i xl create a vm it only boots up to a certain point and then 
> does nothing.
> If that certain point were to be the login prompt everything would be 
> fine, but it isnt:
> http://pastebin.com/Lmie6KwY
> 
> This is how it should look like:
> 
> http://pastebin.com/CsgYypbk
> 
> I will try to backtrace my steps and see what i did do to break my system.
> 
> In the meantime i have other systems i can test stuff on.
> 
> 
> 
> -with best regards
> 
> 



_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel


 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.