[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [Xen-devel] [BUG] xenstored crash [xen-4.1.3] - likely tdb related
Hello, we now observed several xenstored crashes. After enabling writing core filed I was able to capture the following stack trace through gdb: > 0 talloc_chunk_from_ptr (ptr=0xff0000000000) at talloc.c:116 > 116 if ((tc->flags & ~0xF) != TALLOC_MAGIC) { > warning: not using untrusted file > "/root/xen-4.1-4.1.3/xen-4.1.3/tools/xenstore/.gdbinit" > (gdb) bt > #0 talloc_chunk_from_ptr (ptr=0xff0000000000) at talloc.c:116 > #1 0x0000000000407edf in talloc_free (ptr=0xff0000000000) at talloc.c:551 > #2 0x000000000040a348 in tdb_open_ex (name=0x167d620 > "/var/lib/xenstored/tdb.0x16a48b0", > hash_size=<value optimized out>, tdb_flags=0, open_flags=<value optimized > out>, mode=<value optimized out>, > log_fn=0x4093b0 <null_log_fn>, hash_fn=<value optimized out>) at > tdb.c:1958 > #3 0x000000000040a684 in tdb_open (name=0xff0000000000 <Address > 0xff0000000000 out of bounds>, hash_size=0, > tdb_flags=4254928, open_flags=-1, mode=3974450184) at tdb.c:1773 > #4 0x000000000040a70b in tdb_copy (tdb=0x16c9040, outfile=0x167d620 > "/var/lib/xenstored/tdb.0x16a48b0") > at tdb.c:2124 > #5 0x0000000000406c2d in do_transaction_start (conn=0x167e310, in=<value > optimized out>) > at xenstored_transaction.c:164 > #6 0x00000000004045ca in process_message (conn=0x167e310) at > xenstored_core.c:1214 > #7 consider_message (conn=0x167e310) at xenstored_core.c:1261 > #8 handle_input (conn=0x167e310) at xenstored_core.c:1308 > #9 0x0000000000405170 in main (argc=<value optimized out>, argv=<value > optimized out>) at xenstored_core.c:1964 > > (gdb) frame 2 > #2 0x000000000040a348 in tdb_open_ex (name=0x167d620 > "/var/lib/xenstored/tdb.0x16a48b0", > hash_size=<value optimized out>, tdb_flags=0, open_flags=<value optimized > out>, mode=<value optimized out>, > log_fn=0x4093b0 <null_log_fn>, hash_fn=<value optimized out>) at > tdb.c:1958 > 1958 SAFE_FREE(tdb->locked); > (gdb) print tdb->locked > $3 = (struct tdb_lock_type *) 0xff0000000000 The "tdb->locked" address looks bogus. I had a look at xen/tools/xenstore/tdb.c myself but did not spot any obvious errors. As tdb_copy() looks like some internal function of tdb and tdb has come from the SAMBA project, this looks more like a bug in tdb then in xenstored. I compared tdb between RELEASE-4.1.3 and master and didn't see any interesting changes, so I'm not convinced that an update to 4.1.6 or newer xen-4.x would solve this specific issue. The crash is very annoying as the domains can no longer be managed or migrated. As xenstored (AFAIK) can't be restarted, we currently have to reboot the host to get the system back to a workable state. Has someone seen that bug elsewhere? Sincerely Philipp -- Philipp Hahn Open Source Software Engineer Univention GmbH be open. Mary-Somerville-Str. 1 D-28359 Bremen Tel.: +49 421 22232-0 Fax : +49 421 22232-99 hahn@xxxxxxxxxxxxx http://www.univention.de/ Geschäftsführer: Peter H. Ganten HRB 20755 Amtsgericht Bremen Steuer-Nr.: 71-597-02876 _______________________________________________ Xen-devel mailing list Xen-devel@xxxxxxxxxxxxx http://lists.xen.org/xen-devel
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |