Re: Network driver domain broken
On Thu, Mar 03, 2022 at 01:08:31PM -0500, Jason Andryuk wrote:
> On Thu, Mar 3, 2022 at 11:34 AM Roger Pau Monné <roger.pau@xxxxxxxxxx> wrote:
> >
> > On Thu, Mar 03, 2022 at 05:01:23PM +0100, Andrea Stevanato wrote:
> > > On 03/03/2022 15:54, Andrea Stevanato wrote:
> > > > Hi all,
> > > >
> > > > according to the conversation that I had with royger, aa67b97ed34
> > > > broke the driver domain support.
> > > >
> > > > What I'm trying to do is to set up networking between guests using a
> > > > driver domain. Therefore, the guest (driver) has been started with the
> > > > following cfg.
> > > >
> > > > name = "guest0"
> > > > kernel = "/media/sd-mmcblk0p1/Image"
> > > > ramdisk = "/media/sd-mmcblk0p1/rootfs.cpio.gz"
> > > > extra = "console=hvc0 rdinit=/sbin/init root=/dev/ram0"
> > > > memory = 1024
> > > > vcpus = 2
> > > > driver_domain = 1
> > > >
> > > > On guest0 I created the bridge, assigned it a static IP and started
> > > > udhcpd on the xenbr0 interface.
> > > > The second guest has been started with the following cfg:
> > > >
> > > > name = "guest1"
> > > > kernel = "/media/sd-mmcblk0p1/Image"
> > > > ramdisk = "/media/sd-mmcblk0p1/rootfs.cpio.gz"
> > > > extra = "console=hvc0 rdinit=/sbin/init root=/dev/ram0"
> > > > memory = 1024
> > > > vcpus = 2
> > > > vif = [ 'bridge=xenbr0, backend=guest0' ]
> > > >
> > > > The result of strace xl devd follows:
> > > >
> > > > # strace xl devd
> > > > execve("/usr/sbin/xl", ["xl", "devd"], 0xffffdf0420c8 /* 13 vars */) = 0
>
> > > > ioctl(5, _IOC(_IOC_NONE, 0x50, 0, 0x30), 0xffffe6e41b40) = -1 EPERM
> > > > (Operation not permitted)
> > > > write(2, "libxl: ", 7libxl: ) = 7
> > > > write(2, "error: ", 7error: ) = 7
> > > > write(2, "libxl_utils.c:820:libxl_cpu_bitm"...,
> > > > 87libxl_utils.c:820:libxl_cpu_bitmap_alloc: failed to retrieve the
> > > > maximum number of cpus) = 87
> > > > write(2, "\n", 1
> > > > ) = 1
> > > > clone(child_stack=NULL,
> > > > flags=CLONE_CHILD_CLEARTID|CLONE_CHILD_SETTID|SIGCHLD,
> > > > child_tidptr=0xffff9ee7a0e0) = 814
> > > > wait4(814, [{WIFEXITED(s) && WEXITSTATUS(s) == 0}], 0, NULL) = 814
> > > > --- SIGCHLD {si_signo=SIGCHLD, si_code=CLD_EXITED, si_pid=814,
> > > > si_uid=0, si_status=0, si_utime=2, si_stime=2} ---
>
> xl devd is daemonizing, but strace is only following the first
> process. Use `strace xl devd -F` to prevent the daemonizing (or
> `strace -f xl devd` to follow children).
Or as a first step try to see what kind of messages you get from `xl
devd -F` when trying to attach a device using the driver domain.
> > > > close(6) = 0
> > > > close(5) = 0
> > > > munmap(0xffff9f45f000, 4096) = 0
> > > > close(7) = 0
> > > > close(10) = 0
> > > > close(9) = 0
> > > > close(8) = 0
> > > > close(11) = 0
> > > > close(3) = 0
> > > > close(4) = 0
> > > > exit_group(0) = ?
> > > > +++ exited with 0 +++
> > > >
> > > > royger told me that it is a BUG and not an issue with my setup.
> > > > Therefore here I am.
> >
> > Just a bit more context: AFAICT the calls to libxl_cpu_bitmap_alloc in
> > parse_global_config will prevent xl from being usable on anything
> > other than the control domain (because the sysctl is only available to
> > privileged domains). This is an issue for 'xl devd', as it won't
> > start anymore.
>
> These look non-fatal at first glance?
Indeed. I was too quick reading the trace and assumed `xl devd` exited
due to the errors, but those are non-fatal: the process just
daemonized.
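For illustration, here is a minimal Python sketch of the generic
double-fork daemonization pattern, which would produce the same shape of
trace (a clone(), a wait4() on a child that exits with 0, then
exit_group(0) in the traced process). This is an assumption about the
general pattern, not libxl's actual code, and `launch_daemon` is a
hypothetical name used only for this example.

```python
import os


def launch_daemon():
    """Double-fork and return (intermediate_exit_status, daemon_pid).

    Hypothetical helper: it only mimics the process structure of a
    daemonizing program, it does not do any real work in the daemon.
    """
    read_fd, write_fd = os.pipe()
    pid = os.fork()
    if pid == 0:
        # Intermediate child: start a new session, then fork again.
        os.close(read_fd)
        os.setsid()
        daemon = os.fork()
        if daemon == 0:
            # The daemon itself. The real work, and any later errors,
            # would happen here, out of sight of a plain `strace`.
            os.close(write_fd)
            os._exit(0)
        # Report the daemon's pid to the original process, then exit 0.
        os.write(write_fd, str(daemon).encode())
        os.close(write_fd)
        os._exit(0)

    # Original process: this is the only one a plain `strace` follows.
    os.close(write_fd)
    daemon_pid = int(os.read(read_fd, 32).decode())
    os.close(read_fd)
    _, status = os.waitpid(pid, 0)
    return os.WEXITSTATUS(status), daemon_pid


if __name__ == "__main__":
    status, daemon_pid = launch_daemon()
    print(f"intermediate child exited with {status}, daemon pid {daemon_pid}")
```

The original process sees only a child that exited cleanly, which is why
the trace above ends with "exited with 0" regardless of what the daemon
does afterwards.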
Thanks, Roger.