[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-users] VMs go down - indeterministic behaviour

The network shows un-pingable actually and from outside. When I do an xm list on dom0 the domU still gets listed. A colleague of mine says she can log in through xm console fine when the vm is in this "unreachable from outside" state from dom0.

Can you point me to what firmware update you needed?

On Fri, Jan 14, 2011 at 9:03 AM, Steven Timm <timm@xxxxxxxx> wrote:
Does the VM actually go down or does the network just go
unpingable?  During the time when the VM is unresponsive, what
do you see on xm cons?  In early days of Xen we saw sometimes
that the network between dom0 and domU would go down sometimes,
i.e. dom0 couldn't see domU or the outside network but
domU could still see the outside network..we eventually
tracked it to a firmware update that was needed on the NIC.


On Fri, 14 Jan 2011, Megha Vishwanath wrote:


We've been trying to investigate this problem for a while now. We have a
Debian box on which at any point in time we run 5/6 VMs. Sometimes one the
VMs just randomly go down and we detect this from a ping alert.

This happened this morning and I'm adding in here the xm dmesg output from
dom0 (which was NOT what went down) and xm info. Has anyone seen this kind
of behavior? Any other info I can provide that helps? And if that needs me
to use some tools / commands, pls do let me know.

megha@beta:~$ sudo xm dmesg
(XEN) Xen version 4.0.1 (Debian 4.0.1-1) (waldi@xxxxxxxxxx) (gcc version
4.4.5 20100824 (prerelease) (Debian 4.4.4-11) ) Fri Sep  3 15:38:12 UTC 2010
(XEN) Bootloader: GRUB 1.98+20100804-4
(XEN) Command line: placeholder
(XEN) Video information:
(XEN)  VGA is text mode 80x25, font 8x16
(XEN)  VBE/DDC methods: none; EDID transfer time: 2 seconds
(XEN)  EDID info not retrieved because no DDC retrieval method detected
(XEN) Disc information:
(XEN)  Found 2 MBR signatures
(XEN)  Found 2 EDD information structures
(XEN) Xen-e820 RAM map:
(XEN)  0000000000000000 - 000000000009d000 (usable)
(XEN)  000000000009d000 - 00000000000a0000 (reserved)
(XEN)  00000000000e4000 - 0000000000100000 (reserved)
(XEN)  0000000000100000 - 00000000cff50000 (usable)
(XEN)  00000000cff50000 - 00000000cff65000 (ACPI data)
(XEN)  00000000cff65000 - 00000000cff80000 (ACPI NVS)
(XEN)  00000000cff80000 - 00000000d0000000 (reserved)
(XEN)  00000000e0000000 - 00000000f0000000 (reserved)
(XEN)  00000000fec00000 - 00000000fec10000 (reserved)
(XEN)  00000000fee00000 - 00000000fee01000 (reserved)
(XEN)  00000000ff000000 - 0000000100000000 (reserved)
(XEN)  0000000100000000 - 0000000130000000 (usable)
(XEN) ACPI: RSDP 000F61D0, 0014 (r0 PTLTD )
(XEN) ACPI: RSDT CFF5E9DB, 0078 (r1 PTLTD    RSDT    6040000  LTP        0)
(XEN) ACPI: FACP CFF6440A, 0074 (r1 INTEL  TUMWATER  6040000 PTL         3)
(XEN) ACPI: DSDT CFF60525, 3EE5 (r1  Intel BLAKFORD  6040000 MSFT  3000001)
(XEN) ACPI: APIC CFF6447E, 0090 (r1 PTLTD   APIC    6040000  LTP        0)
(XEN) ACPI: SSDT CFF6450E, 00AF (r5 PTLTD  PTL-MI0   6040000 PTEC        1)
(XEN) ACPI: SPMI CFF645BD, 0041 (r5 PTLTD  PTL-SPMI  6040000 PTL         1)
(XEN) ACPI: MCFG CFF645FE, 003C (r1 PTLTD    MCFG    6040000  LTP        0)
(XEN) ACPI: BOOT CFF6463A, 0028 (r1 PTLTD  $SBFTBL$  6040000  LTP        1)
(XEN) ACPI: SPCR CFF64662, 0050 (r1 PTLTD  $UCRTBL$  6040000 PTL         1)
(XEN) ACPI: SLIC CFF646B2, 0176 (r1 OEMID_ OEMTABLE  6040000  LTP        0)
(XEN) ACPI: ERST CFF64828, 0590 (r1 SMCI   ERSTTBL   6040000 SMCI        1)
(XEN) ACPI: HEST CFF64DB8, 00A8 (r1 SMCI   HESTTBL   6040000 SMCI        1)
(XEN) ACPI: BERT CFF64E60, 0030 (r1 SMCI   BERTTBL   6040000 SMCI        1)
(XEN) ACPI: EINJ CFF64E90, 0170 (r1 SMCI   EINJTBL   6040000 SMCI        1)
(XEN) ACPI: SSDT CFF602C6, 025F (r1  PmRef  Cpu0Tst     3000 INTL 20050228)
(XEN) ACPI: SSDT CFF60220, 00A6 (r1  PmRef  Cpu7Tst     3000 INTL 20050228)
(XEN) ACPI: SSDT CFF6017A, 00A6 (r1  PmRef  Cpu6Tst     3000 INTL 20050228)
(XEN) ACPI: SSDT CFF600D4, 00A6 (r1  PmRef  Cpu5Tst     3000 INTL 20050228)
(XEN) ACPI: SSDT CFF6002E, 00A6 (r1  PmRef  Cpu4Tst     3000 INTL 20050228)
(XEN) ACPI: SSDT CFF5FF88, 00A6 (r1  PmRef  Cpu3Tst     3000 INTL 20050228)
(XEN) ACPI: SSDT CFF5FEE2, 00A6 (r1  PmRef  Cpu2Tst     3000 INTL 20050228)
(XEN) ACPI: SSDT CFF5FE3C, 00A6 (r1  PmRef  Cpu1Tst     3000 INTL 20050228)
(XEN) ACPI: SSDT CFF5EA53, 13E9 (r1  PmRef    CpuPm     3000 INTL 20050228)
(XEN) System RAM: 4094MB (4193204kB)
(XEN) Domain heap initialised
(XEN) Processor #0 7:7 APIC version 20
(XEN) Processor #1 7:7 APIC version 20
(XEN) Processor #2 7:7 APIC version 20
(XEN) Processor #3 7:7 APIC version 20
(XEN) IOAPIC[0]: apic_id 4, version 32, address 0xfec00000, GSI 0-23
(XEN) IOAPIC[1]: apic_id 5, version 32, address 0xfec80000, GSI 24-47
(XEN) Enabling APIC mode:  Flat.  Using 2 I/O APICs
(XEN) Using scheduler: SMP Credit Scheduler (credit)
(XEN) Detected 2666.801 MHz processor.
(XEN) Initing memory sharing.
(XEN) VMX: Supported advanced features:
(XEN)  - APIC MMIO access virtualisation
(XEN)  - APIC TPR shadow
(XEN)  - Virtual NMI
(XEN)  - MSR direct-access bitmap
(XEN) HVM: ASIDs disabled.
(XEN) HVM: VMX enabled
(XEN) I/O virtualisation disabled
(XEN) Total of 4 processors activated.
(XEN)  -> Using new ACK method
(XEN) checking TSC synchronization across 4 CPUs: passed.
(XEN) Platform timer appears to have unexpectedly wrapped 1 times.
(XEN) Platform timer is 3.579MHz ACPI PM Timer
(XEN) Allocated console ring of 16 KiB.
(XEN) Brought up 4 CPUs
(XEN) CPUIDLE: disabled due to no HPET. Force enable with 'cpuidle'.
(XEN)  Xen  kernel: 64-bit, lsb, compat32
(XEN)  Dom0 kernel: 64-bit, PAE, lsb, paddr 0x1000000 -> 0x16b1000
(XEN)  Dom0 alloc.:   0000000128000000->000000012c000000 (985849 pages to be
(XEN)  Loaded kernel: ffffffff81000000->ffffffff816b1000
(XEN)  Init. ramdisk: ffffffff816b1000->ffffffff832b4600
(XEN)  Phys-Mach map: ffffffff832b5000->ffffffff83a5a7c8
(XEN)  Start info:    ffffffff83a5b000->ffffffff83a5b4b4
(XEN)  Page tables:   ffffffff83a5c000->ffffffff83a7d000
(XEN)  Boot stack:    ffffffff83a7d000->ffffffff83a7e000
(XEN)  TOTAL:         ffffffff80000000->ffffffff83c00000
(XEN)  ENTRY ADDRESS: ffffffff81502200
(XEN) Dom0 has maximum 4 VCPUs
(XEN) Scrubbing Free RAM: .done.
(XEN) Xen trace buffers: disabled
(XEN) Std. Loglevel: Errors and warnings
(XEN) Guest Loglevel: Nothing (Rate-limited: Errors and warnings)
(XEN) Xen is relinquishing VGA console.
(XEN) *** Serial input -> DOM0 (type 'CTRL-a' three times to switch input to
(XEN) Freed 176kB init memory.

sudo xm info
host                   : beta
release                : 2.6.32-5-xen-amd64
version                : #1 SMP Fri Sep 17 22:00:48 UTC 2010
machine                : x86_64
nr_cpus                : 4
nr_nodes               : 1
cores_per_socket       : 4
threads_per_core       : 1
cpu_mhz                : 2666
hw_caps                :
virt_caps              : hvm
total_memory           : 4094
free_memory            : 256
node_to_cpu            : node0:0-3
node_to_memory         : node0:256
node_to_dma32_mem      : node0:253
max_node_id            : 0
xen_major              : 4
xen_minor              : 0
xen_extra              : .1
xen_caps               : xen-3.0-x86_64 xen-3.0-x86_32p hvm-3.0-x86_32
hvm-3.0-x86_32p hvm-3.0-x86_64
xen_scheduler          : credit
xen_pagesize           : 4096
platform_params        : virt_start=0xffff800000000000
xen_changeset          : unavailable
xen_commandline        : placeholder
cc_compiler            : gcc version 4.4.5 20100824 (prerelease) (Debian
cc_compile_by          : waldi
cc_compile_domain      : debian.org
cc_compile_date        : Fri Sep  3 15:38:12 UTC 2010
xend_config_format     : 4

Steven C. Timm, Ph.D  (630) 840-8525
timm@xxxxxxxx  http://home.fnal.gov/~timm/
Fermilab Computing Division, Scientific Computing Facilities,
Grid Facilities Department, FermiGrid Services Group, Group Leader.
Lead of FermiCloud project.

9845844936 | www.klp.org.in

Xen-users mailing list



Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.