[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-users] Re: Reg REMUS on two VMs



remus really work ok on 4.2 unstable!

walter

On 31.07.2011, at 04:09, Shriram Rajagopalan <rshriram@xxxxxxxxx> wrote:

I would suggest you upgrade your xen installation. IIRC 4.0.2 still had problems with remus.

On Thu, Jul 28, 2011 at 11:41 PM, Rajendar K <k.rajendar@xxxxxxxxx> wrote:
Hi Sriram,

            Once again, thanks for your timely reply....
            I tested as your suggesstion and i found that i don't have any file in the
location "/var/lib/xen"

[root@globushead xen]# pwd
/var/lib/xen
[root@globushead xen]# ls -ll
total 0

Further i tested the command "xm save " and results as follows
Terminal 1:
xm save -c TestVM1 TestVM1.chkpt

Terminal 2 :
 xm save -c TestVM2 TestVM2.chkpt

& issued command simultaneously.. I could able to get those checkpoint files as follows & no error has occured..


[root@globushead ~]# ls -ll TestVM*
-rwxr-xr-x 1 root root 134508135 Jul 29 09:05 TestVM1.chkpt
-rwxr-xr-x 1 root root 134508135 Jul 29 09:05 TestVM2.chkpt


I hereby represents the xend.log as follows for your reference......
************************************************
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Total pages sent= 32768 (0.94x)
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) (of which 0 were fixups)
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) All memory is saved
[2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:124) [xc_save]: /usr/lib64/xen/bin/xc_save 33 4 0 0 0
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) xc_save: failed to get the suspend evtchn port
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423)
[2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:394) suspend
[2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:127) In saveInputHandler suspend
[2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:129) Suspending 4 ...
[2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:519) XendDomainInfo.shutdown(suspend)
[2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:1891) XendDomainInfo.handleShutdownWatch
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:135) Domain 4 suspended.
[2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:144) Written done
[2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:1891) XendDomainInfo.handleShutdownWatch
[2011-07-29 09:05:15 5355] INFO (XendDomainInfo:2088) Domain has shutdown: name=migrating-TestVM1 id=3 reason=suspend.
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Had 0 unexplained entries in p2m table
[2011-07-29 09:05:15 5355] INFO (XendDomainInfo:2088) Domain has shutdown: name=migrating-TestVM2 id=4 reason=suspend.
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Saving memory pages: iter 1   0%^H^H^H^H  5%outbuf_write: 4194304 > 4161520@12615696
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 11%^H^H^H^H 20%outbuf_write: 4194304 > 4169716@12607500
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 26%outbuf_write: 4194304 > 4169716@12607500
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 32%^H^H^H^H 38%outbuf_write: 4194304 > 4169716@12607500
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 43%outbuf_write: 4194304 > 4169716@12607500
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 49%^H^H^H^H 55%outbuf_write: 4194304 > 4169716@12607500
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 61%outbuf_write: 4194304 > 4169716@12607500
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 67%^H^H^H^H 73%outbuf_write: 622592 > 598004@16179212
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 78%^H^H^H^H 84%outbuf_write: 2662400 > 2007024@14770192
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 90%^H^H^H^H 97%^M 1: sent 32768, skipped 0, delta 321ms, dom0 12%, target 0%, sent 3344Mb/s, dirtied 1127Mb/s 11049 pages
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Total pages sent= 32768 (0.94x)
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) (of which 0 were fixups)
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) All memory is saved
[2011-07-29 09:05:17 5355] INFO (XendCheckpoint:423) Save exit rc=0
[2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3117) XendDomainInfo.resumeDomain(3)
[2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3158) XendDomainInfo.resumeDomain: completed
[2011-07-29 09:05:17 5355] INFO (XendCheckpoint:423) Save exit rc=0
[2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3117) XendDomainInfo.resumeDomain(4)
[2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3158) XendDomainInfo.resumeDomain: completed

************************************************





On Thu, Jul 28, 2011 at 7:50 PM, Shriram Rajagopalan <rshriram@xxxxxxxxx> wrote:
check /var/lib/xen/suspend_evtchn_*_lock.d
Make sure there are different lock files for each domain.
And before starting, make sure there are no stray lock files.

Try this litmus test first. (do both commands simultaneously, in two different
terminals)
terminal 1: xm save -c TestVM1 TestVM1.chkpt
terminal 2: xm save -c TestVM2 TestVM2.chkpt

If this fails, then your problem is something else.
Check the log files /var/log/xen/xend.log for errors.


Most important of all, you are not doing any disk replication!
 So, when the VM recovers on backup, it will be running with an
older version of the disk and will possibly crash

it should be tap2:remus:backuphost:backupport|aio:....


shriram

On Wed, Jul 27, 2011 at 11:55 PM, Rajendar K <k.rajendar@xxxxxxxxx> wrote:
Hi,
       Thanks for your kind reply.
       I have installed xen 4.0.2 in RHEL5 and tried remus. With a single remus command
running for a single virtual machine, it works fine but when i tried for two virtual machine,
the steps which i have tried is as follows,

i have run remus command on two terminals

(i) remus --no-net 1 192.168.36.233
(ii) remus --no-net 2 192.168.36.233


Terminal 1 :
The following error i got where i run command

remus --no-net 1 192.168.36.233

SUSPEND shinfo 00076122
delta 1ms, dom0 100%, target 0%, sent 0Mb/s, dirtied 2457Mb/s 75 pages
 86: sent 72, skipped 0, delta 1ms, dom0 0%, target 0%, sent 2359Mb/s, dirtied 2359Mb/s 72 pages
Total pages sent= 41424 (1.19x)
(of which 0 were fixups)
All memory is saved
PROF: resumed at 1311824828.014423
PROF: flushed memory at 1311824828.015603
PROF: suspending at 1311824828.212492
SUSPEND shinfo 00076122
delta 1ms, dom0 100%, target 0%, sent 0Mb/s, dirtied 2359Mb/s 72 pages
 87: sent 107, skipped 0, delta 1ms, dom0 0%, target 0%, sent 3506Mb/s, dirtied 3506Mb/s 107 pages
Total pages sent= 41531 (1.19x)
(of which 0 were fixups)
All memory is saved
PROF: resumed at 1311824828.215125
PROF: flushed memory at 1311824828.216482
PROF: suspending at 1311824828.426094
domain 1 not shut down
ERROR Internal error: Suspend request failed
ERROR Internal error: Domain appears not to have suspended
Save exit rc=1


Terminal 2 :
The following error i got where i run command

remus --no-net 2 192.168.36.233


Had 0 unexplained entries in p2m table
 1: sent 32377, skipped 391, delta 14193ms, dom0 11%, target 0%, sent 74Mb/s, dirtied 1Mb/s 517 pages
 2: sent 517, skipped 0, delta 219ms, dom0 11%, target 0%, sent 77Mb/s, dirtied 0Mb/s 0 pages
 3: sent 0, skipped 0, Start last iteration
PROF: suspending at 1311824828.327395
domain 2 not shut down
ERROR Internal error: Suspend request failed
ERROR Internal error: Domain appears not to have suspended
Save exit rc=1


--------------------------------------

On the host machine where i run remus command , i can see the two VMs in suspended state
[root@globuscompute ~]# xm list
Name                                        ID   Mem VCPUs      State   Time(s)
Domain-0                                     0  1681     2     r-----     75.1
TestVM1                                      1   128     1     ---s--      5.2
TestVM2                                      2   128     1     ---s--      5.1


And also the destination machine i can see one VM in "b" state
[root@globushead ~]# xm list
Name                                        ID   Mem VCPUs      State   Time(s)
Domain-0                                     0  1684     2     r-----    215.4
TestVM1                                      1   128     1     -b----      0.5


Kindly help me in this situation..

I hereby specify the configuration file which i have used for your reference..

VM1:

kernel="/opt/euca-debian-5.0-i386/xen-kernel/vmlinuz-2.6.24-19-xen"
memory=128
ramdisk="/opt/euca-debian-5.0-i386/xen-kernel/initrd.img-2.6.24-19-xen"
disk = ['tap2:tapdisk:aio:/opt/euca-debian-5.0-i386/debian.5-0.x86.img,xvda1,w']
name="TestVM1"
root="/dev/xvda1 ro"
vif=['bridge=eth0',]


VM 2:

kernel="/opt/euca-debian-5.0-i386/xen-kernel/vmlinuz-2.6.24-19-xen"
memory=128
ramdisk="/opt/euca-debian-5.0-i386/xen-kernel/initrd.img-2.6.24-19-xen"
disk = ['tap2:tapdisk:aio:/opt/euca-debian-5.0-i386/debian.5-0.x86_1.img,xvda2,w']
name="TestVM2"
root="/dev/xvda2 ro"
vif=['bridge=eth0',]



On Wed, Jul 27, 2011 at 6:56 PM, Shriram Rajagopalan <rshriram@xxxxxxxxx> wrote:

Yes you certainly can. What kind of problem are you facing?
shriram

On Wed, Jul 27, 2011 at 5:07 AM, Rajendar K <k.rajendar@xxxxxxxxx> wrote:

Hi Sriram,
               
                Can we run REMUS for more than one VM simultaneously ?
               I have tried it and found  that it is not fruitful..
               Kindly help me for a solution.


--
Raj,
CDAC.




--
Raj,
CDAC.




--
Raj,
CDAC.

_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users
_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users

 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.