xen-users
Re: [Xen-users] Re: Reg REMUS on two VMs
remus really work ok on 4.2 unstable!
I would suggest you upgrade your xen installation. IIRC 4.0.2 still had problems with remus. On Thu, Jul 28, 2011 at 11:41 PM, Rajendar K <k.rajendar@xxxxxxxxx> wrote:
Hi Sriram,
Once again, thanks for your timely reply.... I tested as your suggesstion and i found that i don't have any file in the
location "/var/lib/xen"
[root@globushead xen]# pwd
/var/lib/xen [root@globushead xen]# ls -ll total 0
Further i tested the command "xm save " and results as follows Terminal 1: xm save -c TestVM1 TestVM1.chkpt
Terminal 2 : xm save -c TestVM2 TestVM2.chkpt
& issued command simultaneously.. I could able to get those checkpoint files as follows & no error has occured..
[root@globushead ~]# ls -ll TestVM* -rwxr-xr-x 1 root root 134508135 Jul 29 09:05 TestVM1.chkpt
-rwxr-xr-x 1 root root 134508135 Jul 29 09:05 TestVM2.chkpt
I hereby represents the xend.log as follows for your reference...... ************************************************ [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Total pages sent= 32768 (0.94x)
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) (of which 0 were fixups) [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) All memory is saved [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:124) [xc_save]: /usr/lib64/xen/bin/xc_save 33 4 0 0 0
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) xc_save: failed to get the suspend evtchn port [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:394) suspend
[2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:127) In saveInputHandler suspend [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:129) Suspending 4 ... [2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:519) XendDomainInfo.shutdown(suspend)
[2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:1891) XendDomainInfo.handleShutdownWatch [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:135) Domain 4 suspended. [2011-07-29 09:05:15 5355] DEBUG (XendCheckpoint:144) Written done
[2011-07-29 09:05:15 5355] DEBUG (XendDomainInfo:1891) XendDomainInfo.handleShutdownWatch [2011-07-29 09:05:15 5355] INFO (XendDomainInfo:2088) Domain has shutdown: name=migrating-TestVM1 id=3 reason=suspend. [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Had 0 unexplained entries in p2m table
[2011-07-29 09:05:15 5355] INFO (XendDomainInfo:2088) Domain has shutdown: name=migrating-TestVM2 id=4 reason=suspend. [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Saving memory pages: iter 1 0%^H^H^H^H 5%outbuf_write: 4194304 > 4161520@12615696
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 11%^H^H^H^H 20%outbuf_write: 4194304 > 4169716@12607500 [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 26%outbuf_write: 4194304 > 4169716@12607500
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 32%^H^H^H^H 38%outbuf_write: 4194304 > 4169716@12607500 [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 43%outbuf_write: 4194304 > 4169716@12607500
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 49%^H^H^H^H 55%outbuf_write: 4194304 > 4169716@12607500 [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 61%outbuf_write: 4194304 > 4169716@12607500
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 67%^H^H^H^H 73%outbuf_write: 622592 > 598004@16179212 [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 78%^H^H^H^H 84%outbuf_write: 2662400 > 2007024@14770192
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) ^H^H^H^H 90%^H^H^H^H 97%^M 1: sent 32768, skipped 0, delta 321ms, dom0 12%, target 0%, sent 3344Mb/s, dirtied 1127Mb/s 11049 pages [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) Total pages sent= 32768 (0.94x)
[2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) (of which 0 were fixups) [2011-07-29 09:05:15 5355] INFO (XendCheckpoint:423) All memory is saved [2011-07-29 09:05:17 5355] INFO (XendCheckpoint:423) Save exit rc=0
[2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3117) XendDomainInfo.resumeDomain(3) [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3158) XendDomainInfo.resumeDomain: completed [2011-07-29 09:05:17 5355] INFO (XendCheckpoint:423) Save exit rc=0
[2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3117) XendDomainInfo.resumeDomain(4) [2011-07-29 09:05:17 5355] DEBUG (XendDomainInfo:3158) XendDomainInfo.resumeDomain: completed
************************************************
On Thu, Jul 28, 2011 at 7:50 PM, Shriram Rajagopalan <rshriram@xxxxxxxxx> wrote:
check /var/lib/xen/suspend_evtchn_*_lock.d Make sure there are different lock files for each domain. And before starting, make sure there are no stray lock files.
Try this litmus test first. (do both commands simultaneously, in two different
terminals) terminal 1: xm save -c TestVM1 TestVM1.chkpt terminal 2: xm save -c TestVM2 TestVM2.chkpt
If this fails, then your problem is something else. Check the log files /var/log/xen/xend.log for errors.
Most important of all, you are not doing any disk replication! So, when the VM recovers on backup, it will be running with an older version of the disk and will possibly crash
it should be tap2:remus:backuphost:backupport|aio:....
shriramOn Wed, Jul 27, 2011 at 11:55 PM, Rajendar K <k.rajendar@xxxxxxxxx> wrote:
Hi, Thanks for your kind reply. I have installed xen 4.0.2 in RHEL5 and tried remus. With a single remus command running for a single virtual machine, it works fine but when i tried for two virtual machine,
the steps which i have tried is as follows,
i have run remus command on two terminals
(i) remus --no-net 1 192.168.36.233 (ii) remus --no-net 2 192.168.36.233
Terminal 1 : The following error i got where i run command
remus --no-net 1 192.168.36.233
SUSPEND shinfo 00076122 delta 1ms, dom0 100%, target 0%, sent 0Mb/s, dirtied 2457Mb/s 75 pages 86: sent 72, skipped 0, delta 1ms, dom0 0%, target 0%, sent 2359Mb/s, dirtied 2359Mb/s 72 pages
Total pages sent= 41424 (1.19x) (of which 0 were fixups) All memory is saved PROF: resumed at 1311824828.014423 PROF: flushed memory at 1311824828.015603 PROF: suspending at 1311824828.212492 SUSPEND shinfo 00076122
delta 1ms, dom0 100%, target 0%, sent 0Mb/s, dirtied 2359Mb/s 72 pages 87: sent 107, skipped 0, delta 1ms, dom0 0%, target 0%, sent 3506Mb/s, dirtied 3506Mb/s 107 pages Total pages sent= 41531 (1.19x) (of which 0 were fixups)
All memory is saved PROF: resumed at 1311824828.215125 PROF: flushed memory at 1311824828.216482 PROF: suspending at 1311824828.426094 domain 1 not shut down ERROR Internal error: Suspend request failed
ERROR Internal error: Domain appears not to have suspended Save exit rc=1
Terminal 2 : The following error i got where i run command
remus --no-net 2 192.168.36.233
Had 0 unexplained entries in p2m table
1: sent 32377, skipped 391, delta 14193ms, dom0 11%, target 0%, sent 74Mb/s, dirtied 1Mb/s 517 pages 2: sent 517, skipped 0, delta 219ms, dom0 11%, target 0%, sent 77Mb/s, dirtied 0Mb/s 0 pages 3: sent 0, skipped 0, Start last iteration
PROF: suspending at 1311824828.327395 domain 2 not shut down ERROR Internal error: Suspend request failed ERROR Internal error: Domain appears not to have suspended Save exit rc=1
--------------------------------------
On the host machine where i run remus command , i can see the two VMs in suspended state [root@globuscompute ~]# xm list Name ID Mem VCPUs State Time(s) Domain-0 0 1681 2 r----- 75.1
TestVM1 1 128 1 ---s-- 5.2 TestVM2 2 128 1 ---s-- 5.1
And also the destination machine i can see one VM in "b" state
[root@globushead ~]# xm list Name ID Mem VCPUs State Time(s) Domain-0 0 1684 2 r----- 215.4 TestVM1 1 128 1 -b---- 0.5
Kindly help me in this situation..
I hereby specify the configuration file which i have used for your reference..
VM1:
kernel="/opt/euca-debian-5.0-i386/xen-kernel/vmlinuz-2.6.24-19-xen"
memory=128 ramdisk="/opt/euca-debian-5.0-i386/xen-kernel/initrd.img-2.6.24-19-xen" disk = ['tap2:tapdisk:aio:/opt/euca-debian-5.0-i386/debian.5-0.x86.img,xvda1,w'] name="TestVM1" root="/dev/xvda1 ro"
vif=['bridge=eth0',]
VM 2:
kernel="/opt/euca-debian-5.0-i386/xen-kernel/vmlinuz-2.6.24-19-xen" memory=128 ramdisk="/opt/euca-debian-5.0-i386/xen-kernel/initrd.img-2.6.24-19-xen"
disk = ['tap2:tapdisk:aio:/opt/euca-debian-5.0-i386/debian.5-0.x86_1.img,xvda2,w'] name="TestVM2" root="/dev/xvda2 ro" vif=['bridge=eth0',]
On Wed, Jul 27, 2011 at 6:56 PM, Shriram Rajagopalan <rshriram@xxxxxxxxx> wrote:
Yes you certainly can. What kind of problem are you facing? shriramOn Wed, Jul 27, 2011 at 5:07 AM, Rajendar K <k.rajendar@xxxxxxxxx> wrote:
Hi Sriram, Can we run REMUS for more than one VM simultaneously ?
I have tried it and found that it is not fruitful.. Kindly help me for a solution.
-- Raj, CDAC.
-- Raj, CDAC.
-- Raj, CDAC.
|
_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users
|
|
|