Re: [Xen-users] drbd 8 primary/primary and xen migration on RHEL 5
On 2008-07-31 21:58, nathan@xxxxxxxxxxxx wrote:
I am running DRBD primary/primary on CentOS 5.2 with CLVM and GFS with
no problems. The only issue I have with live migration is that the ARP
entry takes 10-15 seconds to refresh, so you lose connectivity during
that time. I see the problem with the 3.0.x Xen shipped with CentOS 5.2
as well as with Xen 3.2.1.
One can run a job on the VM that generates a packet every second or two
to work around this; ping in a loop should do it (a minimal sketch follows).
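For example, a minimal keepalive sketch in Python, assuming the VM's
default gateway is 192.168.1.1 (any reachable host on the local segment
will do):

  #!/usr/bin/env python
  # Keepalive sketch: emit one small UDP packet per second so the
  # upstream switch re-learns this VM's MAC on its new port promptly
  # after a live migration. The target address is an assumption.
  import socket
  import time

  TARGET = "192.168.1.1"   # assumed default gateway
  PORT = 9                 # UDP discard port

  sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
  while True:
      sock.sendto("keepalive", (TARGET, PORT))
      time.sleep(1)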
My scenario doesn't involve any clustered filesystem. I'm using phy:
drbd devices as the backing for the VM, not files. As far as I
understand things, a clustered filesystem shouldn't be necessary, as
long as the drbd devices are in sync at the moment migration occurs.
But the question remains whether that condition is guaranteed, and I
hope to hear from someone who knows the answer to that question...
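For concreteness, a DomU configuration along these lines might look as
follows (Xen config files are Python syntax; all names here are
assumptions, and the same file must exist on both nodes):

  # Illustrative /etc/xen/myvm fragment. The guest is backed directly
  # by a DRBD device via phy:, with no clustered filesystem involved;
  # /dev/drbd0 must be Primary on whichever node runs the guest.
  name = "myvm"
  memory = 1024
  disk = ['phy:/dev/drbd0,xvda,w']
  vif = ['bridge=xenbr0']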
Anyway, other than the ARP issue, I have this working in production with
about two dozen DomUs.
Note: If you want to use LVM for Xen rather than files on GFS/LVM/DRBD,
you need to run the latest DRBD that supports max-bio-bvecs.
I'm actually running drbd on top of LVM, but I'll look into the
max-bio-bvecs option anyway out of curiosity.
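For reference, max-bio-bvecs is a disk-section option in drbd.conf; a
hedged sketch, with the resource name assumed:

  # Illustrative drbd.conf fragment. max-bio-bvecs limits the number
  # of bvec entries per bio, working around I/O errors seen when DRBD
  # sits on top of LVM and backs a Xen phy: device.
  resource r0 {
    disk {
      max-bio-bvecs 1;
    }
  }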
Thanks for the reply.
On Thu, 31 Jul 2008, Antibozo wrote:
Greetings.
I've reviewed the list archives, particularly the posts from Zakk on
this subject, and found results similar to his. DRBD provides a
block-drbd script, but with full virtualization, at least on RHEL 5,
this does not work: by the time the block script is run, qemu-dm has
already been started.
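For context, the block-drbd script is normally wired in through a drbd:
disk specification in the DomU config, so that Xen can promote the
resource to Primary before attaching it; a sketch, resource name
assumed:

  # Illustrative DomU disk line using DRBD's block script. Xen invokes
  # /etc/xen/scripts/block-drbd, which runs "drbdadm primary r0" before
  # the device is attached. With full virtualization, qemu-dm starts
  # before that hook fires, which is the failure described above.
  disk = ['drbd:r0,xvda,w']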
Instead I've been considering simply keeping the drbd devices in
primary/primary state at all times. I'm concerned about a race
condition, however, and want to ask whether others have examined this
alternative.
I am thinking of a scenario where the VM is running on node A and has
a process that is writing to disk at full speed, so that the drbd
device on node B is lagging. If I perform a live migration from node A
to node B under this condition, the local device on node B might not be
in sync at the time the VM is started on that node. Maybe.
If I use drbd protocol C, theoretically at least, a sync on the device
on node A shouldn't return until node B is fully in sync. So I guess
my main question is: during migration, does xend force a device sync
on node A before the VM is started on node B?
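Short of an authoritative answer, one way to hedge against the race is
to refuse to migrate until the local resource reports Connected and
UpToDate/UpToDate; a Python sketch of such a wrapper (the device minor,
domain name, and destination host are all assumptions):

  #!/usr/bin/env python
  # Pre-migration check sketch: parse /proc/drbd and only invoke
  # "xm migrate --live" once the resource is Connected and both disk
  # states are UpToDate.
  import re
  import subprocess
  import sys
  import time

  MINOR = 0         # check /dev/drbd0
  DOMAIN = "myvm"   # assumed domain name
  DEST = "nodeB"    # assumed destination host

  def drbd_in_sync(minor):
      for line in open("/proc/drbd"):
          m = re.match(r"\s*%d: cs:(\S+) .*ds:([^/\s]+)/([^/\s]+)" % minor, line)
          if m:
              cs, local_ds, peer_ds = m.groups()
              return cs == "Connected" and local_ds == peer_ds == "UpToDate"
      return False

  while not drbd_in_sync(MINOR):
      time.sleep(1)
  sys.exit(subprocess.call(["xm", "migrate", "--live", DOMAIN, DEST]))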
A secondary question I have (and this may be a question for the drbd
folks as well) is: why is the block-drbd script necessary? That is, why
not simply leave the drbd devices primary/primary at all times? What
benefit is there to marking the device secondary on the standby node?
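For reference, dual-primary operation itself is just a config toggle; a
hedged drbd.conf sketch, resource name assumed:

  # Illustrative drbd.conf fragment: dual-primary must be enabled
  # explicitly before both nodes may hold the Primary role at once.
  resource r0 {
    net {
      allow-two-primaries;
    }
  }
  # Then promote on each node:
  #   drbdadm primary r0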
Or am I just very confused? Does anyone else have thoughts or
experience on this matter? All responses are appreciated.
_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users