[Xen-users] Live migration: 2500ms downtime

To: xen-users@xxxxxxxxxxxxxxxxxxx
Subject: [Xen-users] Live migration: 2500ms downtime
From: "Marconi Rivello" <marconirivello@xxxxxxxxx>
Date: Fri, 10 Aug 2007 08:15:51 -0300
Delivery-date: Fri, 10 Aug 2007 04:16:19 -0700
Hi there,

I've read the paper on Xen live migration, and it reports some very impressive figures, such as 165 ms of downtime for a running web server and 50 ms for a Quake 3 server.

I installed CentOS 5 on 2 servers, each with 2x Xeon E5335 (quad-core), 2x Intel 80003ES2LAN Gb NICs. Then I installed 2 DomUs, also with CentOS 5.

On each server, one NIC is connected to the LAN (same switch and VLAN), and the other connects the 2 servers directly with a crossover cable.

Then I start pinging the DomU that is going to be migrated, at a 100 ms interval, from within the Dom0 that currently hosts it, and migrate the VM. The ping goes over the LAN interface, while the migration runs over the crossover link.

64 bytes from 10.10.241.44: icmp_seq=97 ttl=64 time=0.044 ms
64 bytes from 10.10.241.44: icmp_seq=98 ttl=64 time=0.039 ms
64 bytes from 10.10.241.44: icmp_seq=99 ttl=64 time=0.039 ms
64 bytes from 10.10.241.44: icmp_seq=125 ttl=64 time=0.195 ms
64 bytes from 10.10.241.44: icmp_seq=126 ttl=64 time=0.263 ms
64 bytes from 10.10.241.44: icmp_seq=127 ttl=64 time=0.210 ms

As you can see, the response time is around 40 us before the migration and around 200 us after it, which is understandable since the VM is now on another physical host.

The problem is the 25 packets lost (icmp_seq 100 through 124) during the last phase of the migration. Don't get me wrong: 2.5 s is a very good time, but being 50 times higher than the reported figure isn't.
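
The 2.5 s figure comes from the gap in icmp_seq multiplied by the 100 ms ping interval. As a minimal sketch of that arithmetic in Python (the function name estimate_downtime and the PING_LOG constant are illustrative, not part of any Xen tool, and the log is just the ping output quoted above):

```python
import re

# Ping replies quoted in this message; requests were sent every 100 ms.
PING_LOG = """\
64 bytes from 10.10.241.44: icmp_seq=97 ttl=64 time=0.044 ms
64 bytes from 10.10.241.44: icmp_seq=98 ttl=64 time=0.039 ms
64 bytes from 10.10.241.44: icmp_seq=99 ttl=64 time=0.039 ms
64 bytes from 10.10.241.44: icmp_seq=125 ttl=64 time=0.195 ms
64 bytes from 10.10.241.44: icmp_seq=126 ttl=64 time=0.263 ms
64 bytes from 10.10.241.44: icmp_seq=127 ttl=64 time=0.210 ms
"""

SEQ_RE = re.compile(r"icmp_seq=(\d+)")


def estimate_downtime(log, interval_ms):
    """Return (lost_packets, approx_downtime_ms) for the largest icmp_seq gap.

    Hypothetical helper: it assumes replies appear in order and that
    every missing sequence number is a request sent during the outage.
    """
    seqs = [int(m.group(1)) for m in SEQ_RE.finditer(log)]
    # Missing replies between each pair of consecutive answered requests.
    lost = max((b - a - 1 for a, b in zip(seqs, seqs[1:])), default=0)
    return lost, lost * interval_ms


lost, downtime_ms = estimate_downtime(PING_LOG, interval_ms=100)
print(f"{lost} packets lost, roughly {downtime_ms} ms of downtime")
```

The estimate is only as fine as the ping interval: with one request every 100 ms, the real outage could be up to about one interval longer than the computed value.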

I tried the same test with both machines connected to a hub, and got the same results.

Has anybody tried to measure the downtime during a live migration? What were the results?

Any thoughts and suggestions are much appreciated.

Thanks,
Marconi.