WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 
Xen 
 
Home Products Support Community News
 
   
 

xen-devel

[Xen-devel] Re: Stability of migration?

To: xen-devel <xen-devel@xxxxxxxxxxxxxxxxxxx>
Subject: [Xen-devel] Re: Stability of migration?
From: John Byrne <john.l.byrne@xxxxxx>
Date: Tue, 13 Jun 2006 18:12:25 -0700
Delivery-date: Tue, 13 Jun 2006 18:12:51 -0700
Envelope-to: www-data@xxxxxxxxxxxxxxxxxx
In-reply-to: <448F5934.1010805@xxxxxx>
List-help: <mailto:xen-devel-request@lists.xensource.com?subject=help>
List-id: Xen developer discussion <xen-devel.lists.xensource.com>
List-post: <mailto:xen-devel@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=unsubscribe>
References: <448F5934.1010805@xxxxxx>
Sender: xen-devel-bounces@xxxxxxxxxxxxxxxxxxx
User-agent: Thunderbird 1.5.0.4 (X11/20060516)
I should have made clear I am testing on x86_64.

John

John Byrne wrote:

Hi,

With xen-unstable changset 10333:360f9dc71f51, live migration is not reliable. Migrating an active domain (I use a kernel build in my test) back and forth between two machines will result in the build or the domain crashing. I tweaked xc_linux_save.c to enable the verify pass without outputting all the debugging messages and I can see that one or two pages do not get a data match in the log.

I have yet to see a failure of the domain with non-live migration, but I sometimes see a data mismatch on a page during the verification. Which would indicate that either suspend doesn't mean what I think it does or pages of a suspended VM are being altered when they shouldn't be.

So, I guess I'll start with the easy question: should non-live migration ever have a page fail to verify? If not, how can I identify the source of the problem?

The harder question: how to identify the source of the corruption in live migration?

Thanks,

John Byrne



_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel

<Prev in Thread] Current Thread [Next in Thread>