WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 
Xen 
 
Home Products Support Community News
 
   
 

xen-devel

RE: [Xen-devel] [PATCH] Fix softlockup issue after vcpu hotplug

To: "Keir Fraser" <Keir.Fraser@xxxxxxxxxxxx>, "Tian, Kevin" <kevin.tian@xxxxxxxxx>, <xen-devel@xxxxxxxxxxxxxxxxxxx>
Subject: RE: [Xen-devel] [PATCH] Fix softlockup issue after vcpu hotplug
From: "Graham, Simon" <Simon.Graham@xxxxxxxxxxx>
Date: Tue, 30 Jan 2007 14:29:21 -0500
Delivery-date: Tue, 30 Jan 2007 11:29:18 -0800
Envelope-to: www-data@xxxxxxxxxxxxxxxxxx
List-help: <mailto:xen-devel-request@lists.xensource.com?subject=help>
List-id: Xen developer discussion <xen-devel.lists.xensource.com>
List-post: <mailto:xen-devel@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=unsubscribe>
Sender: xen-devel-bounces@xxxxxxxxxxxxxxxxxxx
Thread-index: AcdESFqDCWsISfq5RGeHgxcxVzRqmQACelaDAAAZiDAAAQnwRAATVWmw
Thread-topic: [Xen-devel] [PATCH] Fix softlockup issue after vcpu hotplug
> On 30/1/07 09:54, "Tian, Kevin" <kevin.tian@xxxxxxxxx> wrote:
> 
> > Another simple approach to trigger such warning is to let
> > __xen_suspend() jumps to smp_resume immediately after
> > smp_suspend, as a test case for suspend cancel. People can
> > observe all vcpus except vcpu0 fall into that warning frequently.
> 
> Do you know if this problem has been observed across many versions of
> Xen or
> e.g., only after the upgrade to 2.6.18?
> 

I'm not sure but I think that we've been seeing something very similar
when live migrating domains with 3.0.3/2.6.16.29) -- my understanding is
that the live migration code takes the domain down to UP, does the
migration and then restores SMP -- we VERY often see soft lockup
messages following this (several times per night in our regression
testing) with stack traces identical to those posted by Kevin.

I also added some instrumentation and in every single case, the 'stolen'
time is > 5s when we see the soft lockup.

Simon


_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel