Re: [Xen-devel] Re: DomU lockups after resume from S3 on Core i

To:	Jan Beulich <JBeulich@xxxxxxxxxx>
Subject:	Re: [Xen-devel] Re: DomU lockups after resume from S3 on Core i5 processors
From:	Joanna Rutkowska <joanna@xxxxxxxxxxxxxxxxxxxxxx>
Date:	Tue, 06 Jul 2010 10:59:36 +0200
Cc:	Jeremy Fitzhardinge <jeremy@xxxxxxxx>, xen-devel@xxxxxxxxxxxxxxxxxxx
Delivery-date:	Tue, 06 Jul 2010 02:11:37 -0700
Dkim-signature:	v=1; a=rsa-sha1; c=relaxed/relaxed; d=messagingengine.com; h=message-id:date:from:mime-version:to:cc:subject:references:in-reply-to:content-type; s=smtpout; bh=mPNvSA7fx21N3Qsj0784Miohq/M=; b=Vs6xU1tWQRIc/tHX6dGAHPEYQCQ6KVn7TKt6p55j/IRUTS6A/tjVcH5sWnQLYTgWmHAgWbcz+hPrkE5IfNkDtMKAucsdPpnUvlJWDv9nAeAkPr4vjCYmobler2vKezPbnSHTupsfDYpO2/CR5tqtsVZdQtnxOao2NRQZT3Mzqng=
Envelope-to:	www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to:	<4C33085E0200007800009AE1@xxxxxxxxxxxxxxxxxx>
List-help:	<mailto:xen-devel-request@lists.xensource.com?subject=help>
List-id:	Xen developer discussion <xen-devel.lists.xensource.com>
List-post:	<mailto:xen-devel@lists.xensource.com>
List-subscribe:	<http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=subscribe>
List-unsubscribe:	<http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=unsubscribe>
References:	<4C31B629.7070601@xxxxxxxxxxxxxxxxxxxxxx> <4C324E8C.4030305@xxxxxxxxxxxxxxxxxxxxxx> <4C3257B2.1040002@xxxxxxxxxxxxxxxxxxxxxx> <4C32602A.8070305@xxxxxxxx> <4C326241.2030503@xxxxxxxxxxxxxxxxxxxxxx> <4C3267FB.3070202@xxxxxxxx> <4C33085E0200007800009AE1@xxxxxxxxxxxxxxxxxx>
Sender:	xen-devel-bounces@xxxxxxxxxxxxxxxxxxx
User-agent:	Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.10) Gecko/20100621 Fedora/3.0.5-1.fc13 Lightning/1.0b2pre Thunderbird/3.0.5

On 07/06/10 10:41, Jan Beulich wrote:
>>>> On 06.07.10 at 01:17, Jeremy Fitzhardinge <jeremy@xxxxxxxx> wrote:
>> On 07/05/2010 03:52 PM, Joanna Rutkowska wrote:
>>> On 07/06/10 00:43, Jeremy Fitzhardinge wrote:
>>>> Do you know what's going on it in that it might be waiting
>>>> for?
>>>>     
>>> No idea. I might be guessing that it would be different kernel
>>> subsystems each time -- e.g. when I'm lucky and when the apps got only
>>> "partially" locked up, I can e.g. open new tabs in Google Chrome, I can
>>> see some thumbnails of my popular websites, but without their contents.
>>> This would suggest the networking subsystem is dead, but at the same
>>> time Chrome is apparently communicating fine with the X server in the
>>> DomU (and which in turn talks fine with Dom0 over Xen shared
>>> memory/evtchanl).
>>>
>>> I experienced the above behavior also when had only one VCPU er DomU.
>>>   
>>
>> I've seen similar things with just normal domain save/restore, where the
>> timer interrupt seems to be failing.  Can you ssh into the domain?  I
>> found that I couldn't do an interactive ssh (hung at the prompt), but a
>> non-interactive command would work, so I could cat /proc/interrupts.
> 
> Did either of you try disabling the setting of sched_clock_stable in
> arch/x86/kernel/cpu/intel.c:early_init_intel()? I found this to be a
> requirement in our pv kernels (though in connection with the use of
> C-states, not with S3).
> 
Before I try it -- can you explain what would be the theory behind it,
specifically how this would be related to HT? Clearly it is a HT
problem, and intuitively, I would expect this to be a Xen-side problem,
rather than DomU-side?

joanna.

signature.asc
Description: OpenPGP digital signature

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel

WARNING - OLD ARCHIVES

xen-devel

Re: [Xen-devel] Re: DomU lockups after resume from S3 on Core i5 proces