xen-devel

Re: [Xen-devel] [PATCH] scrub pages on guest termination

To: Keir Fraser <keir.fraser@xxxxxxxxxxxxx>
Subject: Re: [Xen-devel] [PATCH] scrub pages on guest termination
From: Ben Guthro <bguthro@xxxxxxxxxxxxxxx>
Date: Fri, 23 May 2008 13:01:16 -0400
Cc: xen-devel <xen-devel@xxxxxxxxxxxxxxxxxxx>, Robert Phillips <rphillips@xxxxxxxxxxxxxxx>
Delivery-date: Fri, 23 May 2008 10:01:42 -0700
Envelope-to: www-data@xxxxxxxxxxxxxxxxxx
In-reply-to: <C45CA9BA.18E7A%keir.fraser@xxxxxxxxxxxxx>
List-help: <mailto:xen-devel-request@lists.xensource.com?subject=help>
List-id: Xen developer discussion <xen-devel.lists.xensource.com>
List-post: <mailto:xen-devel@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=unsubscribe>
References: <C45CA9BA.18E7A%keir.fraser@xxxxxxxxxxxxx>
Sender: xen-devel-bounces@xxxxxxxxxxxxxxxxxxx
User-agent: Thunderbird 2.0.0.14 (X11/20080501)
Yes, sorry - I should have removed our terminology from the description.
Node = physical machine
VS = HVM guest with PV-on-HVM drivers
Looking back at the original bug report, it seems to indicate the guest was migrating from a system with 2 processors to one with 8.

Specifically - from
Dell Precision WorkStation 380
Processor:    Intel(R) Pentium(R) D CPU 2.80GHz
# of CPUs:    2
Speed:    2.8GHz

to

Supermicro X7DB8
Processor:    Genuine Intel(R) CPU @ 2.13GHz
# of CPUs:    8
Speed:    2.133 GHz

Keir Fraser wrote:
The aim of the loop was to scrub enough pages in a batch that lock contention is kept tolerably low. Even if 16 pages is not sufficient for that, I’m surprised a ‘node’ (you mean a whole system, presumably?) would appear to lock up. Maybe pages would be scrubbed slower than we’d like, but still CPUs should be able to get the spinlock often enough to evaluate whether they have spent 1ms in the loop and hence get out of there.

What sort of system were you seeing the lockup on? Does it have very many physical CPUs?

 -- Keir

On 23/5/08 16:00, "Ben Guthro" <bguthro@xxxxxxxxxxxxxxx> wrote:

This patch solves the following problem.  When a large VS terminates, the node locks up,
because the page_scrub_kick routine sends a softirq to all processors instructing them to
run the page scrub code.  There they interfere with each other as they serialize behind
the page_scrub_lock.
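
[For readers of the archive, a minimal illustrative sketch of the two paths under
discussion follows: the kick that raises the scrub softirq on every online CPU (which
is where the serialization behind page_scrub_lock comes from), and a time-bounded,
batched scrub handler of the kind Keir describes above.  Identifier names such as
page_scrub_list, page_scrub_lock, scrub_one_page, PAGE_SCRUB_SOFTIRQ, NOW and
MILLISECS are written from memory of the Xen tree of that era; this is a sketch, not
the posted patch or verbatim tree code.]

/* Illustrative sketch only -- not the posted patch and not verbatim Xen code.
 * Xen-internal headers (xen/list.h, xen/softirq.h, xen/spinlock.h, xen/time.h)
 * are assumed. */

/* The "kick": raise the scrub softirq on every online CPU.  All of them then
 * enter the handler at once and contend for page_scrub_lock, which is the
 * lockup being reported. */
static void page_scrub_kick_sketch(void)
{
    if ( !list_empty(&page_scrub_list) )
        cpumask_raise_softirq(&cpu_online_map, PAGE_SCRUB_SOFTIRQ);
}

/* The handler: detach a small batch under the lock, scrub it with the lock
 * dropped, and stop after roughly 1ms so each CPU backs off even while work
 * remains on the global list. */
static void page_scrub_softirq_sketch(void)
{
    struct list_head batch;
    struct page_info *pg;
    s_time_t start = NOW();
    int i;

    do {
        INIT_LIST_HEAD(&batch);

        /* Hold the contended lock only long enough to move 16 pages from the
         * global dirty-page list onto a local batch list. */
        spin_lock(&page_scrub_lock);
        for ( i = 0; (i < 16) && !list_empty(&page_scrub_list); i++ )
        {
            pg = list_entry(page_scrub_list.next, struct page_info, list);
            list_move_tail(&pg->list, &batch);
        }
        spin_unlock(&page_scrub_lock);

        /* Zero the detached pages without holding the lock; a real
         * implementation would then hand them back to the free heap. */
        list_for_each_entry ( pg, &batch, list )
            scrub_one_page(pg);

        /* Time budget: leave the loop after ~1ms even if the global list is
         * not yet empty, so this CPU does not spin here indefinitely. */
    } while ( !list_empty(&page_scrub_list) &&
              ((NOW() - start) < MILLISECS(1)) );
}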


_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel