xen-devel


Re: [Xen-devel] Re: NUMA and SMP

from [tgh]


To:	Emmanuel Ackaouy <ack@xxxxxxxxxxxxx>
Subject:	Re: [Xen-devel] Re: NUMA and SMP
From:	tgh <tianguanhua@xxxxxxxxxx>
Date:	Tue, 20 Mar 2007 21:10:10 +0800
Cc:	Ryan Harper <ryanh@xxxxxxxxxx>, "Petersson, Mats" <Mats.Petersson@xxxxxxx>, xen-devel <xen-devel@xxxxxxxxxxxxxxxxxxx>, David Pilger <pilger.david@xxxxxxxxx>, Anthony Liguori <aliguori@xxxxxxxxxxxxxxxxxx>
In-reply-to:	<8790346913e7b2e96fdc58199e039895@xxxxxxxxxxxxx>
References:	<907625E08839C4409CE5768403633E0B018E1879@xxxxxxxxxxxxxxxxx> <8790346913e7b2e96fdc58199e039895@xxxxxxxxxxxxx>

I am puzzled: what is page migration?
Thank you in advance.


Emmanuel Ackaouy wrote:

On the topic of NUMA:

I'd like to dispute the assumption that a NUMA-aware OS can actually
make good decisions about the initial placement of memory in a
reasonable hardware ccNUMA system.

How does the OS know on which node a particular chunk of memory
will be most accessed? The truth is that unless the application or
person running the application is herself NUMA-aware and can provide
placement hints or directives, the OS will seldom beat a round-robin /
interleave or random placement strategy.

To illustrate, consider an app which lays out a bunch of data in memory
in a single thread and then spawns worker threads to process it.

Is the OS to place memory close to the initial thread? How can it possibly
know how many threads will eventually process the data?

Even if the OS knew how many threads will eventually crunch the data,
it cannot possibly know at placement time if each thread will work on an
assigned data subset (and if so, which one) or if it will act as a pipeline
stage with all the data being passed from one thread to the next.
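
As a rough illustration of the scenario above, here is a toy model (not a benchmark, and not anything from Xen). It assumes four nodes with one worker pinned to each, and compares placing all pages next to the initial thread, interleaving them, and a partition-aware placement that is only possible when the application supplies a hint. All names, the node count and the page count are made up for the sketch.

```python
# Toy model, not a benchmark: one worker thread pinned to each NUMA node.
# Node count, page count and both access patterns are illustrative assumptions.
from collections import Counter

NUM_NODES = 4
NUM_PAGES = 1024  # chosen to be a multiple of NUM_NODES


def first_touch(num_pages=NUM_PAGES, init_node=0):
    """Every page lands on the node of the thread that initialised the data."""
    return [init_node] * num_pages


def interleave(num_pages=NUM_PAGES, num_nodes=NUM_NODES):
    """Pages are spread round-robin over all nodes."""
    return [page % num_nodes for page in range(num_pages)]


def partition_aware(num_pages=NUM_PAGES, num_nodes=NUM_NODES):
    """Placement only possible with a hint: each slice sits next to its worker."""
    chunk = num_pages // num_nodes
    return [min(page // chunk, num_nodes - 1) for page in range(num_pages)]


def partitioned_accesses(num_pages=NUM_PAGES, num_nodes=NUM_NODES):
    """Each worker touches only its own contiguous slice: (worker_node, page)."""
    chunk = num_pages // num_nodes
    return [(min(page // chunk, num_nodes - 1), page) for page in range(num_pages)]


def pipeline_accesses(num_pages=NUM_PAGES, num_nodes=NUM_NODES):
    """Every worker is a pipeline stage and touches every page once."""
    return [(node, page) for node in range(num_nodes) for page in range(num_pages)]


def remote_fraction(placement, accesses):
    """Share of accesses whose page lives on a different node than the worker."""
    remote = sum(1 for node, page in accesses if placement[page] != node)
    return remote / len(accesses)


def hottest_node_share(placement, accesses):
    """Share of all accesses served by the single busiest node's memory."""
    served = Counter(placement[page] for _, page in accesses)
    return max(served.values()) / len(accesses)


if __name__ == "__main__":
    patterns = {"partitioned": partitioned_accesses(), "pipeline": pipeline_accesses()}
    policies = {"first-touch": first_touch(), "interleave": interleave(),
                "partition-aware (needs a hint)": partition_aware()}
    for pattern_name, accesses in patterns.items():
        for policy_name, placement in policies.items():
            print(pattern_name, policy_name,
                  remote_fraction(placement, accesses),
                  hottest_node_share(placement, accesses))
```

In this model, placing everything near the initial thread and interleaving give the same remote fraction (0.75) for both access patterns. The difference is that the first funnels every access through one node's memory, while interleaving spreads the load evenly. Only the hinted placement removes remote accesses, and only for the partitioned pattern.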

If you go beyond initial memory placement or start considering memory
migration, then it's even harder to win because you have to pay copy
and stall penalties during migrations. So you have to be real smart
about predicting the future to do better than your ~10-40% memory
bandwidth and latency hit associated with doing simple memory
interleaving on a modern hardware-ccNUMA system.
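
The trade-off above can be put as back-of-the-envelope arithmetic: migrating a page only helps if the latency saved on future accesses exceeds the copy and stall cost paid up front. The sketch below assumes a simple linear cost model. The function names and all nanosecond figures are hypothetical, chosen so that remote access is 40% slower than local, the upper end of the range quoted above.

```python
import math


def migration_break_even(remote_ns, local_ns, copy_cost_ns, stall_cost_ns):
    """Smallest number of future accesses whose savings cover the migration cost.

    Only accesses that actually reach memory count; cache hits save nothing.
    """
    saving_per_access = remote_ns - local_ns
    if saving_per_access <= 0:
        return math.inf  # nothing to gain from moving the page
    return math.ceil((copy_cost_ns + stall_cost_ns) / saving_per_access)


def migration_pays_off(predicted_accesses, remote_ns, local_ns,
                       copy_cost_ns, stall_cost_ns):
    """True if the predicted accesses at least repay the copy and stall penalties."""
    needed = migration_break_even(remote_ns, local_ns, copy_cost_ns, stall_cost_ns)
    return predicted_accesses >= needed


if __name__ == "__main__":
    # Hypothetical figures: 100 ns local, 140 ns remote, 50 us copy, 10 us stall.
    print(migration_break_even(140, 100, 50_000, 10_000))
    print(migration_pays_off(1_000, 140, 100, 50_000, 10_000))
```

With these made-up numbers the page must be accessed from memory 1500 more times before the move breaks even, and that count has to be predicted in advance, which is the hard part.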

And it gets worse for you when your app is successfully taking advantage
of the memory cache hierarchy because its performance is less impacted
by raw memory latency and bandwidth.

Things also get more difficult on a time-sharing host with competing
apps.

There is a strong argument for making hypervisors and OSes NUMA
aware in the sense that:
1- They know about system topology
2- They can export this information up the stack to applications and users
3- They can take in directives from users and applications to partition the
host and place some threads and memory in specific partitions.
4- They use an interleaved (or random) initial memory placement strategy
by default.
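
The four points above could look roughly like the following placement policy. This is a minimal sketch under stated assumptions: the `NumaPlacer` class and its methods are hypothetical, not a Xen or OS interface, and topology is reduced to a plain list of node IDs.

```python
class NumaPlacer:
    """Hypothetical page placer: interleave by default, obey directives if given."""

    def __init__(self, nodes):
        self._nodes = list(nodes)  # point 1: the placer knows the topology
        if not self._nodes:
            raise ValueError("at least one node is required")
        self._next = 0

    def topology(self):
        """Point 2: export the topology up the stack."""
        return tuple(self._nodes)

    def place(self, preferred_node=None):
        """Return the node on which to allocate the next page."""
        if preferred_node is not None:
            # Point 3: an explicit directive from the user or application wins.
            if preferred_node not in self._nodes:
                raise ValueError(f"unknown node {preferred_node}")
            return preferred_node
        # Point 4: with no directive, interleave round-robin across nodes.
        node = self._nodes[self._next % len(self._nodes)]
        self._next += 1
        return node


if __name__ == "__main__":
    placer = NumaPlacer([0, 1, 2, 3])
    print([placer.place() for _ in range(6)])
    print(placer.place(preferred_node=2))
```

The policy deliberately makes no guess of its own: locality comes only from directives, and everything else is interleaved.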

The argument that the OS on its own -- without user or application
directives -- can make better placement decisions than round-robin or
random placement is -- in my opinion -- flawed.

I also am skeptical that the complexity associated with page migration
strategies would be worthwhile: If you got it wrong the first time, what
makes you think you'll do better this time?

Emmanuel.


_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel



