xen-devel

[Top] [All Lists]

Re: [Xen-devel] Xen 3.4.1 NUMA support

from [Andre Przywara]

[Permanent Link][Original]

To:	Dulloor <dulloor@xxxxxxxxx>
Subject:	Re: [Xen-devel] Xen 3.4.1 NUMA support
From:	Andre Przywara <andre.przywara@xxxxxxx>
Date:	Tue, 10 Nov 2009 08:49:56 +0100
Cc:	George Dunlap <george.dunlap@xxxxxxxxxxxxx>, Dan Magenheimer <dan.magenheimer@xxxxxxxxxx>, "xen-devel@xxxxxxxxxxxxxxxxxxx" <xen-devel@xxxxxxxxxxxxxxxxxxx>, Keir Fraser <Keir.Fraser@xxxxxxxxxxxxx>, Papagiannis Anastasios <apapag@xxxxxxxxxxxx>
Delivery-date:	Mon, 09 Nov 2009 23:51:54 -0800
Envelope-to:	www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to:	<940bcfd20911092256t6c664d09ofced3db50211b6da@xxxxxxxxxxxxxx>
List-help:	<mailto:xen-devel-request@lists.xensource.com?subject=help>
List-id:	Xen developer discussion <xen-devel.lists.xensource.com>
List-post:	<mailto:xen-devel@lists.xensource.com>
List-subscribe:	<http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=subscribe>
List-unsubscribe:	<http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=unsubscribe>
References:	<bd4f4a54-5269-42d8-b16d-cbdfaeeba361@default> <4AF82F12.6040400@xxxxxxx> <4AF82FD8.6020409@xxxxxxxxxxxxx> <4AF89D06.9010204@xxxxxxx> <940bcfd20911092256t6c664d09ofced3db50211b6da@xxxxxxxxxxxxxx>
Sender:	xen-devel-bounces@xxxxxxxxxxxxxxxxxxx
User-agent:	Thunderbird 2.0.0.21 (X11/20090329)

Dulloor wrote:

I am not finding this. Can you please point to the code ?

tools/python/xen/xend/XendDomainInfo.py (around line 2600)
with the core code being:
-------------
      index = nodeload.index( min(nodeload) )
      cpumask = info['node_to_cpu'][index]
  for v in range(0, self.info['VCPUs_max']):
      xc.vcpu_setaffinity(self.domid, v, cpumask)
--------------

The code got introduced with c/s 17131 and later got refined with c/s17247 and c/s 17709.


numa=on/off is only for setting up numa in xen (similar to the linux
knob, but turned off by default). The allocation of memory from a
single node (that you observe) could be because of the way
alloc_heap_pages is implemented (trying to allocate from all the heaps
from a node, before trying the next one)

Yes, but if the domain is pinned before it allocated it's memory, thenthe natural behavior of Xen is to take memory from this local node.

- try looking at dump_numa
output. And, affinities are not set anywhere based on the node from
which allocation happens.

It is the other way round, first the domain is pinned, later the memoryis allocated (based on the node to which the currently scheduled CPU isbelonging to).


Regards,
Andre.


-dulloor

On Mon, Nov 9, 2009 at 5:51 PM, Andre Przywara <andre.przywara@xxxxxxx> wrote:

George Dunlap wrote:

Andre Przywara wrote:

BTW: Shouldn't we set finally numa=on as the default value?

Is there any data to support the idea that this helps significantly on
common systems?

I don't have any numbers handy, but I will try if I can generate some.

Looking from a high level perspective it is a shame that it's not the
default: With numa=off the Xen domain loader will allocate physical memory
from some node (maybe even from several nodes) and will schedule the guest
on some other (even rapidly changing) nodes. According to Murphy's law you
will end up with _all_ the memory access of a guest to be remote. But in
fact a NUMA architecture is really beneficial for virtualization: As there
are close to zero cross domain memory accesses (except for Dom0), each node
is more or less self contained and each guest can use the node's memory
controller almost exclusively.
But this is all spoiled as most people don't know about Xen's NUMA
capabilities and don't set numa=on. Using this as a default would solve
this.

Regards,
Andre.

--
Andre Przywara
AMD-Operating System Research Center (OSRC), Dresden, Germany
Tel: +49 351 448 3567 12
----to satisfy European Law for business letters:
Advanced Micro Devices GmbH
Karl-Hammerschmidt-Str. 34, 85609 Dornach b. Muenchen
Geschaeftsfuehrer: Andrew Bowd; Thomas M. McCoy; Giuliano Meroni
Sitz: Dornach, Gemeinde Aschheim, Landkreis Muenchen
Registergericht Muenchen, HRB Nr. 43632


_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel

[More with this subject...]

<Prev in Thread]	Current Thread	[Next in Thread>
Re: [Xen-devel] Xen 3.4.1 NUMA support, (continued) Re: [Xen-devel] Xen 3.4.1 NUMA support, Dulloor Re: [Xen-devel] Xen 3.4.1 NUMA support, George Dunlap Re: [Xen-devel] Xen 3.4.1 NUMA support, Dulloor Re: [Xen-devel] Xen 3.4.1 NUMA support, Juergen Gross Re: [Xen-devel] Xen 3.4.1 NUMA support, George Dunlap Re: [Xen-devel] Xen 3.4.1 NUMA support, Keir Fraser Re: [Xen-devel] Xen 3.4.1 NUMA support, Andre Przywara Re: [Xen-devel] Xen 3.4.1 NUMA support, George Dunlap Re: [Xen-devel] Xen 3.4.1 NUMA support, Andre Przywara Re: [Xen-devel] Xen 3.4.1 NUMA support, Dulloor Re: [Xen-devel] Xen 3.4.1 NUMA support, Andre Przywara <= Re: [Xen-devel] Xen 3.4.1 NUMA support, Andre Przywara RE: [Xen-devel] Xen 3.4.1 NUMA support, Ian Pratt Re: [Xen-devel] Xen 3.4.1 NUMA support, Keir Fraser RE: [Xen-devel] Xen 3.4.1 NUMA support, Ian Pratt Re: [Xen-devel] Xen 3.4.1 NUMA support, Keir Fraser Re: [Xen-devel] Xen 3.4.1 NUMA support, George Dunlap RE: [Xen-devel] Xen 3.4.1 NUMA support, Ian Pratt Re: [Xen-devel] Xen 3.4.1 NUMA support, Keir Fraser RE: [Xen-devel] Xen 3.4.1 NUMA support, Ian Pratt Re: [Xen-devel] Xen 3.4.1 NUMA support, Jan Beulich

Previous by Date:	Re: [Xen-devel] VF as default interface on dom0, Satish Chowdhury
Next by Date:	[Xen-devel] Re: [PATCH] e820: fix clip_to_limit(), Xiao Guangrong
Previous by Thread:	Re: [Xen-devel] Xen 3.4.1 NUMA support, Dulloor
Next by Thread:	Re: [Xen-devel] Xen 3.4.1 NUMA support, Andre Przywara
Indexes:	[Date] [Thread] [Top] [All Lists]