xen-devel

[Top] [All Lists]

Re: [xen-devel][vNUMA v2][PATCH 2/8] public interface

from [Andre Przywara]

[Permanent Link][Original]

To:	Dulloor <dulloor@xxxxxxxxx>
Subject:	Re: [xen-devel][vNUMA v2][PATCH 2/8] public interface
From:	Andre Przywara <andre.przywara@xxxxxxx>
Date:	Tue, 3 Aug 2010 23:21:53 +0200
Cc:	"xen-devel@xxxxxxxxxxxxxxxxxxx" <xen-devel@xxxxxxxxxxxxxxxxxxx>
Delivery-date:	Tue, 03 Aug 2010 14:25:58 -0700
Envelope-to:	www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to:	<AANLkTimhagg4fUjZ+QwaFtzaLJwGdwbhcx0T6q60Zchj@xxxxxxxxxxxxxx>
List-help:	<mailto:xen-devel-request@lists.xensource.com?subject=help>
List-id:	Xen developer discussion <xen-devel.lists.xensource.com>
List-post:	<mailto:xen-devel@lists.xensource.com>
List-subscribe:	<http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=subscribe>
List-unsubscribe:	<http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=unsubscribe>
References:	<1BEA8649F0C00540AB2811D7922ECB6C9338B4CC@xxxxxxxxxxxxxxxxxxxxxxxxxxxx> <AANLkTimK3KCFz8K7ETcJtOVe9kSCXnhN0BKnrFKkvwMd@xxxxxxxxxxxxxx> <AANLkTimSDabceF5sFwHK3N6a0X1eYiFqaF6BgGhbVcj6@xxxxxxxxxxxxxx> <4C581BA6.3030502@xxxxxxx> <AANLkTimhagg4fUjZ+QwaFtzaLJwGdwbhcx0T6q60Zchj@xxxxxxxxxxxxxx>
Sender:	xen-devel-bounces@xxxxxxxxxxxxxxxxxxx
User-agent:	Thunderbird 2.0.0.18 (X11/20081105)

Dulloor wrote:

On Tue, Aug 3, 2010 at 6:37 AM, Andre Przywara <andre.przywara@xxxxxxx> wrote:

Dulloor wrote:

Interface definition. Structure that will be shared with hvmloader (with
HVMs)
and directly with the VMs (with PV).

-dulloor

Signed-off-by : Dulloor <dulloor@xxxxxxxxx>

+/* vnodes are 1GB-aligned */
+#define XEN_MIN_VNODE_SHIFT (30)

Why that? Do you mean guest memory here? Isn't that a bit restrictive?
What if the remaining system resources do not allow this?
What about a 5GB guest on 2 nodes?
In AMD hardware there is minimum shift of 16MB, so I think 24 bit would
be better.

Linux has stricter restrictions on min vnode shift (256MB afair). And,
I remember
one of the emails from Jan Beulich where the minimum node size was discussed
(but in another context). I will get verify my facts and reply on this.

OK. I was just asking cause I wondered how the PCI hole issue is solved(I haven't managed to review these patches today).

256 MB looks OK to me.

+struct xen_vnode_info {
+    uint8_t mnode_id;  /* physical node vnode is allocated from */
+    uint32_t start;    /* start of the vnode range (in pages) */
+    uint32_t end;              /* end of the vnode range (in pages) */
+};
+

+struct xen_domain_numa_info {
+    uint8_t version;    /* Interface version */
+    uint8_t type;       /* VM memory allocation scheme (see above) */
+
+    uint8_t nr_vcpus;

Isn't that redundant with info stored somewhere else (for instance
in the hvm_info table)?

But, this being a dynamic structure, nr_vcpus and nr_vnodes determine the
actual size of the populated structure. It's just easier to use in the
above helper macros.

Right. That is better. My concern was how to deal with possibleinconsistencies. But the number of VCPUs shouldn't be a problem.

+    uint8_t nr_vnodes;
+    /* data[] has the following entries :
+     * //Only (nr_vnodes) entries are filled, each sizeof(struct
xen_vnode_info)
+     * struct xen_vnode_info vnode_info[nr_vnodes];

Why would the guest need that info (physical node, start and end) here?
Wouldn't be just the size of the node's memory sufficient?

I changed that from size to (start, end) on last review. size should
be sufficient since
all nodes are contiguous. Will revert this back to use size.

start and end look fine on the first glance, but you gain nothing inusing this if you only allow one entry per node. See the simple exampleof 4GB in 2 nodes, the SRAT looks like this:

node0: 0-640K
node0: 1MB - 2GB
node1: 2GB - 3.5GB
node1: 4GB - 4.5GB

In my patches I did this hole-punching in hvmloader and only send 2G/2Gvia hvm_info.From an architectural point of view the Xen tools code shouldn't dealwith these internals if this can be hidden in hvmloader.


Regards,
Andre.


--
Andre Przywara
AMD-Operating System Research Center (OSRC), Dresden, Germany
Tel: +49 351 448-3567-12


_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel

[More with this subject...]

<Prev in Thread]	Current Thread	[Next in Thread>
Re: [xen-devel][vNUMA v2][PATCH 2/8] public interface, (continued) Re: [xen-devel][vNUMA v2][PATCH 2/8] public interface, Dulloor Re: [xen-devel][vNUMA v2][PATCH 2/8] public interface, Andre Przywara Re: [xen-devel][vNUMA v2][PATCH 2/8] public interface, Keir Fraser Re: [xen-devel][vNUMA v2][PATCH 2/8] public interface, Dulloor Re: [xen-devel][vNUMA v2][PATCH 2/8] public interface, Andre Przywara Re: [xen-devel][vNUMA v2][PATCH 2/8] public interface, Keir Fraser RE: [xen-devel][vNUMA v2][PATCH 2/8] public interface, Dan Magenheimer Re: [xen-devel][vNUMA v2][PATCH 2/8] public interface, Andre Przywara Re: [xen-devel][vNUMA v2][PATCH 2/8] public interface, Keir Fraser Re: [xen-devel][vNUMA v2][PATCH 2/8] public interface, Dulloor Re: [xen-devel][vNUMA v2][PATCH 2/8] public interface, Andre Przywara <=

Previous by Date:	Re: [Xen-devel] HVM hypercalls, Ruslan Nikolaev
Next by Date:	Re: [xen-devel][vNUMA v2][PATCH 1/8] Config options, Andre Przywara
Previous by Thread:	Re: [xen-devel][vNUMA v2][PATCH 2/8] public interface, Dulloor
Next by Thread:	[xen-devel][vNUMA v2][PATCH 3/8] Basic cpumap utilities, Dulloor
Indexes:	[Date] [Thread] [Top] [All Lists]