xen-devel

[Top] [All Lists]

[Xen-devel] Re: [RFC PATCH 0/4] (Take 2): transcendent memory ("tmem") f

from [Anthony Liguori]

[Permanent Link][Original]

To:	Dan Magenheimer <dan.magenheimer@xxxxxxxxxx>
Subject:	[Xen-devel] Re: [RFC PATCH 0/4] (Take 2): transcendent memory ("tmem") for Linux
From:	Anthony Liguori <anthony@xxxxxxxxxxxxx>
Date:	Thu, 09 Jul 2009 18:33:15 -0500
Cc:	npiggin@xxxxxxx, akpm@xxxxxxxx, xen-devel@xxxxxxxxxxxxxxxxxxx, tmem-devel@xxxxxxxxxxxxxx, kurt.hackel@xxxxxxxxxx, Rusty Russell <rusty@xxxxxxxxxxxxxxx>, linux-kernel@xxxxxxxxxxxxxxx, dave.mccracken@xxxxxxxxxx, linux-mm@xxxxxxxxx, chris.mason@xxxxxxxxxx, sunil.mushran@xxxxxxxxxx, Avi Kivity <avi@xxxxxxxxxx>, jeremy@xxxxxxxx, Schwidefsky <schwidefsky@xxxxxxxxxx>, Marcelo Tosatti <mtosatti@xxxxxxxxxx>, alan@xxxxxxxxxxxxxxxxxxx, Balbir Singh <balbir@xxxxxxxxxxxxxxxxxx>
Delivery-date:	Thu, 09 Jul 2009 16:33:49 -0700
Envelope-to:	www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to:	<7cb22078-f200-45e3-a265-10cce2ae8224@default>
List-help:	<mailto:xen-devel-request@lists.xensource.com?subject=help>
List-id:	Xen developer discussion <xen-devel.lists.xensource.com>
List-post:	<mailto:xen-devel@lists.xensource.com>
List-subscribe:	<http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=subscribe>
List-unsubscribe:	<http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=unsubscribe>
References:	<7cb22078-f200-45e3-a265-10cce2ae8224@default>
Sender:	xen-devel-bounces@xxxxxxxxxxxxxxxxxxx
User-agent:	Thunderbird 2.0.0.21 (X11/20090320)

Dan Magenheimer wrote:

But this means that either the content of that page must have been
preserved somewhere or the discard fault handler has sufficient
information to go back and get the content from the source (e.g.
the filesystem).  Or am I misunderstanding?


As Rik said, it's the later.

With tmem, the equivalent of the "failure to access a discarded page"
is inline and synchronous, so if the tmem access "fails", the
normal code immediately executes.

Yup. This is the main difference AFAICT. It's really just APIsemantics within Linux.

You could clearly use the volatile state of CMM2 to implement tmem as anAPI in Linux. The get/put functions would set a flag such that if thediscard handler was invoked as long as that operation happened, theoperation could safely fail. That's why I claimed tmem is a subset of CMM2.

I suppose changing Linux to utilize the two tmem services
as described above is a semantic change.  But to me it
seems no more of a semantic change than requiring a new
special page fault handler because a page of memory might
disappear behind the OS's back.

But IMHO this is a corollary of the fundamental difference.  CMM2's
is more the "VMware" approach which is that OS's should never have
to be modified to run in a virtual environment.  (Oh, but maybe
modified just slightly to make the hypervisor a little less
clueless about the OS's resource utilization.)

While I always enjoy a good holy war, I'd like to avoid one here becauseI want to stay on the topic at hand.

If there was one change to tmem that would make it more palatable, forme it would be changing the way pools are "allocated". Instead ofgetting an opaque handle from the hypervisor, I would force the guest toallocate it's own memory and to tell the hypervisor that it's a tmempool. You could then introduce semantics about whether the guest wasallowed to directly manipulate the memory as long as it was in thepool. It would be required to access the memory via get/put functionsthat under Xen, would end up being a hypercall and a copy. Presumablyyou would do some tricks with ballooning to allocate empty memory in Xenand then use those addresses as tmem pools. On KVM, we could dosomething more clever.

The big advantage of keeping the tmem pool part of the normal set ofguest memory is that you don't introduce new challenges with respect tomemory accounting. Whether or not tmem is directly accessible from theguest, it is another memory resource. I'm certain that you'll want todo accounting of how much tmem is being consumed by each guest, and Istrongly suspect that you'll want to do tmem accounting on a per-processbasis. I also suspect that doing tmem limiting for things like cgroupswould be desirable.

That all points to making tmem normal memory so that all thatinfrastructure can be reused. I'm not sure how well this maps to Xenguests, but it works out fine when the VMM is capable of presentingmemory to the guest without actually allocating it (via overcommit).


Regards,

Anthony Liguori

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel

[More with this subject...]

<Prev in Thread]	Current Thread	[Next in Thread>
Re: [Xen-devel] Re: [RFC PATCH 0/4] (Take 2): transcendent memory ("tmem") for Linux, (continued) Re: [Xen-devel] Re: [RFC PATCH 0/4] (Take 2): transcendent memory ("tmem") for Linux, Anthony Liguori Re: [Xen-devel] Re: [RFC PATCH 0/4] (Take 2): transcendent memory ("tmem") for Linux, Jeremy Fitzhardinge Re: [Xen-devel] Re: [RFC PATCH 0/4] (Take 2): transcendent memory ("tmem") for Linux, Anthony Liguori [Xen-devel] Re: [RFC PATCH 0/4] (Take 2): transcendent memory ("tmem") for Linux, Rik van Riel [Xen-devel] RE: [RFC PATCH 0/4] (Take 2): transcendent memory ("tmem") for Linux, Dan Magenheimer [Xen-devel] Re: [RFC PATCH 0/4] (Take 2): transcendent memory ("tmem") for Linux, Rik van Riel [Xen-devel] RE: [RFC PATCH 0/4] (Take 2): transcendent memory ("tmem") for Linux, Dan Magenheimer [Xen-devel] Re: [RFC PATCH 0/4] (Take 2): transcendent memory ("tmem") for Linux, Anthony Liguori [Xen-devel] RE: [RFC PATCH 0/4] (Take 2): transcendent memory ("tmem") for Linux, Dan Magenheimer [Xen-devel] Re: [RFC PATCH 0/4] (Take 2): transcendent memory ("tmem") for Linux, Rik van Riel [Xen-devel] Re: [RFC PATCH 0/4] (Take 2): transcendent memory ("tmem") for Linux, Anthony Liguori <=

Previous by Date:	[Xen-devel] Re: [RFC PATCH 0/4] (Take 2): transcendent memory ("tmem") for Linux, Rik van Riel
Next by Date:	[Xen-devel] Re: [dm-devel] IO controller mini-summit -- Japan, Oct 2009, Akio Takebe
Previous by Thread:	[Xen-devel] Re: [RFC PATCH 0/4] (Take 2): transcendent memory ("tmem") for Linux, Rik van Riel
Next by Thread:	[Xen-devel] [RFC PATCH 1/4] (Take 2): tmem: Core API between kernel and tmem, Dan Magenheimer
Indexes:	[Date] [Thread] [Top] [All Lists]