WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 
xen-devel

RE: [Xen-devel] Time skew on HP DL785 (and possibly other boxes)

To: Dan Magenheimer <dan.magenheimer@xxxxxxxxxx>, "Xen-Devel (E-mail)" <xen-devel@xxxxxxxxxxxxxxxxxxx>
Subject: RE: [Xen-devel] Time skew on HP DL785 (and possibly other boxes)
From: "Tian, Kevin" <kevin.tian@xxxxxxxxx>
Date: Sat, 28 Mar 2009 10:29:39 +0800
Accept-language: en-US
Cc: "john.v.morris@xxxxxx" <john.v.morris@xxxxxx>
Delivery-date: Fri, 27 Mar 2009 19:30:08 -0700
Envelope-to: www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to: <055de860-7f5f-496c-81ae-df1bf383d4bc@default>
List-help: <mailto:xen-devel-request@lists.xensource.com?subject=help>
List-id: Xen developer discussion <xen-devel.lists.xensource.com>
List-post: <mailto:xen-devel@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=unsubscribe>
References: <055de860-7f5f-496c-81ae-df1bf383d4bc@default>
Sender: xen-devel-bounces@xxxxxxxxxxxxxxxxxxx
Thread-index: AcmvHb47i2Q8Vk51RP2FH2jfDBbG4wALuoAw
Thread-topic: [Xen-devel] Time skew on HP DL785 (and possibly other boxes)
>From: Dan Magenheimer
>Sent: March 28, 2009 4:50
>
>(Raising a yellow flag because this could turn into
>a serious issue for Xen and it may take quite a bit
>of work to come up with a solution.)
>
>We recently measured Xen system time skew on an HP DL785
>and found it to be horrible... nearly a quarter millisecond
>worst case (with only about 10000 samples so it may get worse).
>
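For readers unsure what "worst case" skew means here, a minimal sketch follows. It assumes each sample is a set of per-CPU system-time readings (in nanoseconds) captured at nearly the same instant; the function name and the sample values are made up for illustration and are not the measurements from the DL785.

```python
def worst_case_skew_ns(samples):
    """Return the largest spread (max - min) seen within any one sample.

    Each sample is a list of per-CPU system-time readings in nanoseconds,
    assumed to have been captured at (nearly) the same instant.
    """
    return max(max(readings) - min(readings) for readings in samples)


# Made-up readings for three CPUs over two sampling rounds (nanoseconds).
samples = [
    [1_000_000, 1_000_400, 999_900],
    [2_000_000, 2_240_000, 2_000_100],
]
print(worst_case_skew_ns(samples))
```

Because the result is a maximum over the samples taken so far, collecting more samples can only keep it the same or raise it, which is why 10000 samples may understate the true worst case.
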
>This box uses 8 quad-core AMD chips connected via
>hypertransport.  BUT each chip is on a separate motherboard.
>On this system hypertransport is fast and cross-node
>memory accesses are fast enough so that these NUMA systems
>need not behave like NUMA systems from a memory access
>perspective.  So Xen just views the system as a 32-cpu box
>(other than some code in the memory allocator that tries
>to allocate near-memory where possible, but silently falls
>back to far-memory if necessary) and guest vcpus migrate
>freely between the nodes.  (Correct?)

In that case, users would be better off enabling the NUMA-aware bits
in Xen, which impose some affinity limitations but look like a
reasonable model for large-scale systems.
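
To illustrate what such an affinity limitation could look like on the box described (8 nodes of 4 cores each), here is a rough sketch. It assumes CPUs are numbered contiguously per node, which may not hold on real hardware, and both helper names are hypothetical. It only computes a plan and does not call any Xen interface.

```python
NODES = 8            # quad-core chips in the box described in this thread
CORES_PER_NODE = 4   # assumption: CPUs are numbered contiguously per node


def node_cpus(node):
    """Return the physical CPU numbers assumed to belong to one node."""
    if not 0 <= node < NODES:
        raise ValueError("node out of range")
    start = node * CORES_PER_NODE
    return list(range(start, start + CORES_PER_NODE))


def plan_affinity(num_vcpus, node):
    """Hypothetical helper: restrict every vcpu of a guest to one node."""
    cpus = node_cpus(node)
    return {vcpu: cpus for vcpu in range(num_vcpus)}


print(plan_affinity(2, 3))
```

Keeping all of a guest's vcpus on one node means they only ever see the TSCs of a single chip, at the cost of no longer migrating freely across the whole box.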

Thanks,
Kevin

>
>However, I'm told that it's not possible to route a clocksource
>over hypertransport, so TSCs on processors on different
>motherboards may be VERY different and apparently the
>mechanisms for synchronizing Xen system time across
>motherboards may not be up to the challenge.  As a result,
>OSes and apps sensitive to time that are running on PV
>domains may be in for a rough ride on systems like this.
>(HVM domains may run into other problems because time will
>apparently stop for a "long time".)
>
>Since systems like this are targeted for consolidation
>and virtualization, I see this as a potentially big problem
>as it may appear to real Xen customers as bizarre
>non-reproducible problems, such as "make" failing,
>leading to questions about the stability and viability
>of using Xen.
>
>Comments?
>
>Dan
>
>_______________________________________________
>Xen-devel mailing list
>Xen-devel@xxxxxxxxxxxxxxxxxxx
>http://lists.xensource.com/xen-devel
>