Re: [Xen-devel] Re: [PATCH 1/4] CPU online/offline support in Xen
| To: | "Jiang, Yunhong" <yunhong.jiang@xxxxxxxxx> |  
| Subject: | Re: [Xen-devel] Re: [PATCH 1/4] CPU online/offline support in Xen |  
| From: | Christoph Egger <Christoph.Egger@xxxxxxx> |  
| Date: | Wed, 17 Sep 2008 11:43:31 +0200 |  
| Cc: | "Tian, Kevin" <kevin.tian@xxxxxxxxx>,	"xen-devel@xxxxxxxxxxxxxxxxxxx" <xen-devel@xxxxxxxxxxxxxxxxxxx>,	"Shan, Haitao" <haitao.shan@xxxxxxxxx>,	Gavin Maltby <Gavin.Maltby@xxxxxxx>,	Keir Fraser <keir.fraser@xxxxxxxxxxxxx>,	Haitao Shan <maillists.shan@xxxxxxxxx> |  
| In-reply-to: | <E2263E4A5B2284449EEBD0AAB751098401ABBE479B@xxxxxxxxxxxxxxxxxxxxxxxxxxxx> |  
| References: | <C4EEE682.2707B%keir.fraser@xxxxxxxxxxxxx>	<48D0C868.76E4.0078.0@xxxxxxxxxx>	<E2263E4A5B2284449EEBD0AAB751098401ABBE479B@xxxxxxxxxxxxxxxxxxxxxxxxxxxx> |  
On Wednesday 17 September 2008 11:20:57 Jiang, Yunhong wrote:
> >-----Original Message-----
> >From: xen-devel-bounces@xxxxxxxxxxxxxxxxxxx
> >[mailto:xen-devel-bounces@xxxxxxxxxxxxxxxxxxx] On Behalf Of Jan Beulich
> >Sent: 17 September 2008 15:06
> >To: Christoph Egger; Gavin Maltby
> >Cc: Haitao Shan; Tian, Kevin; xen-devel@xxxxxxxxxxxxxxxxxxx;
> >Shan, Haitao; Keir Fraser
> >Subject: Re: [Xen-devel] Re: [PATCH 1/4] CPU online/offline
> >support in Xen
> >
> >>>> Gavin Maltby <Gavin.Maltby@xxxxxxx> 17.09.08 06:17 >>>
> >>
> >>I don't see this as a problem for machine check correctness.
> >>
> >>If dom0 asks to offline a cpu (because it believes the cpu is busted and
> >>a threat to uptime), that decision is fundamentally asynchronous
> >>to the actual error handling that occurred at machine check exception
> >>time:
> >>
> >>  - running in whatever context
> >>  - MCE occurs
> >>  - trap to hypervisor MCE handler
> >>       . this decides on hypervisor panic, or other appropriate
> >>         immediate (in handler) response
> >>       . telemetry forwarded to dom0 for logging and analysis
> >>  - assume no hypervisor panic
> >>  - eons pass during which any unconstrained bad data remaining
> >>    after initial handling may go anywhere
> >>  - dom0 gets telemetry and let's say diagnoses a fault and
> >>    decides to call back into the hypervisor to offline the
> >>    offending cpu
> >>
> >>Note the "eons pass" bit;  tonnes of instructions may run on the
> >>bad cpu in this time, and a few more for some offline delay won't
> >>hurt.
> >
> >Shouldn't this possibly be handled the other way around: If a recoverable
> >MCE happened, immediately stop scheduling anything on the affected
> >CPU(s), until Dom0 tells you otherwise (and of course as long as there
> >remains at least one CPU to run on).
>
> Current MCE handling in Xen has no mechanism to achieve this.
It has since c/s 17968.
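
To make the policy Jan describes above concrete, here is a minimal sketch in
Python. It is only a toy model of the idea (park a CPU right after a
recoverable MCE, keep at least one CPU schedulable, let Dom0 release it
later). It is not the code from c/s 17968, and every name in it (CpuPool,
on_recoverable_mce, dom0_release) is hypothetical.

```python
class CpuPool:
    """Toy model of the set of CPUs a scheduler may use (hypothetical names)."""

    def __init__(self, ncpus):
        self.online = set(range(ncpus))  # CPUs eligible for scheduling
        self.quarantined = set()         # CPUs parked after a recoverable MCE

    def on_recoverable_mce(self, cpu):
        """Park `cpu` immediately, unless it is the last schedulable CPU."""
        if cpu not in self.online:
            return False                 # unknown CPU or already parked
        if len(self.online) == 1:
            return False                 # keep at least one CPU to run on
        self.online.discard(cpu)
        self.quarantined.add(cpu)
        return True

    def dom0_release(self, cpu):
        """Dom0 decided the CPU is fine: make it schedulable again."""
        if cpu not in self.quarantined:
            return False
        self.quarantined.discard(cpu)
        self.online.add(cpu)
        return True


pool = CpuPool(2)
parked = pool.on_recoverable_mce(1)       # CPU 1 is parked
refused = pool.on_recoverable_mce(0)      # refused: CPU 0 is the last one left
released = pool.dom0_release(1)           # Dom0 brings CPU 1 back
```

The point of the model is the ordering: the parking decision is taken at MCE
time, while the release (or a permanent offline) remains an asynchronous Dom0
decision.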
Christoph
-- 
AMD Saxony, Dresden, Germany
Operating System Research Center