xen-devel

[Top] [All Lists]

Re: [Xen-devel] [GIT PULL] Fix lost interrupt race in Xen event channels

from [Jan Beulich]

[Permanent Link][Original]

To:	"Jeremy Fitzhardinge" <jeremy@xxxxxxxx>
Subject:	Re: [Xen-devel] [GIT PULL] Fix lost interrupt race in Xen event channels
From:	"Jan Beulich" <JBeulich@xxxxxxxxxx>
Date:	Wed, 25 Aug 2010 08:52:08 +0100
Cc:	"Xen-devel@xxxxxxxxxxxxxxxxxxx" <Xen-devel@xxxxxxxxxxxxxxxxxxx>, Tom Kopec <tek@xxxxxxx>, Linux Kernel Mailing List <linux-kernel@xxxxxxxxxxxxxxx>, Linus Torvalds <torvalds@xxxxxxxxxxxxxxxxxxxx>, Stable Kernel <stable@xxxxxxxxxx>, Daniel Stodden <daniel.stodden@xxxxxxxxxx>
Delivery-date:	Wed, 25 Aug 2010 00:53:03 -0700
Envelope-to:	www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to:	<4C743B2C.8070208@xxxxxxxx>
List-help:	<mailto:xen-devel-request@lists.xensource.com?subject=help>
List-id:	Xen developer discussion <xen-devel.lists.xensource.com>
List-post:	<mailto:xen-devel@lists.xensource.com>
List-subscribe:	<http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=subscribe>
List-unsubscribe:	<http://lists.xensource.com/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=unsubscribe>
References:	<4C743B2C.8070208@xxxxxxxx>
Sender:	xen-devel-bounces@xxxxxxxxxxxxxxxxxxx

 >>> On 24.08.10 at 23:35, Jeremy Fitzhardinge <jeremy@xxxxxxxx> wrote:
> We worked out the root cause was that it was incorrectly treating Xen
> events as level rather than edge triggered interrupts, which works fine
> unless you're handling one interrupt, the interrupt gets migrated to
> another cpu and then re-raised.  This ends up losing the interrupt
> because the edge-triggering of the second interrupt is lost.

While this description would seem plausible at the first glance, it
doesn't match up with unmask_evtchn() already taking care of
exactly this case. Or are you implicitly saying that this code is
broken in some way (if so, how, and shouldn't it then be that
code that needs fixing, or removing if you want to stay with the
edge handling)?

I do however agree that using handle_level_irq() is problematic
(see http://lists.xensource.com/archives/html/xen-devel/2010-04/msg01178.html),
but as said there I think using the fasteoi logic is preferable. No
matter whether using edge or level, the ->end() method will
never be called (whereas fasteoi calls ->eoi(), which would
just need to be vectored to the same function as ->end()).
Without end_pirq() ever called, you can't let Xen know of
bad PIRQs (so that it can disable them instead of continuing
to call the [now shortcut] handler in the owning domain).

> The other change changes IPI and VIRQ event sources to use
> handle_percpu_irq, because treating them as level is also wrong, and
> they're actually inherently percpu events, much like LAPIC vectors.

This doesn't seem right for the general VIRQ case: global ones
should not be disallowed migration between CPUs. Since in your
model the requestor has to pass IRQF_PERCPU anyway,
shouldn't you make the selection of the handler dependent
upon this flag?

Jan

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel

[More with this subject...]

<Prev in Thread]	Current Thread	[Next in Thread>
[Xen-devel] [GIT PULL] Fix lost interrupt race in Xen event channels, Jeremy Fitzhardinge Re: [Xen-devel] [GIT PULL] Fix lost interrupt race in Xen event channels, Jan Beulich <= Re: [Xen-devel] [GIT PULL] Fix lost interrupt race in Xen event channels, Daniel Stodden Re: [Xen-devel] [GIT PULL] Fix lost interrupt race in Xen event channels, Jan Beulich Re: [Xen-devel] [GIT PULL] Fix lost interrupt race in Xen event channels, Jeremy Fitzhardinge Re: [Xen-devel] [GIT PULL] Fix lost interrupt race in Xen event channels, Jan Beulich Re: [Xen-devel] [GIT PULL] Fix lost interrupt race in Xen event channels, Jeremy Fitzhardinge Re: [Xen-devel] [GIT PULL] Fix lost interrupt race in Xen event channels, Jan Beulich Re: [Xen-devel] [GIT PULL] Fix lost interrupt race in Xen event channels, Daniel Stodden Re: [Xen-devel] [GIT PULL] Fix lost interrupt race in Xen event channels, Daniel Stodden Re: [Xen-devel] [GIT PULL] Fix lost interrupt race in Xen event channels, Jeremy Fitzhardinge Re: [Xen-devel] [GIT PULL] Fix lost interrupt race in Xen event channels, Jan Beulich

Previous by Date:	RE: [Xen-devel] [PATCH] Fix hvm vcpu hotplug bug, Liu, Jinsong
Next by Date:	Re: [Xen-devel] Freeze with 2.6.32.19 and xen-4.0.1rc5, acpi problems, Claus Rosenberger
Previous by Thread:	[Xen-devel] [GIT PULL] Fix lost interrupt race in Xen event channels, Jeremy Fitzhardinge
Next by Thread:	Re: [Xen-devel] [GIT PULL] Fix lost interrupt race in Xen event channels, Daniel Stodden
Indexes:	[Date] [Thread] [Top] [All Lists]