This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
Home Products Support Community News


Re: [Xen-devel] Network stops responding after some time

To: Kirk Allan <kallan@xxxxxxxxxx>,xen-devel@xxxxxxxxxxxxxxxxxxx
Subject: Re: [Xen-devel] Network stops responding after some time
From: Adnan Khaleel <adnan@xxxxxxxxxx>
Date: Wed, 9 Aug 2006 14:49:03 -0500
Cc: Ian.Pratt@xxxxxxxxxxxx
Delivery-date: Thu, 10 Aug 2006 01:53:30 -0700
Envelope-to: www-data@xxxxxxxxxxxxxxxxxx
In-reply-to: <44D8C421.39DB.0076.0@xxxxxxxxxx>
List-help: <mailto:xen-devel-request@lists.xensource.com?subject=help>
List-id: Xen developer discussion <xen-devel.lists.xensource.com>
List-post: <mailto:xen-devel@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-devel>, <mailto:xen-devel-request@lists.xensource.com?subject=unsubscribe>
Reply-to: adnan@xxxxxxxxxxxxxxxxxxx
Sender: xen-devel-bounces@xxxxxxxxxxxxxxxxxxx
Hi Krik,

Your problem seems to be identical to mine. Is your kernel x86-64 as well? I 
was wondering if this
problem also manifested in 32bit as well and I never got around to trying to 

Hope we can find a fix for this.


----- Original Message -----
From: Kirk Allan <kallan@xxxxxxxxxx>
To: xen-devel@xxxxxxxxxxxxxxxxxxx
Cc: Ian.Pratt@xxxxxxxxxxxx, Keir Fraser <Keir.Fraser@xxxxxxxxxxxx>, 
Sent: Tue,  8 Aug 2006 18:04:33 -0500
Subject: Re: [Xen-devel] Network stops responding after some time

> I am seeing the same dom0 problem on a box that also has an integrated nForce3
> Ethernet adapter.  The box started out with SLES 10 but I removed the SUSE
> supplied Xen components and am using Xen-unstable change set 10982.
> I am not starting xend, so bridging is not involved.  To eliminate the LAN
> driver issue, I used an Intel E100 adapter.
> It seems that networking goes along OK as long as traffic is light.  To 
> trigger
> the problem, I download a CD ISO image.  Downloading an ISO causes the failure
> every time.  The download progress varies from not even getting started to 426
> MB out of the 676 MB.  When the download progress stops, checking
> /proc/interrupts shows that interrupts for the LAN adapter have stopped.
> As suggested, I tried the 'ioapic_ack=old' Xen boot parameter.  It didn't 
> help.
> I don't believe it is a LAN driver problem because doing a 'rmmod e100' and an
> 'insmod e100.ko' does not get things going again.
> Attached are various log files that will hopefully be of some use.
> xendebug * contains the serial output as xen is booting up.  It also contains
> the dump_irqs from the 'i' and th print_IO_APIC_keyhandler from the 'z' after
> doing the <ctrl a> <ctrl a><ctrl a> from the debug terminal.  I added counters
> to mask_and_ack_level_ioapic_vector and end_level_ioapic_vector.  The
> interesting thing here is that the e100 is using int 19 and 
> mask_and_ack_level_ioapic_vector, end_level_ioapic_vector, and int 19 from
> /proc/interrupts all have the save value (317059) when interrupts stop.  Also
> the irr is set to 1 when interrupts stop.
> cpuinfo * contents of /proc/cpuinfo
> hwinfo * results of doing 'hwinfo'
> dmesg_native * dmesg from a native kernel boot.
> dmesg * dmesg from booting the xen kernel.  After interrupts stop, there were 
> no
> new messages.
> messages - the /var/log/messages for the xen booting kernel.  It also contains
> the 'rmmod e100' and the 'insmod e100' messages after interrupts stopped for 
> the
> e100. 
> ifconfig * the results of doing ifconfig after interrupts stop, after 'rmmod
> e100', and after 'insmod e100' all catted together.
> ints * the results of /proc/interrupts on a native boot, after a xen kernel
> boot, and after interrupts stop on the xen kernel.  Native uses int  201 for 
> the
> e100 and xen uses int 19.
> Any help on this issue is greatly appricated.
> Thanks,
> Kirk
> >>> On Wed, Jul 26, 2006 at  4:07 AM, in message
> <00812518ddfbf4c535e7a4d25d8bbab3@xxxxxxxxxxxx>, Keir Fraser
> <Keir.Fraser@xxxxxxxxxxxx> wrote: 
> > On 24 Jul 2006, at 19:49, Adnan Khaleel wrote:
> > 
> >> I need help in trying to understand why the ethernet driver has locked 
> >> up and how I can go about outputting debug messages. I see in 
> >> /proc/interrrupts that the interrupt count of eth0 just stops 
> >> incrementing. I've tried different (3Com, Realtek 8169, Realtek 8139) 
> >> based network cards and this happens with all. I'm not sure what is so 
> >> unique about my system that might be causing this to lockup, its a 
> >> regular Nforce3 based system with 512MB ram. This problem does not 
> >> happen if I'm not using a Xen enabled kernel. This is entirely 
> >> happening in dom0 and there aren't any user domains so its not a 
> >> bridging issue. I've also disabled the Xen backend drivers (netbk.ko 
> >> and netloop.ko) so its talking to the network chip directly.
> >>
> >> Any help? Please?
> > 
> > Firstly, you need to repro on the kernel from our xen- unstable 
> > repository, which is based on kernel.org Linux 2.6.16. Then build the 
> > same kernel for native i386 and get boot output. Send the unified diff 
> > (diff - u) of the two boot outputs. You may need to tweak the 
> > configuration of the native build to get the output similar to that of 
> > the Xen- based kernel --  we can tell you how to do that when we see your 
> > initial diff.
> > 
> > Another thing worth trying is 'ioapic_ack=old' as a Xen boot parameter 
> > in your bootloader configuration file. It probably won't help, but 
> > worth a try.
> > 
> >   --  Keir
> > 
> > 
> > _______________________________________________
> > Xen- devel mailing list
> > Xen- devel@xxxxxxxxxxxxxxxxxxx
> > http://lists.xensource.com/xen- devel

Xen-devel mailing list