This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
Home Products Support Community News


[Xen-users] Pb with 3ware 9550SX-4LP / high IO activities

To: Xen Users <Xen-users@xxxxxxxxxxxxxxxxxxx>
Subject: [Xen-users] Pb with 3ware 9550SX-4LP / high IO activities
From: Michael Kress <kress@xxxxxxxxxxx>
Date: Mon, 25 Sep 2006 01:29:12 +0200
Delivery-date: Sun, 24 Sep 2006 16:29:19 -0700
Envelope-to: www-data@xxxxxxxxxxxxxxxxxx
List-help: <mailto:xen-users-request@lists.xensource.com?subject=help>
List-id: Xen user discussion <xen-users.lists.xensource.com>
List-post: <mailto:xen-users@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/cgi-bin/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=unsubscribe>
Sender: xen-users-bounces@xxxxxxxxxxxxxxxxxxx
User-agent: Thunderbird (Windows/20060909)

I'm having problems with a 3ware 9550SX-4LP during high IO activities:
the kernel produces a dump and halts the machine. This only shows up
using a xen kernel.

To be more precise:
When I activate the write cache on the controller and then produce high
IO traffic in Xen0 there's a kernel dump and the system halts
completely. I even can't go up in the console's history (with pg-up).
Sorry, but there's also no trace in the syslog file I could give you,
just the screen (see below).
For producing "high IO traffic" it's just enough to do a
mkfs.ext3 /dev/VolGroup00/lvx01
on a logical volume of 100GB size.
I haven't got any guests runing yet.
The mkfs.ext3 goes up to about 530 of 800 inode tables and the system
halts. If I do a dd in another process, the system crashes even earlier.

The dump which I can gather from the screen:
[<d105Z395>] scsi_device_unbusy+0x45z0x80 [scsi_mod]
[<d10534fa>] scsi_softirq_done+0xaa/0x120 [scsi_mod]
[<c01d4a9b>] blk_done_softirq+0x9b/BxcB
[<c012Sa73>] __do_softirq+0x93i8x130
[<c0125b95>] do_softirq+8x85/0xa0
[<c0106b7f>] do_IRQ+0x1fi8x30
[<c023dbb2>] evtchn_do_upcal1+0x92/0x110
[<c8185148>] hypervisor_callback+0x2ci8x34
[<c01e5594>] __copy_from_user_II+8x34/0x50
[<c0146c6b>] yeneric_file_buffered_write+0x22b/0x6c0
[<d1121814>] __ext3_journal_stop+8x24/0x50 [ext3]
[<c014740d>] __yeneric_file_aio_write_nolock+0x30di8x580
[<c023dbb2>] evtchn_do_upcal1+0x92/0x110
[<c0152fe6>] zap_pte_ranye+8x286/0x3f0
[<c0147978>] yeneric_file_aio_write+0x88i8x120
[<d1116814>] ext3_file_write+0x44i8xc5 [ext3]
[<c01693ba>] do_sync_write+0xcai8x130
[<c0153c57>] zeromap_pte_ranye+8x147/0x1f0
[<c0137170>] autoremove_wake_function+0x0/0x60
[<c021c8a4>] read zero+0x1d4i8x230
[<c01a04d9>] dnotify_parent+8x39/0xa0
[<c01695e6>] vfs_write+0x1c6/0x1d0
[<c01696c1>] sys_write+8xS L0x80
[<c0104f85>] suscall_ca11+0x7/Bxb

Can you help me with that problem? I'd like to activate the cache
without having these crashes as writing is much more fast!
The controller setting "Queuing" doesn't spoil the effect, i.e.
caching: off, queuing: on, crash: no
caching: off, queuing: off, crash: no
caching: on, queuing: off or on, crash: yes

I can use a bare CentOS 4.4 with all updates (currently with kernel
2.6.9) but without xen and the system works flawlessly, even with
caching on.

My setup:
3ware 9550SX-4LP + BBU
4x250GB Seagate ST3250820AS SATAII with NCQ
Board: Supermicro X6DH8-G2+
2 x Intel xeon 3.6GHz
Xen0: CentOS4.4 with Xen 3.0.2 and a self-compiled-kernel 2.6.16 (the
one that "came" along with xen's setup routine).

Thank you


Michael Kress, kress@xxxxxxxxxxx
http://www.michael-kress.de / http://kress.net
P E N G U I N S   A R E   C O O L

Xen-users mailing list