WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 
Xen 
 
Home Products Support Community News
 
   
 

xen-users

[Xen-users] Kernel panic is occured when multi VMs is booting togeter

To: xen-users@xxxxxxxxxxxxxxxxxxx
Subject: [Xen-users] Kernel panic is occured when multi VMs is booting togeter
From: dongkyu lee <dongq.lee@xxxxxxxxx>
Date: Mon, 31 May 2010 19:37:20 +0900
Delivery-date: Mon, 31 May 2010 03:38:42 -0700
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:x-goomoji-body :date:message-id:subject:from:to:content-type; bh=x70gkVKRmwW7WMXJuu9VDjtnrBRY7UY7TFh3Uzljo40=; b=XB2G5u5HFZxXg/njNhY0SUNftOT7TEXf9dorfDNwjPlJRkQMj6uiaepnr2+jYjn930 xQkyrezZ/Dc1/EL1Sl+RBHySow+wTTsUMIRAh67oq4kjR0tcf7ascWpfnT9cxzwp5ClX VxoF3NetW1ocZjHFIGfrz9oTI6qH/4TUYH1UU=
Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:x-goomoji-body:date:message-id:subject:from:to :content-type; b=NKVktaUYtCQPgu5sQvHlFQZOlUiaLAxapI9J0Zh0hFKMJ8/a9MJXVG1mHv0WvY0sgr t8GwT1yqnKs5soIWVqBuNlJgtskq9xJSmU29pAEPp8MNtVti5kgAESI2EXZF6xbHIeKO 123+knc3UBNGlhAxDjBBR/glXaAorFb+x+Yew=
Envelope-to: www-data@xxxxxxxxxxxxxxxxxxx
List-help: <mailto:xen-users-request@lists.xensource.com?subject=help>
List-id: Xen user discussion <xen-users.lists.xensource.com>
List-post: <mailto:xen-users@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=unsubscribe>
Sender: xen-users-bounces@xxxxxxxxxxxxxxxxxxx
This error is not 100% reproducible. However, kernel panic has been occurred many times for last two months.
It usually happens when multi VMs (in our case, 14 VMs) are booting together. 

After we made three big changes to gain more availibility, it began to happen.

Changes are :
   1. Use NAS as vm storage for migration from local disk
   2. Use two bondings for switch HA (with 4 nics) from no bondings
   3. Use Out of Band (OOB) in case of being unable to connect by ssh/rsh 

We thought it should be safer system than before but it is apparently not.
Any advise would be appreciated.

<System Infomation>

1. Xen Version : 3.4.1
2. dom0 memory is set to "dom0_mem=2G"
3. Physical Server Model: HP DL360G6 (Nehalem Server, 48GB RAM)

<Panic Message>

blkback: ring-ref 8, event-channel 9, protocol 1 (x86_64-abi)
blkback: ring-ref 9, event-channel 10, protocol 1 (x86_64-abi)
Unable to handle kernel paging request at ffff880074ec2b68 RIP:
 [<ffffffff804158eb>] skb_copy_bits+0x114/0x1d3
PGD 11a4067 PUD 13a6067 PMD 154e067 PTE 0
Oops: 0000 [1] SMP
last sysfs file: /devices/xen-backend/vbd-1-51712/statistics/wr_sect
CPU 2
Modules linked in: bridge netloop netbk blktap blkbk sg bonding ipv6 xfrm_nalgo crypto_api ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi2 scsi_transport_iscsi2 scsi_transport_iscsi dm_mirror dm_multipath scsi_dh video hwmon backlight sbs i2c_ec i2c_core button battery asus_acpi ac parport_pc lp parport e1000e serial_core bnx2 hpilo serio_raw pcspkr dm_raid45 dm_message dm_region_hash dm_log dm_mod dm_mem_cache shpchp cciss sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd
Pid: 0, comm: swapper Not tainted 2.6.18-164.6.1.el5xen #1
RIP: e030:[<ffffffff804158eb>]  [<ffffffff804158eb>] skb_copy_bits+0x114/0x1d3
RSP: e02b:ffff8800010dfe00  EFLAGS: 00010246
RAX: 0000000000000036 RBX: ffff8800787b16c0 RCX: 0000000000000498
RDX: ffff880079612d00 RSI: ffff880074ec2b68 RDI: ffff88006916b000
RBP: 0000000000000000 R08: ffff880079612d10 R09: 00000000000004ce
R10: 0000000000000498 R11: 0000000000000000 R12: 0000000000000036
R13: 0000000000000036 R14: 0000000000000000 R15: ffff88006916b000
FS:  00002b2df42fcdc0(0000) GS:ffffffff805ca100(0000) knlGS:0000000000000000
CS:  e033 DS: 002b ES: 002b
Process swapper (pid: 0, threadinfo ffff880000d98000, task ffff880000da17e0)
Stack:  0000000200000002  ffff88007c4f2dc0  0000000000000498  ffff8800661dbe80
 ffff88007cf37d00  ffff8800787b16c0  ffff880002cb9f68  ffffffff88478732
 ffff88007cf37800  0000000000000036
Call Trace:
 <IRQ>  [<ffffffff88478732>] :netbk:netif_be_start_xmit+0x241/0x471
 [<ffffffff8042ac95>] __qdisc_run+0x136/0x1f9
 [<ffffffff803ae6cc>] unmask_evtchn+0x2d/0xd7
 [<ffffffff8041be2d>] net_tx_action+0xc9/0xf1
 [<ffffffff80212c99>] __do_softirq+0x8d/0x13b
 [<ffffffff80260da4>] call_softirq+0x1c/0x278
 [<ffffffff8026e0ab>] do_softirq+0x31/0x98
 [<ffffffff8026df37>] do_IRQ+0xec/0xf5
 [<ffffffff803af054>] evtchn_do_upcall+0x13b/0x1fb
 [<ffffffff802608d6>] do_hypervisor_callback+0x1e/0x2c
 <EOI>  [<ffffffff802063aa>] hypercall_page+0x3aa/0x1000
 [<ffffffff802063aa>] hypercall_page+0x3aa/0x1000
 [<ffffffff802999eb>] rcu_pending+0x26/0x50
 [<ffffffff8026f4d5>] raw_safe_halt+0x84/0xa8
 [<ffffffff8026ca50>] xen_idle+0x38/0x4a
 [<ffffffff8024afa1>] cpu_idle+0x97/0xba



_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users
<Prev in Thread] Current Thread [Next in Thread>
  • [Xen-users] Kernel panic is occured when multi VMs is booting togeter, dongkyu lee <=