To: xen-users@xxxxxxxxxxxxxxxxxxx
Subject: [Xen-users] Dell Poweredge 2650 - heavy IO hangs domU machines; xen 2.0.7, xen kernel 2.6.11.12
From: Stephen Bosch <posting@xxxxxxxxxxx>
Date: Fri, 10 Feb 2006 11:25:22 -0700
Delivery-date: Fri, 10 Feb 2006 18:36:54 +0000
User-agent: Mozilla Thunderbird 1.0.7 (X11/20051208)

Hello:

We are running three domU machines on a Dell 2650 and using Bacula to do
backups to an Exabyte VXA SCSI tape drive attached to the external
channel of a PERC 3 DC, with a RAID 1 running on the internal channel.

Xen version is 2.0.7
Kernel is xen-kernel-2.6.11.12

The Bacula storage daemon runs on dom0.

When we begin a large backup (several gigabytes), all of the domU
machines lock up, whether or not they are involved in the backup.

Characteristics of the lockup:
- We lose all network connectivity to all of the domUs. We cannot ping
them or ssh to them; even an nmap scan fails.

- The dom0 is still running fine.

- We can 'xm console' to the affected domUs and get a login prompt, but
we can only enter the login id; the login times out waiting for the
password prompt.


Eventually the Bacula backup times out, and at that point the machines
come back to life. This takes about 15 to 20 minutes. The backup,
however, does not complete successfully. In fact, very little happens on
the backup at all :)

We're very puzzled by this -- we suspect an interrupt issue, but we
really don't have a clue where to start looking. Other people seem to
have reported similar IO-related problems.
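
One possible place to start with the interrupt theory is to check whether
any devices on dom0 share an interrupt line. The following is only a
minimal sketch, assuming the usual Linux /proc/interrupts layout (a header
row of CPU columns, then one row per IRQ ending in a comma-separated list
of device names). The sample text and the device names in it (scsi_hba,
eth0) are made up for illustration and are not taken from this machine.

```python
# Sketch: list IRQ lines that have more than one device registered on them.
# Assumes the common /proc/interrupts layout; SAMPLE below is invented data.

SAMPLE = """\
           CPU0       CPU1
  0:   12345678          0    IO-APIC-edge  timer
 14:      40210          0    IO-APIC-edge  ide0
 18:     901234          0   IO-APIC-level  scsi_hba, eth0
NMI:          0          0
"""


def parse_interrupts(text):
    """Map each numeric IRQ to the list of device names registered on it."""
    lines = text.splitlines()
    if not lines:
        return {}
    ncpu = len(lines[0].split())  # header row: CPU0 CPU1 ...
    irqs = {}
    for line in lines[1:]:
        label, sep, rest = line.partition(":")
        label = label.strip()
        if not sep or not label.isdigit():
            continue  # skip summary rows such as NMI, LOC, ERR
        fields = rest.split(None, ncpu)  # per-CPU counts, then the remainder
        if len(fields) <= ncpu:
            continue  # nothing after the counters
        pieces = [p.strip() for p in fields[ncpu].split(",")]
        head = pieces[0].split()
        if len(head) < 2:
            continue  # controller type only, no device name
        # The last token before the first comma is the first device name;
        # everything before it is the interrupt controller description.
        devices = [head[-1]] + [p for p in pieces[1:] if p]
        irqs[int(label)] = devices
    return irqs


def shared_irqs(irqs):
    """Return only the IRQs that have two or more devices attached."""
    return {irq: devs for irq, devs in irqs.items() if len(devs) > 1}


if __name__ == "__main__":
    # On a real system, pass open("/proc/interrupts").read() instead.
    for irq, devs in sorted(shared_irqs(parse_interrupts(SAMPLE)).items()):
        print("IRQ %d shared by: %s" % (irq, ", ".join(devs)))
```

A shared line would not prove anything by itself, but it would narrow down
which devices to look at first.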

Ideas?

-Stephen-
