WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 
Xen 
 
Home Products Support Community News
 
   
 

xen-users

[Xen-users] xen cluster down

To: "Xen Users" <xen-users@xxxxxxxxxxxxxxxxxxx>
Subject: [Xen-users] xen cluster down
From: "J. D." <jdonline@xxxxxxxxx>
Date: Tue, 7 Oct 2008 12:53:16 -0400
Delivery-date: Tue, 07 Oct 2008 09:54:00 -0700
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:to :subject:mime-version:content-type; bh=z5u1mn1iAF+dR55mpNLjP/xSzlUbfUVQd0+cymYAulI=; b=eGc1vwPsR7yX26EVul0Wt4of17USjBF/BbLzD4OIBww2J7FBLEJ4hufTcIYUsVJNTE jR80esbq0rFR9oUzkEVAD9LbrZP0TcdFSRCGnM+98iQeAeQe75r3BDc0ZeYWCshjzuEd 17MLWUhDMZamyvlbjhgM8q3ygG4wCBSVIusnY=
Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:to:subject:mime-version:content-type; b=xqd9rBAjHeRQadJb+OYrVZ8zw+gVtRIloagFhJNzLeQaZSAE1AudU6POEj93/jOoCi 5rg99ZGq29HhyMhC6SAq4hfgIqxykhT7/j11bUO4aZjVnJFkQMI855d5uNRUrS//XqAP DXnq7SqHvl3dKfpJ1XuIj8FQCq+MfcwvY5bw0=
Envelope-to: www-data@xxxxxxxxxxxxxxxxxxx
List-help: <mailto:xen-users-request@lists.xensource.com?subject=help>
List-id: Xen user discussion <xen-users.lists.xensource.com>
List-post: <mailto:xen-users@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=unsubscribe>
Sender: xen-users-bounces@xxxxxxxxxxxxxxxxxxx
Hey Ladies and Gentlemen,

We had a problem with our rh5.2 xen cluster last Friday. One of the nodes in our
eight node cluster locked up and became unreachable while cloning a guest.
Shortly thereafter our other seven nodes connection to the san timed out
and a fence war ensued.

Our eight nodes all have a shared directory on the san where the domU guest
disk images are stored. The directory itself is /guests. Once the nodes lost the
ability to read that directory Bad Stuff happened.

This occurred once before during the early testing phases but after rebooting all the nodes everything went
back to normal. Now we are only sporadically able to mount the shared storage /guests
directory.  Has any one else seen similar behavior, or have any ideas on which direction
to go?

Here are the details of the config:
8 1950 dell nodes w/ 32GB ram running rh5.2 dom0
domU are mostly rh5.2 HVM w/ a few rh5.2 PV and one 2003 HVM
redhat cluster suite, conga
EMC CX310 san using emcpowerpath software to provide the /guests directory to
dom0's (GFS2)
Each dom0 is dual-homed and the config is handled via a custom network-bridge
script (network-multi-bridge) on each node

Any advice greatly appreciated,

J. D.
_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users
<Prev in Thread] Current Thread [Next in Thread>
  • [Xen-users] xen cluster down, J. D. <=