xen-users

To: "Dr. Volker Jaenisch" <volker.jaenisch@xxxxxxxxx>
Subject: Re: [Xen-users] Preventing DomU corruption in case of Split-Brain of heartbeat
From: Florian Manschwetus <florianmanschwetus@xxxxxx>
Date: Fri, 17 Oct 2008 17:06:50 +0200
Cc: xen-users@xxxxxxxxxxxxxxxxxxx

Dr. Volker Jaenisch wrote:
> Hi Xen-Users!
> 
> We run a large HA Xen system based on heartbeat2.
> 
> The storage base is an InfiniBand storage cluster exporting iSCSI devices
> to the front-end HA Xen machines. The iSCSI devices are used as physical
> devices for the domUs via the block-iscsi mechanism (by the way, thanks
> for this cool script).
> 
> Recently we had a split brain in our heartbeat system. This caused both
> of our Xen servers to attach the iSCSI device and run the domU on it,
> which severely damaged the domU's filesystem.
> 
> Is there a way to limit the number of iSCSI sessions per iSCSI target,
> so as to prevent the double acquisition of an iSCSI device?
AFAIK, you should use a quorum disk (a disk with a cluster filesystem
that allows concurrent access) so that each server can touch a file on
it to leave a timestamp. A server can then be assumed to be down when it
fails to update its timestamp three times in a row or so (for example,
because it is disconnected from storage). Then the other servers can
take over. When it comes back online, a rejoin can be arranged in the
same way.
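
A rough sketch of the timestamp idea, in Python. Everything named here
(the file layout, HEARTBEAT_INTERVAL, the function names) is made up for
illustration and is not part of heartbeat or Xen. It assumes the quorum
disk is mounted at some directory on every server and that the servers'
clocks are kept in sync (e.g. via NTP), since one server's timestamp is
compared against another server's clock.

```python
import os
import time

HEARTBEAT_INTERVAL = 10.0  # seconds between timestamp updates (assumed value)
MISSED_LIMIT = 3           # "three times in a row or so"


def stamp_path(quorum_dir, node):
    """Hypothetical layout: one small file per server on the quorum disk."""
    return os.path.join(quorum_dir, node + ".stamp")


def write_stamp(quorum_dir, node, now=None):
    """Leave this server's timestamp; raises OSError if storage is unreachable."""
    now = time.time() if now is None else now
    tmp = stamp_path(quorum_dir, node) + ".tmp"
    with open(tmp, "w") as f:
        f.write(repr(now))
        f.flush()
        os.fsync(f.fileno())
    # Rename so that readers never see a half-written file.
    os.replace(tmp, stamp_path(quorum_dir, node))


def read_stamp(quorum_dir, node):
    """Return the last timestamp written by `node`, or None if unreadable."""
    try:
        with open(stamp_path(quorum_dir, node)) as f:
            return float(f.read())
    except (OSError, ValueError):
        return None


def peer_state(quorum_dir, node, now=None):
    """Classify a peer as 'alive', 'down' or 'unknown' from its timestamp."""
    now = time.time() if now is None else now
    last = read_stamp(quorum_dir, node)
    if last is None:
        # We cannot tell whether the peer or our own storage link is broken.
        return "unknown"
    if now - last > MISSED_LIMIT * HEARTBEAT_INTERVAL:
        return "down"
    return "alive"
```

Each server would call write_stamp() once per interval and only consider
starting a peer's domU when peer_state() returns "down". A server that
cannot write its own timestamp should assume that it is the disconnected
one and must not take anything over.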

Florian

> 
> Or does anybody here have an alternative solution to this problem?
> 
> Thanks in advance
> 
> Best regards,
> 
> Volker
> 
> 
> 
> 

