WARNING - OLD ARCHIVES

This is an archived copy of the Xen.org mailing list, which we have preserved to ensure that existing links to archives are not broken. The live archive, which contains the latest emails, can be found at http://lists.xen.org/
   
 
 
Xen 
 
Home Products Support Community News
 
   
 

xen-users

Re: [Xen-users] Hadoop Cluster on Xen

To: rajan chandi <chandi.rajan@xxxxxxxxx>
Subject: Re: [Xen-users] Hadoop Cluster on Xen
From: Taeho Kang <tkang1@xxxxxxxxx>
Date: Tue, 10 Nov 2009 10:15:49 +0900
Cc: Lance Haig <lhaig@xxxxxxxxxxxx>, xen-users@xxxxxxxxxxxxxxxxxxx
Delivery-date: Mon, 09 Nov 2009 17:16:33 -0800
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:cc:content-type; bh=8QCfZrj2EHCE1t/6MOmM253k4Qf0vPFyvM2oy/qfKlg=; b=UsZk66hE47rp0riBkQ7o2Mp9MliuZ6QzjV0iukHnJQl1DT4Bwbb9uB0sm2ktHG/wbB mqZR/WHNHNQwkvLnWk/jhv2gEWYTfKZWQLOfLBCzQCoSZPfzYyxlxbVoaNbjRwhKSTHI T+0jgHgLCYqPbYhmhMCGQDEAlCECT5bHTwWRo=
Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; b=fDM80FIzWkqDJjNE7+pZkraZg7imRzmKktiNnpvieeIm4Y2TGssV8oXm0eJBKLjWsN d7ACRWNCp2QzWPZxJzLdzbHShfYfRlGosPCiKgOtm8DZ2pgTK8xyvp9q4d2i2zNWgOVH VUhLAbAF5Y5UcV1pSgB80szXPV5cXxoy70jH4=
Envelope-to: www-data@xxxxxxxxxxxxxxxxxxx
In-reply-to: <a70bf0e20911090209r4ae29d25w16f2c4c9aea2462c@xxxxxxxxxxxxxx>
List-help: <mailto:xen-users-request@lists.xensource.com?subject=help>
List-id: Xen user discussion <xen-users.lists.xensource.com>
List-post: <mailto:xen-users@lists.xensource.com>
List-subscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=subscribe>
List-unsubscribe: <http://lists.xensource.com/mailman/listinfo/xen-users>, <mailto:xen-users-request@lists.xensource.com?subject=unsubscribe>
References: <1257507099.2901.2.camel@lancehaig> <a70bf0e20911060520r6552b4a1m7a986ebe8bd9792@xxxxxxxxxxxxxx> <4AF7247F.6070603@xxxxxxxxxxxx> <a70bf0e20911090209r4ae29d25w16f2c4c9aea2462c@xxxxxxxxxxxxxx>
Sender: xen-users-bounces@xxxxxxxxxxxxxxxxxxx
The part that can be problematic is the ones that involve I/O - network and disk. 

You have to configure network interface in a way that each domU VM with Hadoop instances on a single dom0 don't fight for a single physical network interface, as it is possible that lots of data could be moved around in between Hadoop Datanodes. 

You probably have to install more than one network interface so that each VM can have a dedicated one. 

Same goes for the disk I/O - you don't want domU's on a single dom0 fight for accesses to a single disk. Make sure you distribute disk accesses among domU's... 

These issues aside, it's a clever way to deploy Hadoop. Amazon offers EC2 with Hadoop, and I think it's pretty much the same as what you are trying to do.

/Taeho  

On Mon, Nov 9, 2009 at 7:09 PM, rajan chandi <chandi.rajan@xxxxxxxxx> wrote:

Hi Lance,

It's not hard to create the cluster. We are using Xen over Debian Lenny.

We're yet to release our SAAS product in production.

So, We've not load tested it yet.

What were your decision factors in choosing HD over other File Systems?

Our research states that MogileFS is more scalable and good for small files..while HDFS is good for large files.

Regards
Rajan


On Mon, Nov 9, 2009 at 1:35 AM, Lance Haig <lhaig@xxxxxxxxxxxx> wrote:
Hi rajan,

We currently run a 10 hadoop cluster on 3 vmware hosts and I want to create a bigger cluster on a xen cluster of servers. so I was just wanting to find out how easy it is. Our HD cluster maxes out the vmware CPUs for about 4 Hours a day doing our analysis.

So I think your answer says it all for me. It is possible so we should test.

Thanks

Lance

rajan chandi wrote:
We are working on MogileFS cloud storage over Xen!

We're going to store documents of small size ~1 MB.

What is your business case?

Regards
Rajan

On Fri, Nov 6, 2009 at 5:01 PM, Lance Haig <lhaig@xxxxxxxxxxxx <mailto:lhaig@xxxxxxxxxxxx>> wrote:

   Hi all,


   Has anyone created a Xen cluster to run a hadoop vm cluster?

   I would be interested in how it performs

   Thanks

   Lance


   --
   This message has been scanned for viruses and
   dangerous content by MailScanner, and is
   believed to be clean.


   _______________________________________________
   Xen-users mailing list
   Xen-users@xxxxxxxxxxxxxxxxxxx <mailto:Xen-users@xxxxxxxxxxxxxxxxxxx> --
This message has been scanned for viruses and
dangerous content by *MailScanner* <http://www.mailscanner.info/>, and is

believed to be clean.


------------------------------------------------------------------------

_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users


--
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.



_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users

_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users
<Prev in Thread] Current Thread [Next in Thread>