On Fri, 2007-01-12 at 20:59 +0530, Ligesh wrote:
> On Fri, Jan 12, 2007 at 10:13:08PM +0800, Tim Post wrote:
> > On Fri, 2007-01-12 at 17:01 +0530, Ligesh wrote:
> > >
> > > Thanks.
> > >
> >
> > I figured I'd chirp in.
> >
> > You guys are trying to make an exact science out of something really
> > dynamic, but I agree an application educated enough to pull this off is
> > sorely needed.
>
>
> If we can't do exact science when we have got the kernel tighly under our
> fists--or at least under an LVM--, then what's the point of it all? :-)
>
> >
> > Lets look at a small paravirtualized domain running AMP, supporting 3 -
> > 5 virtual hosts, each of those vhosts is a blog, forum, wiki, something
> > database driven.
> >
> > 2 Of them are just wordpress blogs, MyISAM tables. 3 of them use innodb
> > tables (row level locking).
> >
>
> Yeah the thing is, virtuozzo has been in the industry for 5 years now, and
> they
> have been doing mostly live backups--though of course, it is always
> recommended to shut
> the vps down, it isn't mandatory like in the case of xen--. So recovering
> from an application
> crash seem to be pretty much possible, especially if it is a propery designed
> software.
> We need to scale to have the ability to manage 10,000 vpses without every
> worrying about what's going on inside each one.
> And this IS the real world situation, as far as hosting is concerned.
I think worrying about what's going on inside of each one *should* be a
goal :) Remember some key differences in VZ :
Burstable ram doesn't lend well to processes caching,
Doesn't lend well to each guest utilizing its own swap,
Single kernel
If *anyone* should be looking at each VM individually, its VZ because it
would be so easy for them to do so.
All we need is a peep at /proc to determine if it is , or isn't a good
time to sync and take a snapshot.
100% swap usage with 100% inode use is *not* a good time to sync
disks :)
> Anyway, again, we are missing the obvious. Linux has a software suspend
> feature.
> The thing with software suspend is that it has to sync the disk properly,
> since the system has to
> work under normal bootup too, and thus merely saving the state won't be
> sufficient to ensure data integrity.
My point exactly.
> This is exactly what Mark has been pointing out. A sync-and-save implemented
> in the domU. And it can actually ensure 100% safe backups.
> So implementing the software suspend inside the domU is all that we need.
>
> xm sync-and-save domU file
> snapshot domU
> backup snapshot + file
> xm restore domU
>
Agreed, but where is the sanity check to ensure 'now is a good time to
sync' ? There *has* to be some sort of communication with the guest if
this is going to be automated, I would think?
My other point is , what happens with guests that have 200 GB root file
systems? That's a heap of HD space needed to back up each one.
What we need (for disaster recovery) is :
/etc
/var
/home
/usr/local
.. and incremental thereof, on a typical (classic) linux system.
While I agree that lvm snapshots are the easiest way (now), if
developing something better, shouldn't a smaller option come into play?
I'd rather run a daemon on the guests (or kernel module) named xenbackup
than have to take a snapshot of every guest that needed a backup.
I don't mean to be argumentative, but real world also implies space
limitations for cost effective operation, cheap HD's, and hosts who sell
enormous chunks of them cheaply :)
Best,
--Tim
_______________________________________________
Xen-users mailing list
Xen-users@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-users
|