[Box Backup] BoxBackup Server Side Management Specs (Draft0.01)
Ben Summers
boxbackup@fluffy.co.uk
Thu, 23 Sep 2004 09:35:46 +0100
On 23 Sep 2004, at 06:51, Garry Glendown wrote:
> richard_eigenmann wrote:
>> If we implemented such a kind of "backup recognition" algorythm this
>> could
>> speed up backups of remote laptops as perhaps the documents the
>> laptop user
>> has been working on have already been backed up from users back at
>> the base.
>> I imagine this sort of feature could save somewhere between 0.5 and 5
>> GB per
>> workstation that is doing a full backup. This could be significant to
>> the
>> scalability of the boxbackup.
>> Of course this sort of thing probably would lead to massive
>> redevelopment of
>> code and should only be undertaken if there is a very strong demand.
>> I for
>> one don't need it.
>
> Your suggestion might have been derived from what some companies
> already sell as backup solutions ... e.g., InterXion, a large hoster,
> is selling exactly this feature set ... they say they have all mayor
> Windows version and many M$ apps "on file" and if the version found on
> the client machine matches the one on file, it is only stored as
> reference ... they even do this for other files, dynamically extending
> this to everything stored on their server ... IIRC, they still store 3
> copies of each file even with matches ...
>
> Setting up such a feature will probably require a database of MD5 (or
> similar) checksums, filesize plus possibly file names (to reduce
> search time, at the cost of maybe not recognizing a match, but at the
> gain of reducing theoretical overlaps in MD5 checksum) ... probably
> want to define a minimum file size below which you wouldn't want to do
> all the searching/md5'ing due to little gain by matches ... plus if a
> file is used as a reference, the server must maintain it as long as
> there are at least 1 references to it ... if it is changed or deleted
> on the server it originated at, the old version must be kept intact
> for the other backups ... some work, but doable ...
I'll add it to my list of features for the future.
>
> But then, at the moment a nice frontend for M$-users might be of a
> slightly higher importance ...
Yes, there does seem to be a lot of demand for a Win32 version!
Ben