SpiderOak Review - avdi.codes

Dan Mayer asked about my experience with SpiderOak backup. I realized if I was going to type up my thoughts I might as well post them where others could benefit from them as well.

SpiderOak is a cloud-based backup, file synchronization, and sharing service. Unlike services such as Dropbox and ZumoDrive, the emphasis is on the backup part rather than the sync-and-share part.

At present, I have about 384GB from four different machines backed up on SpiderOak’s servers. After SpiderOak’s de-duplication and compression, this comes to about 115GB of usage, so I fit easily in the 200GB plan.

Since I started using SpiderOak a year or so ago, there have been several instances where I was able to retrieve an accidentally deleted file, or grab a set of files that I had only saved on another machine. I’ve also switched over to using SpiderOak for synchronizing the large set of files that I keep consistent across all my machines. I used to use a VCS for this, but that was tedious and breakage-prone. With SpiderOak it pretty much just works. I’ve also made occasional use of the file-sharing features. So it has definitely been useful to me.

Pros:

Unlike many cloud-based storage systems, SpiderOak uses zero-knowledge encryption. That means that data is chunked and encrypted on the client side, and all the SpiderOak servers know is opaque data. For me, this is an essential requirement for a full-fledged backup service (as compared to a sync-and-share tool).
They support, and even encourage, using it on as many machines as you want. There is no cap on the number of systems backed up.
It works on Windows, Mac OS X, and Linux. This was a big selling point for me, since I tend to have heterogeneous systems, or at least I did for a while.
The de-duplication works well, and means that (for instance) if you have your entire music collection on two separate machines, the backup won’t take any extra space for the second machine. And then you can have SpiderOak sync the two collections!
SpiderOak keeps version histories of files, not just a single backup.
At $1/GB/year, it is cheaper than many of the alternatives.
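
To make the de-duplication point concrete, here is a minimal sketch of content-addressed chunk storage. It is not SpiderOak's actual implementation: the fixed chunk size, the use of SHA-256, and the names `backup`, `restore`, and `stored_bytes` are all assumptions for illustration, and the client-side encryption step is left out entirely.

```python
import hashlib

CHUNK_SIZE = 8  # bytes; unrealistically small, chosen only to keep the demo readable


def backup(data: bytes, store: dict) -> list:
    """Split data into fixed-size chunks and store each unique chunk once.

    Returns a manifest: the ordered list of chunk digests needed to
    rebuild the data. (Hypothetical helper, not a SpiderOak API.)
    """
    manifest = []
    for offset in range(0, len(data), CHUNK_SIZE):
        chunk = data[offset:offset + CHUNK_SIZE]
        digest = hashlib.sha256(chunk).hexdigest()
        # A chunk that is already stored costs no additional space.
        store.setdefault(digest, chunk)
        manifest.append(digest)
    return manifest


def restore(manifest: list, store: dict) -> bytes:
    """Rebuild the original data from its manifest."""
    return b"".join(store[digest] for digest in manifest)


def stored_bytes(store: dict) -> int:
    """Total size of the unique chunks held in the store."""
    return sum(len(chunk) for chunk in store.values())


# The same "music collection" backed up from two machines.
store = {}
music = b"the same music collection on two machines"

laptop_manifest = backup(music, store)
size_after_first = stored_bytes(store)

desktop_manifest = backup(music, store)
size_after_second = stored_bytes(store)

print(size_after_first, size_after_second)  # the second backup adds nothing
```

Each machine keeps its own manifest, but the chunks are shared, which is why a second copy of the same files takes no extra space.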

Cons:

Because it is zero-knowledge, SpiderOak can’t delegate any of its processing to the cloud. All the chunking, deduplication, encryption, and sync calculations have to be done on the client side. This can put a considerable burden on your systems in terms of CPU and memory usage. It’s not unusual for CPU and disk usage to suddenly spike for a few seconds after changing some files, as SpiderOak detects the change and backs it up.
In particular, it has an issue when its index contains backups with an exceptionally large number of directories, as happens when backing up the full contents of Linux systems. In this case, its ambient RAM usage can balloon. It’s not unusual for SpiderOak to consume 500MB-1GB of RAM on my machine. The SpiderOak developers are working on moving the entire file index from an in-memory database to an on-disk one, at which point this should no longer be an issue. I don’t know what the ETA on that is, though; I’ve been waiting for a while.
Again, probably as a result of its zero-knowledge nature, it isn’t as snappy a file-syncing tool as something like Dropbox, nor does it offer sync folders shared across multiple users. So there are some things that I still use Dropbox for.
The UI is only so-so, and as yet doesn’t lend itself very well to managing your backup set. I’d love to do some selective pruning in my backup fileset, but there’s no easy way to zero in on which directories are taking up the most space.
SpiderOak does not provide any kind of hardcopy backup service. I’d happily pay them extra to send me a stack of DVDs with the opaque, encrypted, de-duplicated data on them once a year – preferably along with a boot disk that had SO preinstalled – but no such luck.
I can’t find any detailed technical information on SpiderOak’s cloud. I’d love to know more about their physical security and how much redundancy they have.
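
On the pruning complaint above: one rough workaround is to measure the directories locally before deciding what to exclude. The sketch below assumes you only want on-disk sizes of the immediate subdirectories of a backup root; the function names `tree_size` and `largest_subdirs` are made up for this example and nothing here talks to SpiderOak.

```python
import os


def tree_size(path: str) -> int:
    """Sum the sizes of all regular files under path, skipping symlinks."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(path):
        for name in filenames:
            full = os.path.join(dirpath, name)
            if os.path.islink(full):
                continue
            try:
                total += os.path.getsize(full)
            except OSError:
                # File vanished or is unreadable; ignore it.
                pass
    return total


def largest_subdirs(root: str, top: int = 10) -> list:
    """Return up to `top` (size_in_bytes, path) pairs, largest first."""
    sizes = []
    with os.scandir(root) as entries:
        for entry in entries:
            if entry.is_dir(follow_symlinks=False):
                sizes.append((tree_size(entry.path), entry.path))
    sizes.sort(reverse=True)
    return sizes[:top]


# Example (the path is a placeholder):
# for size, path in largest_subdirs("/home/me"):
#     print(f"{size / 1e9:8.2f} GB  {path}")
```

Keep in mind this reports local sizes before de-duplication and compression, so it only approximates what a directory costs on the server side.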

4 comments

Good review. Based on your review and my own experience, I think SpiderOak stands apart for at least four reasons: (1) better security, (2) the ability to back up multiple computers and removable storage devices for free, (3) the Local Copy feature, and (4) more free storage space.

#1 – As discussed in your article, SpiderOak provides 100% local encryption of data before it is sent to the SpiderOak server, and they have a comprehensive Zero Knowledge Policy, which means they never have access to your data. They also have an option for 2-Factor Authentication.

#2 – Another major selling point is that SpiderOak lets you back up multiple computers AND removable storage (e.g., my USB drives).

#3 – SpiderOak has a powerful “Local Copy” feature that allows you to specify a local drive, a network folder, or your FTP or SFTP server as a secondary place to store the encrypted data blocks SpiderOak creates. The location doesn’t have to be secure because the data blocks will remain encrypted, just like they are when uploaded to the SpiderOak Cloud. “In addition to providing a local copy of the all of the data stored within SpiderOak, this option will drastically increase large restores and downloads as SpiderOak will check with the local copy before downloading a folder or file from the SpiderOak Cloud.”

#4 – Plus, in addition to the 2GB free storage that they advertise, you can get an additional 1GB bonus free for life (3GB total) using the following referral link: https://spideroak.com/download/referral/24515ffa6eff79fd7e06a0c08d9e5253

I would use the Amazon.com Cloud Drive or Box.net for simple file storage and sharing, but for secure, reliable backups of important or personal data, I would choose SpiderOak.

As it so happens, I’m a backend engineer with SpiderOak who just happened to come across this post while going through your archives. Glad to hear you like the service! Has anything in particular gotten better with your experience?

Avdi Grimm says:

October 3, 2011 at 9:17 pm

Actually, I’m afraid I just canceled my account because it wasn’t improving fast enough to be worth the price. In particular, the lack of a way to force a FULL offline local backup set to be kept in parallel with the cloud one made it less useful than some of the competition. And the wait for the memory usage improvements was really aggravating.


I configured SpiderOak and now I can easily back up, sync, share, remotely access, and store all my data. I can add as many computers and drives as I like to a single account, so all my data can live in a central location. If you are interested, please be sure to use the download link below and we will both receive an additional free 1 GB of storage. You can also feel free to refer as many friends as you like. If you use the WORLDBACKUPDAY promo code you can gain 6 GB in total.

https://spideroak.com/download/referral/4b45441744da1e2594e73340d41e2553
