Where have all the files gone?

I thought you all might be interested to see where all your files live when you back up with Carbonite. This is one aisle of disk drives from our Boston data center. What you're looking at are arrays of 16 1TB data-center grade drives in a RAID-6 array. 3 of the 16 drives would have to fail simultaneously before we would lose any data. This RAID configuration is 36 million times more reliable than a single disk drive. Generally we don't even wait for a drive to fail — we have software that can tell when a drive is starting to get flakey and an alarm goes off on our operations console. A technician pulls the disk and puts in a new one. Within an hour, the new disk is automatically rebuilt and the full redundancy is restored. Every day we back up almost 60 million new files. We have backed up over 11 billion files since we turned our data center on in May 2006. The data center has over 9 petabytes of storage (a petabyte is a million gigabytes). All of this data flows in and out of our data center on two little fiber optic cables the size of a lamp cord. Truly amazing.


Dave
CEO, Carbonite

Related posts

Comments

September 29. 2008 19:40

Eric

Hi Dave,

I'm new in the world of carbonite. First test with you and I find the installation and configuration very easy. A gold medal for you.

I have find the Blog and this post. Nice picture of your DataCenter. Is it possible to know wich configuration you use like Server model, switch and control process ?

Sorry for my poor english, i do my best.

Thank's

Eric

September 30. 2008 15:49

Dave

Eric - What you see in the picture are arrays of 16 1TB drives. These arrays use RAID6 redundancy, meaning that 3 of the 16 drives would have to fail at the same time before any data is lost – an extremely unlikely event (estimated once in several million years). The rest of the infrastructure is extremely complex and I couldn’t begin to describe it on this blog. But suffice it to say that the data comes into our data center at extremely high speeds (up to 20 gigabits/sec), is captured in temporary high-speed cache, and then written out to the storage arrays that you see in the picture. The data are organized so that most drives can be powered down much of the time, thereby saving electricity. The file system that controls all of this was specially designed by us since no commercial file systems can handle anywhere near the kind of volume that we now back up. We have a whole team of engineers working on this back-end infrastructure and another team that works on the PC portion of the product.

Dave

Add comment


(Will show your Gravatar icon)  

  Country flag





Live preview

January 7. 2009 05:58