Deduplication ratios and their impact on DR cost savings

There is an interesting blog discussion between Dipash Patel from CommVault and W. Curtis Preston from Backup Central and TruthinIT regarding the increasing or decreasing benefits of deduplication ratios. They take different perspectives on the benefits of increasing deduplication ratios and I will highlight their points and add an additional one to consider.

Patel argues that increasing deduplication ratios beyond 10:1 provides only a marginal benefit. He calculates that going from 10:1 to 20:1 results in only a 5% increase in capacity efficiency and suggests that this provides only a marginal benefit. He adds that vendors who suggest that a doubling in deduplication ratios will result in a doubling cost savings are using a “sleight of hand.” He makes an interesting point, but I disagree with his core statement that increasing deduplication ratios beyond 10:1 provides only marginal savings.

Preston responds to Patel by suggesting that there is a real cost to purchase, manage and power/cool disk systems. An increase in deduplication ratios from 10:1 to 20:1 reduces the amount of required disk storage by two. (e.g. 10 TB at 10:1 requires 1 TB of disk while a 20:1 reduction would require .5TB disk.) He argues that this provides real management cost, power and cooling savings. I believe that Preston makes a good point, but there is another element that is also worth considering.

Most end users purchase deduplication with an end goal of replicating their data for disaster recovery purposes. The benefit of deduplication is not just about retaining data locally, but also about reducing bandwidth requirements for replication. The implications of going from 10:1 to 20:1 can have a major impact on replication and disaster recovery and in some cases can make the difference between meeting or missing SLAs. If we take the same example above, 10TB will shrink to 1 TB and will take 49.4 hours to replicate over a T-3. The same math with a 20:1 ratio yields 24.7 hours. (The model assumes that the T-3 delivers 45 Mb/sec and can be fully utilized for backup.) In this scenario, if the customer’s requirement is to get data offsite in 24 hours, they barely miss it with 20:1 and completely miss it with 10:1. If the deduplication ratio were to increase by one point to 21:1, the customer could replicate their data in 23.5 hours and meet their window. As you can see, in this case, data reduction ratios really matter. In fact, they are critical to the customer meeting their SLAs.

In summary, I believe that Patel is wrong in his assessment of deduplication ratios. Increasing ratios can and will have a meaningful impact on customer environments and suggesting that the benefits only result in 5% savings is misleading.

Be Sociable, Share!
  • Twitter
  • Facebook
  • email
  • StumbleUpon
  • Delicious
  • LinkedIn

13 Responses to “Deduplication ratios and their impact on DR cost savings”

  1. I didn’t even think about the bandwidth differences. That’s huge!

  2. There’s also a computational power trade-off for the dedupe ratio, though it’s much harder to characterize than storage or bandwidth trade-offs. It just takes more compute power to wring the redundancy out the data using a combination of matching, differencing and compression technologies. We’re all familiar with the compression trade-offs in Winzip or 7zip. Similar considerations apply in dedupe algorithms.

  3. JPP,My understanding of rsync is that it tnrrsfeas files across a network using only the blocks or chunks that have changed and that it can use compression as well. I don’t believe it does any sort of TCP optimization like traditional WAN optimization solutions do. Veeam’s WAN op implementation does use TCP optimizations(inflated window sizes, etc) in addition to deduplication at the block level. Veeam also has a dedicated cache on both sides of the link to store the block level information, which helps with the warm tnrrsfeas.

  4. Hallo,wir stellen selbstverständlich noch jede Menge Fotos ein, es folgt auch noch ein Film, wird aber noch ein paar Tage dauern, aber da kommt noch einigesLiebe GrüßeDas Heggestorzeteam

  5. Te jó ég, drága Nénjécske Pácinban (BAZ megye) mennyi ilyet csinált nekünk. Arrafelé “pergelt vajnak” hívják, és, hmm, szóval nem épp nagy gazdagságra utaló eledel. Mi kenyérre kentük, mustár a tetejére és hamm. Remélem, Nagyapa is, Nénjécske is látja fentrÅ‘l, hogy a “finom népek” is ismerik a mi pergelt vajunkat.

  6. Gratulerer – all grunn til Ã¥ være kry… :o) Nydelige bilder!Kunne godt ha tenkt meg det magasinet, men forstÃ¥r det kan bli litt vanskelig Ã¥ fÃ¥ tak i. Kan jo se pÃ¥ nettutgaven, men det blir ikke helt det samme..Ønsker deg en fin dag videre og en herlig helg!Klem fra Hildegunn

  7. I really appreciate your comment Roderick and made the necessary change on the aristocratic title. Sorry for my error!Actually if you could send me your postal address, I would forward the local school’s DVD on Woodlawn House plus the article from the Sunday Times (if I can find it) regarding the haunting.RegardsBrendan

  8. Borda! Non è una proposta CEI, ma una proposta democratica dei maggiori studiosi di critica testuale autori del Novum Testamentum Graece da cui quasi TUTTI traggono per le traduzioni del Nuovo Testamento.

  9. Frankly I think that’s absolutely good stuff.

  10. None. But we are having two garage sales, on huge sale and two barn sales. Our weekly newspaper covers town 2 small towns. Maybe 16,000 people.

Trackbacks/Pingbacks

  1. Tweets that mention About Restore » Blog Archive » Deduplication ratios and their impact on DR cost savings -- Topsy.com - April 1, 2010

    […] This post was mentioned on Twitter by Jay Livens. Jay Livens said: Blog post adding to @wcpreston's deduplication debate: http://bit.ly/bzub5s – Deduplication ratios and their impact on DR cost savings […]

  2. Ip-toolBox.com » Dedupe Ratios Do Matter - April 16, 2010

    […] I’d like to thank Curtis Preston (Blog Entry 1, Blog Entry 2) and Sepaton’s Jay Livens (Blog Entry) for adding to the discussion in their blog […]

  3. How to REALLY analyze dedupe ratios and their impact on cost savings - Backup Central - February 15, 2017

    […] Jay Livens of SEPATON posted his thoughts on this subject on his aboutrestore.com blog.  In addition to the power/cooling costs I […]

Leave a Reply