<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>About Restore &#187; Deduplication</title>
	<atom:link href="http://www.aboutrestore.com/category/deduplication/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.aboutrestore.com</link>
	<description>Blogging about backup, recovery and marketing in the storage industry.</description>
	<lastBuildDate>Mon, 14 Mar 2011 20:20:22 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Deduplication Strategy and Dell/Ocarina</title>
		<link>http://www.aboutrestore.com/2010/07/22/deduplication-strategy-and-dellocarina/</link>
		<comments>http://www.aboutrestore.com/2010/07/22/deduplication-strategy-and-dellocarina/#comments</comments>
		<pubDate>Thu, 22 Jul 2010 19:27:30 +0000</pubDate>
		<dc:creator>Jay Livens</dc:creator>
				<category><![CDATA[Deduplication]]></category>
		<category><![CDATA[dedupe]]></category>
		<category><![CDATA[primary storage]]></category>

		<guid isPermaLink="false">http://www.aboutrestore.com/?p=1148</guid>
		<description><![CDATA[This week, Dell acquired Ocarina, a provider of primary storage deduplication.  The acquisition provides technology that they can integrate with existing storage platforms such as EqualLogic.  However, Dell also sells deduplication technology from EMC/Data Domain, CommVault and Symantec.  Dave West at CommVault suggests that these technologies are complementary, and I agree. However, the announcement raises [...]]]></description>
			<content:encoded><![CDATA[
<div class="topsy_widget_data topsy_theme_blue" style="float: right;margin-left: 0.75em; background: url(data:,%7B%20%22url%22%3A%20%22http%253A%252F%252Fwww.aboutrestore.com%252F2010%252F07%252F22%252Fdeduplication-strategy-and-dellocarina%252F%22%2C%20%22style%22%3A%20%22big%22%2C%20%22title%22%3A%20%22Deduplication%20Strategy%20and%20Dell%2FOcarina%22%20%7D);"></div>
<p>This week, Dell acquired Ocarina, a provider of primary storage deduplication.  The acquisition provides technology that they can integrate with existing storage platforms such as EqualLogic.  However, Dell also sells deduplication technology from <a href="http://www.dell.com/us/en/enterprise/storage/dell-emc-dd140/pd.aspx?refid=dell-emc-dd140&amp;cs=555&amp;s=biz" target="_blank">EMC/Data Domain</a>, <a href="http://www.dell.com/content/products/productdetails.aspx/powervault-disk-backup-commvault?c=us&amp;l=en&amp;s=bsd&amp;cs=04" target="_blank">CommVault</a> and <a href="http://www.dell.com/content/products/productdetails.aspx/powervault-disk-backup-symantec?c=us&amp;l=en&amp;s=bsd&amp;cs=04" target="_blank">Symantec</a>.  Dave West at CommVault <a href="http://news.commvault.com/DavidWest/000049_A_Complementary_Approach_to_Deduplication.asp" target="_blank">suggests that these technologies are complementary</a>, and I agree. However, the announcement raises a significant strategic question &#8211; which is a better deduplication strategy, “one size fits all” or “best of breed”?</p>
<p>Deduplication is an important technology in the datacenter and reduces power footprint and cooling requirements.  However, it typically brings a performance trade-off during read or write operations due to the additional processing required to re-hydrate or deduplicate data.  The benefits of the technology are compelling and we have seen multiple large companies promote different deduplication strategies.  Their approaches fall into two broad categories:“best of breed” (BoB) or “one size fits all,” (OSFA) and the choice of approach has a major impact.  Let’s look at each strategy individual.<span id="more-1148"></span></p>
<p><strong>One size fits all</strong></p>
<p>This approach is best exemplified by EMC’s Viper project and HP’s StoreOnce initiative.  The idea is to create one global deduplication technology that is used across all applications and hardware.  In theory, this allows for deduplication with consistent results and the ability to move deduplicated data from one system to another with out having to re-hydrate data.  The idea is a bold one, but has some challenges in implementation.</p>
<p>The fundamental problem is that both data characteristics and the requirement for read/write performance vary widely.  For example, compare the performance and SLA requirements of an Oracle database to a user file backup.  Not only are these data types completely unrelated, but they are also written to storage in a very different ways.  Given these variations how can you use one deduplication algorithm?  Deduplication technology is complex and must be designed to solve specific problems.  A generic algorithm is just that and will likely provide mediocre results across the board.</p>
<p>These vendors have chosen a difficult path and their challenge is to create a flexible deduplication algorithm that will work effectively across all data types and business SLAs.  Having seen deduplication development first hand, I can tell you that it is extremely complex and creating a generic solution that works across all tiers of storage and types of data and provides compelling results is extraordinarily challenging if not impossible.  It will be interesting to see how these vendors address this.</p>
<p><strong>Best of breed</strong></p>
<p>This approach is best exemplified by Dell and their recent acquisition of Ocarina.  In this model, a vendor assembles a range of data reduction options to meet different needs.  For example, a supplier might offer an in-band compression option that would minimize the performance impact and provide an adequate but limited data reduction benefit for performance sensitive customers.  They would also offer a backup-specific deduplication solution to minimize backup windows, recovery times and storage footprint.  By offering these options, the vendor could provide solutions that meet a range of customer needs.</p>
<p>The challenge with the BoB approach is that you do not gain the interoperabilty benefits of OSFA.  Moving data from one environment to another will require the re-hydration and re-deduplication of the data, potentially impacting performance.  Although this creates inefficiencies, it does provide the benefit of optimized data reduction that best aligns with business requirements</p>
<p><strong>Summary</strong></p>
<p>Based on my experience with deduplication in the secondary storage arena, the BoB approach is the way to go.  The OSFA model promises a level of simplicity with one algorithm used universally; however, it also creates substantial development and management complexities due to varying data and business requirements.  The vendors promoting this approach position it as a long-term strategy and so it is unlikely to appear any time soon.  This means that it is unclear if/when the vision will come to fruition and it is certainly possible that it may never live up to expectations.</p>
<p>The BoB approach provides an immediate time-to-market benefit.  There are suppliers providing a range of data reduction options today that fit nicely into this model, and Dell appears to be pursuing this approach with Ocarina for primary storage and various partners for backup deduplication.  The challenge is that it can add to management complexity and force re-hydration as data moves between solutions.  However, even with these limitations, I believe that this is the best approach available today.  It provides the flexibility and optimized performance that customers’ need to meet their varying business requirements.</p>
<p><strong>The Future</strong></p>
<p>Long-term, I believe that we will see a combination of these two models.  The simplicity of OSFA is beneficial, but business requirements will mandate the improved performance and data reduction characteristics of BoB.  The best strategy is to define two or three broad tiers of data and use appropriate data reduction methods for each.  The data reduction algorithms will likely vary by tier and may range from simple compression to sophisticated deduplication approaches.  An important consideration is manageability and scalability of each tier and SEPATON will play an important role by providing a highly scalable platform for protecting, storing and retaining secondary data. SEPATON has spent a great deal of time solving the deduplication conundrums in the secondary storage area – with our ContentAware approach, we can deliver both the speed and simplicity of the OSFA approach and the flexibility and efficiency of the BoB approach. Take my advice. BoB is the way to go.</p>

<p>Related posts:<ol>
<li><a href='http://www.aboutrestore.com/2008/10/07/inline-deduplication-what-your-mother-never-told-you/' rel='bookmark' title='Inline Deduplication: What Your Mother Never Told You'>Inline Deduplication: What Your Mother Never Told You</a></li>
<li><a href='http://www.aboutrestore.com/2010/03/03/tsm-and-deduplication-4-reasons-why-tsm-deduplication-ratios-suffer/' rel='bookmark' title='TSM and Deduplication: 4 Reasons Why TSM Deduplication Ratios Suffer'>TSM and Deduplication: 4 Reasons Why TSM Deduplication Ratios Suffer</a></li>
<li><a href='http://www.aboutrestore.com/2008/10/17/hifn-%e2%80%93-commoditizing-hash-based-deduplication/' rel='bookmark' title='HIFN – Commoditizing hash-based deduplication?'>HIFN – Commoditizing hash-based deduplication?</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.aboutrestore.com/2010/07/22/deduplication-strategy-and-dellocarina/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Storage pools and why they matter</title>
		<link>http://www.aboutrestore.com/2010/06/28/storage-pools-and-why-they-matter/</link>
		<comments>http://www.aboutrestore.com/2010/06/28/storage-pools-and-why-they-matter/#comments</comments>
		<pubDate>Mon, 28 Jun 2010 14:11:29 +0000</pubDate>
		<dc:creator>Jay Livens</dc:creator>
				<category><![CDATA[Deduplication]]></category>
		<category><![CDATA[dedupe]]></category>
		<category><![CDATA[deltascale]]></category>
		<category><![CDATA[multi-tenancy]]></category>
		<category><![CDATA[storage pools]]></category>

		<guid isPermaLink="false">http://www.aboutrestore.com/?p=1136</guid>
		<description><![CDATA[Today SEPATON announced the addition of Storage Pools to our data protection platform.  The technology marks a major step in the path to data protection lifecycle management, and I am excited about the new functionality and wanted share some brief thoughts. To summarize, storage pooling allows data to be segmented into discrete pools that do [...]]]></description>
			<content:encoded><![CDATA[
<div class="topsy_widget_data topsy_theme_blue" style="float: right;margin-left: 0.75em; background: url(data:,%7B%20%22url%22%3A%20%22http%253A%252F%252Fwww.aboutrestore.com%252F2010%252F06%252F28%252Fstorage-pools-and-why-they-matter%252F%22%2C%20%22style%22%3A%20%22big%22%2C%20%22title%22%3A%20%22Storage%20pools%20and%20why%20they%20matter%22%20%7D);"></div>
<p>Today <a href="http://sepaton.com/news/press-releases/sepaton-protects-the-cloud" target="_blank">SEPATON announced</a> the addition of Storage Pools to our data protection platform.  The technology marks a major step in the path to data protection lifecycle management, and I am excited about the new functionality and wanted share some brief thoughts.</p>
<p>To summarize, storage pooling allows data to be segmented into discrete pools that do not share deduplication.  Data sent to one pool will only be deduplicated against information in that pool and will not co-mingle with other data.  Additionally, pools provide configuration flexibility by supporting different types of disks with different performance profiles.  Pools also benefit from SEPATON’s DeltaScale architecture which allows for dynamic capacity and performance scalability.  Pools are a no-cost option with our latest software release and customers have the ability to implement them in the way that best meets their business requirements.  Some of the benefits include:</p>
<p><span id="more-1136"></span></p>
<p><strong>Service levels</strong></p>
<p>The logical separation of pools enables end users to provide differing levels of service.  Pools can be configured with different node and storage configurations thus allowing an administrator to provide different ingest and density metrics for each pool.  However, the environment is still managed in one system image and can scale capacity or performance.  This prevents system sprawl as is common with many of our competitors.</p>
<p><strong>Multi-tenancy</strong></p>
<p>Pools prevent ingested data from comingling with data in other pools.  This is vital for those environments where data segregation is critical.  A classic example is a service provider who wants to offer accelerated backup as a service.  They must keep each customer’s data separate, and an S2100 with Storage Pools will cost effectively address this challenge and provide system scalability to meet future growth requirements.  With competing solutions, the service provider would need to implement separate systems for each customer.  The management complexity would increase rapidly and it gets even more difficult as data grows.</p>
<p>Another benefit of storage pooling is that it ensures that one pool’s capacity does not impact another’s.  If an appliance is segmented into pool A and B and A fills up, pool B will not be affected.  The benefit is that in a pooled system, the operator can provide consistent capacity and performance to each pool and not have to worry about how growth in one pool will impact another.  Of course, the rapid growth in the first pool may provide a revenue opportunity for the service provider and they can benefit from SEPATON’s dynamic capacity scalability.</p>
<p><strong>Charge-back</strong></p>
<p>Separating data is valuable, but it is critical to understand system utilization to enable effective chargeback.  The new release also provides capacity reporting to allow users to quantify the capacity used by each pool.  This allows for effective chargeback since you can clearly delineate storage usage.</p>
<p><strong>Future Protection</strong></p>
<p>Disk technology is always changing.  In the world of SATA, we continue to see increasing densities and new controller technologies.  For example, HDS <a href="http://www.hds.com/corporate/press-analyst-center/press-releases/2010/gl100622.html" target="_blank">recently announced</a> an upgrade to their AMS series of disk arrays and SEPATON is shipping the new technology in our platform.  Inevitably, future innovations will occur and as they do, Storage Pools will allow the different disk technologies to co-exist in a transparent fashion and allow for data migration.</p>
<p><strong>Single System Image</strong></p>
<p>Most importantly, Storage Pool technology relies on SEPATON’s core appliance technology.  The ability to dynamically scale capacity and performance is a hallmark of SEPATON’s DeltaScale architecture.  The same functionality extends to Storage Pools thus allowing end users to grow the system as needs change while maintaining single system manageability and detailed reporting.</p>
<p>In summary, storage pools is a new technology that brings a range of new features to SEPATON’s platform.  The functionality is unique in the deduplication appliance space and when combined with SEPATON’s dynamic scalability can provide tremendous customer benefits.</p>

<p>Related posts:<ol>
<li><a href='http://www.aboutrestore.com/2008/09/10/ibm-storage-announcement/' rel='bookmark' title='IBM Storage Announcement'>IBM Storage Announcement</a></li>
<li><a href='http://www.aboutrestore.com/2008/09/16/a-little-bit-off-topic-%e2%80%93-deduplication-and-primary-storage/' rel='bookmark' title='A little bit off topic – deduplication and primary storage'>A little bit off topic – deduplication and primary storage</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.aboutrestore.com/2010/06/28/storage-pools-and-why-they-matter/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Domain &amp; GDA – Bolt-on to the rescue</title>
		<link>http://www.aboutrestore.com/2010/04/14/data-domain-gda-%e2%80%93-bolt-on-to-the-rescue/</link>
		<comments>http://www.aboutrestore.com/2010/04/14/data-domain-gda-%e2%80%93-bolt-on-to-the-rescue/#comments</comments>
		<pubDate>Wed, 14 Apr 2010 16:53:40 +0000</pubDate>
		<dc:creator>Jay Livens</dc:creator>
				<category><![CDATA[Backup]]></category>
		<category><![CDATA[Deduplication]]></category>
		<category><![CDATA[Restore]]></category>
		<category><![CDATA[data domain]]></category>
		<category><![CDATA[emc]]></category>
		<category><![CDATA[hash]]></category>

		<guid isPermaLink="false">http://www.aboutrestore.com/?p=1040</guid>
		<description><![CDATA[One of biggest challenges facing today&#8217;s datacenter managers is protecting the vast quantities of data being generated. As volumes have increased, customers have looked for larger and larger backup solutions. Multi-node global deduplication systems have become critical to enable companies to meet business requirements and EMC/Data Domain&#8217;s response to these challenges has been &#8220;add another box&#8221; [...]]]></description>
			<content:encoded><![CDATA[
<div class="topsy_widget_data topsy_theme_blue" style="float: right;margin-left: 0.75em; background: url(data:,%7B%20%22url%22%3A%20%22http%253A%252F%252Fwww.aboutrestore.com%252F2010%252F04%252F14%252Fdata-domain-gda-%2525e2%252580%252593-bolt-on-to-the-rescue%252F%22%2C%20%22style%22%3A%20%22big%22%2C%20%22title%22%3A%20%22Data%20Domain%20%26%20GDA%20%E2%80%93%20Bolt-on%20to%20the%20rescue%22%20%7D);"></div>
<p>One of biggest challenges facing today&#8217;s datacenter managers is protecting the vast quantities of data being generated. As volumes have increased, customers have looked for larger and larger backup solutions. Multi-node global deduplication systems have become critical to enable companies to meet business requirements and EMC/Data Domain&#8217;s response to these challenges has been &#8220;add another box&#8221; which is their answer to all capacity or performance scalability questions. It appears that Data Domain has acknowledged that this argument no longer resonates and has reverted to Plan B, bolt-on GDA.</p>
<p>The use of the term &#8220;bolt-on&#8221; stems from a <a href="http://www.dedupematters.com/brianbilesblog/2008/12/keeping-it-real.html" target="_blank">previous blog post</a> by EMC/Data Domain&#8217;s VP of Product Management, Brian Biles. In the entry, he characterizes other deduplication vendors as bolt-on solutions, and the obvious implication is that Data Domain is better because it is not a bolt-on. Few would agree with this assertion, but it is an interesting opinion and I will return to this later.</p>
<p><span id="more-1040"></span></p>
<p>On the surface, the GDA announcement includes all of the buzzwords including &#8220;dual controller&#8221;, &#8220;global deduplication&#8221; and &#8220;transparent load-balancing&#8221;. The solution sounds impressive, right? Well so did the <a href="http://www.aboutrestore.com/2008/08/13/rube-goldberg-reborn-as-a-vtl/" target="_blank">DL3D 4000</a> and <a href="http://www.aboutrestore.com/2009/11/12/bye-bye-edldl3d-15003000-it-was-nice-knowing-you/" target="_blank">remember what happened to the DL3D family</a>? The simple fact is that EMC/Data Domain (DD) has announced a product that may be impressive on paper, but the reality is much different. DD has been rumored to be developing a true global deduplication solution for the last few years, and it is hard to believe that this is the result.</p>
<p>GDA is unlike any other deduplication appliance. Every vendor (including DD pre-GDA) has built appliances that are self-contained and designed to minimize the administration, configuration and modification of backup environments. In short, the solutions are designed for simplicity. The GDA is different; it is essentially two separate DD880&#8242;s and a heavy OST client that hashes and software compresses the data and sends it to one of the two boxes based on the first digits of the hash. By forcing media servers to perform highly CPU-centric processing, the GDA moves the problem to the media server. The problem now rests on the end users shoulders to manage and size every server to ensure it meets appropriate GDA specifications. Essentially, the GDA has assimilated the media servers into its &#8220;appliance&#8221; realm and pushed processing and management activities onto the customer. It is important to note that GDA requires Symantec OST technology which is available on NetBackup and BackupExec. If you run any other backup application then GDA is not an option for you.</p>
<p>Here are some points to consider related to the implementation:</p>
<ul>
<li>How do you size the required media server upgrades to meet your performance requirements? </li>
<li>Will you need to purchase new media servers to gain the processing power required? </li>
<li>Who do you call if your performance does not meet expectations? </li>
<li>Are you comfortable running such highly CPU intensive agents on your media servers which may already be resource constrained? </li>
<li>What happens if you have some applications that backup directly to tape using a LAN free approach such as NDMP or RMAN? </li>
<li>What if you are running backup applications like IBM TSM, EMC&#8217;s own NetWorker or CommVault that do not support OST? </li>
</ul>
<p>These are real end user concerns. We have plenty of customers with 20+ media servers who would need to completely overhaul their environment to implement GDA. These customers would scoff at adding DD&#8217;s agents to their already overburdened servers.  In contrast, a true multi-node global deduplication solution like SEPATON&#8217;s can be connected and will just work as a backup target. Agents are not required when backing up to a VTL.</p>
<p>In summary, it appears that EMC/Data Domain is tacitly acknowledging the limitations of their hash-based architecture. They spent years trying to develop an integrated global deduplication solution and the best they could come up with is a cobbled together system including dual 880&#8242;s and an unwieldy OST client. It appears that the GDA has more in common with the DL3D4000 then I originally thought! Isn&#8217;t it ironic that the biggest opponent of bolt-on technology has so fully embraced the concept in the GDA?</p>

<p>Related posts:<ol>
<li><a href='http://www.aboutrestore.com/2009/04/17/sepaton-versus-data-domain/' rel='bookmark' title='SEPATON Versus Data Domain'>SEPATON Versus Data Domain</a></li>
<li><a href='http://www.aboutrestore.com/2009/06/04/netapp-and-emc-duel-to-the-death-for-data-domain/' rel='bookmark' title='NetApp and EMC Duel to the Death for Data Domain'>NetApp and EMC Duel to the Death for Data Domain</a></li>
<li><a href='http://www.aboutrestore.com/2009/06/17/netapp-and-data-domain-%e2%80%98til-death-or-a-better-offer-from-emc-do-we-part/' rel='bookmark' title='NetApp and Data Domain: ‘Til death or a better offer from EMC do we part'>NetApp and Data Domain: ‘Til death or a better offer from EMC do we part</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.aboutrestore.com/2010/04/14/data-domain-gda-%e2%80%93-bolt-on-to-the-rescue/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Deduplication ratios and their impact on DR cost savings</title>
		<link>http://www.aboutrestore.com/2010/03/31/deduplication-ratios-and-their-impact-on-dr-cost-savings/</link>
		<comments>http://www.aboutrestore.com/2010/03/31/deduplication-ratios-and-their-impact-on-dr-cost-savings/#comments</comments>
		<pubDate>Wed, 31 Mar 2010 21:32:37 +0000</pubDate>
		<dc:creator>Jay Livens</dc:creator>
				<category><![CDATA[Backup]]></category>
		<category><![CDATA[Deduplication]]></category>
		<category><![CDATA[Replication]]></category>
		<category><![CDATA[blog]]></category>
		<category><![CDATA[dedupe]]></category>
		<category><![CDATA[w curtis preston]]></category>

		<guid isPermaLink="false">http://www.aboutrestore.com/?p=1026</guid>
		<description><![CDATA[There is an interesting blog discussion between Dipash Patel from CommVault and W. Curtis Preston from Backup Central and TruthinIT regarding the increasing or decreasing benefits of deduplication ratios. They take different perspectives on the benefits of increasing deduplication ratios and I will highlight their points and add an additional one to consider. Patel argues [...]]]></description>
			<content:encoded><![CDATA[
<div class="topsy_widget_data topsy_theme_blue" style="float: right;margin-left: 0.75em; background: url(data:,%7B%20%22url%22%3A%20%22http%253A%252F%252Fwww.aboutrestore.com%252F2010%252F03%252F31%252Fdeduplication-ratios-and-their-impact-on-dr-cost-savings%252F%22%2C%20%22style%22%3A%20%22big%22%2C%20%22title%22%3A%20%22Deduplication%20ratios%20and%20their%20impact%20on%20DR%20cost%20savings%22%20%7D);"></div>
<p>There is an interesting blog discussion between Dipash Patel from <a href="http://www.commvault.com/" target="_blank">CommVault</a> and W. Curtis Preston from <a href="http://www.backupcentral.com/" target="_blank">Backup Central</a> and <a href="http://www.truthinit.com/" target="_blank">TruthinIT</a> regarding the increasing or decreasing benefits of deduplication ratios. They take different perspectives on the benefits of increasing deduplication ratios and I will highlight their points and add an additional one to consider.</p>
<p><a href="http://news.commvault.com/DipeshPatel/000039_How_to_analyze_dedupe_ratios_and_its_impact_on_cost_savings.asp" target="_blank">Patel argues</a> that increasing deduplication ratios beyond 10:1 provides only a marginal benefit. He calculates that going from 10:1 to 20:1 results in only a 5% increase in capacity efficiency and suggests that this provides only a marginal benefit. He adds that vendors who suggest that a doubling in deduplication ratios will result in a doubling cost savings are using a &#8220;sleight of hand.&#8221; He makes an interesting point, but I disagree with his core statement that increasing deduplication ratios beyond 10:1 provides only marginal savings.</p>
<p><span id="more-1026"></span></p>
<p><a href="http://www.backupcentral.com/index.php?option=com_content&amp;task=view&amp;id=305&amp;Itemid=47" target="_blank">Preston responds</a> to Patel by suggesting that there is a real cost to purchase, manage and power/cool disk systems. An increase in deduplication ratios from 10:1 to 20:1 reduces the amount of required disk storage by two. (e.g. 10 TB at 10:1 requires 1 TB of disk while a 20:1 reduction would require .5TB disk.) He argues that this provides real management cost, power and cooling savings. I believe that Preston makes a good point, but there is another element that is also worth considering.</p>
<p>Most end users purchase deduplication with an end goal of replicating their data for disaster recovery purposes. The benefit of deduplication is not just about retaining data locally, but also about reducing bandwidth requirements for replication. The implications of going from 10:1 to 20:1 can have a major impact on replication and disaster recovery and in some cases can make the difference between meeting or missing SLAs. If we take the same example above, 10TB will shrink to 1 TB and will take 49.4 hours to replicate over a T-3. The same math with a 20:1 ratio yields 24.7 hours. (The model assumes that the T-3 delivers 45 Mb/sec and can be fully utilized for backup.) In this scenario, if the customer&#8217;s requirement is to get data offsite in 24 hours, they barely miss it with 20:1 and completely miss it with 10:1. If the deduplication ratio were to increase by one point to 21:1, the customer could replicate their data in 23.5 hours and meet their window. As you can see, in this case, data reduction ratios really matter. In fact, they are critical to the customer meeting their SLAs.</p>
<p>In summary, I believe that Patel is wrong in his assessment of deduplication ratios. Increasing ratios can and will have a meaningful impact on customer environments and suggesting that the benefits only result in 5% savings is misleading.</p>

<p>Related posts:<ol>
<li><a href='http://www.aboutrestore.com/2010/03/03/tsm-and-deduplication-4-reasons-why-tsm-deduplication-ratios-suffer/' rel='bookmark' title='TSM and Deduplication: 4 Reasons Why TSM Deduplication Ratios Suffer'>TSM and Deduplication: 4 Reasons Why TSM Deduplication Ratios Suffer</a></li>
<li><a href='http://www.aboutrestore.com/2008/07/23/deltastor-deduplication-cont/' rel='bookmark' title='DeltaStor Deduplication, cont&#8230;.'>DeltaStor Deduplication, cont&#8230;.</a></li>
<li><a href='http://www.aboutrestore.com/2008/07/18/deltastor-deduplication/' rel='bookmark' title='DeltaStor Deduplication'>DeltaStor Deduplication</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.aboutrestore.com/2010/03/31/deduplication-ratios-and-their-impact-on-dr-cost-savings/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>TSM Target Deduplication: You Get What You Pay For</title>
		<link>http://www.aboutrestore.com/2010/03/16/tsm-target-deduplication-you-get-what-you-pay-for/</link>
		<comments>http://www.aboutrestore.com/2010/03/16/tsm-target-deduplication-you-get-what-you-pay-for/#comments</comments>
		<pubDate>Tue, 16 Mar 2010 18:37:19 +0000</pubDate>
		<dc:creator>Jay Livens</dc:creator>
				<category><![CDATA[Deduplication]]></category>
		<category><![CDATA[dedupe]]></category>
		<category><![CDATA[enterprise strategy group]]></category>
		<category><![CDATA[esg]]></category>
		<category><![CDATA[ibm]]></category>
		<category><![CDATA[tsm]]></category>

		<guid isPermaLink="false">http://www.aboutrestore.com/?p=993</guid>
		<description><![CDATA[I was recently pondering TSM&#8217;s implementation of target deduplication and decided to review ESG&#8217;s Lab Validation report on IBM TSM 6.1. There is quite a bit of good information in the paper, and some really interesting data about TSM&#8217;s target deduplication. Before discussing the results, it is important to understand the testing methodology. Enterprise Strategy [...]]]></description>
			<content:encoded><![CDATA[
<div class="topsy_widget_data topsy_theme_blue" style="float: right;margin-left: 0.75em; background: url(data:,%7B%20%22url%22%3A%20%22http%253A%252F%252Fwww.aboutrestore.com%252F2010%252F03%252F16%252Ftsm-target-deduplication-you-get-what-you-pay-for%252F%22%2C%20%22style%22%3A%20%22big%22%2C%20%22title%22%3A%20%22TSM%20Target%20Deduplication%3A%20You%20Get%20What%20You%20Pay%20For%22%20%7D);"></div>
<p>I was recently pondering TSM&#8217;s implementation of target deduplication and decided to review <a href="http://www.ibm.com/developerworks/wikis/download/attachments/110166870/ESG+Lab+Validation+IBM+TSM+6.1+Apr+09.pdf?version=1">ESG&#8217;s Lab Validation report on IBM TSM 6.1</a>. There is quite a bit of good information in the paper, and some really interesting data about TSM&#8217;s target deduplication.</p>
<p>Before discussing the results, it is important to understand the testing methodology. <a href="http://www.enterprisestrategygroup.com/" target="_blank">Enterprise Strategy Group</a> clearly states that the article was based on &#8220;hands-on testing [in IBM's Tucson, AZ labs], audits of IBM test environments, and detailed discussions with IBM TSM experts.&#8221; (page 5) This means that IBM installed and configured the environment and allowed ESG to test the systems and review the results. Clearly, IBM engineers are experts in TSM and so you would assume that any systems provided would be optimally configured for performance and deduplication. The results experienced by ESG are likely the best case scenario since the average customer may not have the flexibility (or knowledge) to configure a similar system. This is not a problem, per se, but readers should keep this in mind.</p>
<p><span id="more-993"></span></p>
<p>The whitepaper highlights the data reduction realized using TSM and mentions capacity savings of 19:1. However, if you look carefully, you see that the space savings calculations are based on capacity reduction from TSM&#8217;s proprietary progressive incremental technology <em>and</em> deduplication. Progressive incremental technology reduces the amount of storage required by bypassing full backups. The really interesting question is &#8220;what additional benefits are gained by using TSM&#8217;s target deduplication?&#8221; Fortunately, ESG provides an answer.</p>
<p>The idea behind deduplication is that it provides capacity savings by removing redundancies within backup data. Theoretically, IBM should have an advantage deduplicating TSM backups since they are intimately familiar with the application and its data formats. However, ESG&#8217;s results do not support this assertion. The paper states, &#8220;Data deduplication enhanced data reduction in TSM nearly 50% over progressively incremental backup schemes alone.&#8221; (page 8 ) This suggests that IBM&#8217;s deduplication provides a 2:1 space savings! Wow, talk about a minimal benefit; you could get close to the same results with hardware compression.</p>
<p>In summary, TSM deduplication appears to provide minimal capacity savings while creating <a href="http://www.aboutrestore.com/2009/02/18/tsm-deduplication/" target="_blank">management challenges</a>. As an end user concerned about backup and recovery, you should carefully evaluate your options. The improved manageability, performance and data reduction of dedicated target deduplication appliances like <a href="http://www.sepaton.com/">SEPATON&#8217;s</a> <a href="http://sepaton.com/product-tour">S2100 product family</a> is a better option for all but the smallest environments. TSM includes target deduplication for free, but remember, in this case, you get what you pay for!</p>

<p>Related posts:<ol>
<li><a href='http://www.aboutrestore.com/2010/03/03/tsm-and-deduplication-4-reasons-why-tsm-deduplication-ratios-suffer/' rel='bookmark' title='TSM and Deduplication: 4 Reasons Why TSM Deduplication Ratios Suffer'>TSM and Deduplication: 4 Reasons Why TSM Deduplication Ratios Suffer</a></li>
<li><a href='http://www.aboutrestore.com/2010/03/31/deduplication-ratios-and-their-impact-on-dr-cost-savings/' rel='bookmark' title='Deduplication ratios and their impact on DR cost savings'>Deduplication ratios and their impact on DR cost savings</a></li>
<li><a href='http://www.aboutrestore.com/2009/02/18/tsm-deduplication/' rel='bookmark' title='TSM Deduplication'>TSM Deduplication</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.aboutrestore.com/2010/03/16/tsm-target-deduplication-you-get-what-you-pay-for/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>TSM and Deduplication: 4 Reasons Why TSM Deduplication Ratios Suffer</title>
		<link>http://www.aboutrestore.com/2010/03/03/tsm-and-deduplication-4-reasons-why-tsm-deduplication-ratios-suffer/</link>
		<comments>http://www.aboutrestore.com/2010/03/03/tsm-and-deduplication-4-reasons-why-tsm-deduplication-ratios-suffer/#comments</comments>
		<pubDate>Wed, 03 Mar 2010 14:20:03 +0000</pubDate>
		<dc:creator>Jay Livens</dc:creator>
				<category><![CDATA[Deduplication]]></category>
		<category><![CDATA[dedupe]]></category>
		<category><![CDATA[ibm]]></category>
		<category><![CDATA[tsm]]></category>

		<guid isPermaLink="false">http://www.aboutrestore.com/?p=966</guid>
		<description><![CDATA[TSM presents unique deduplication challenges due to its progressive incremental backup strategy and architectural design. This contrasts with the traditional full/incremental model used by competing backup software vendors. The result is that TSM users will see smaller deduplication ratios than their counterparts using NetBackup, NetWorker or Data Protector. This post explores four key reasons why [...]]]></description>
			<content:encoded><![CDATA[
<div class="topsy_widget_data topsy_theme_blue" style="float: right;margin-left: 0.75em; background: url(data:,%7B%20%22url%22%3A%20%22http%253A%252F%252Fwww.aboutrestore.com%252F2010%252F03%252F03%252Ftsm-and-deduplication-4-reasons-why-tsm-deduplication-ratios-suffer%252F%22%2C%20%22style%22%3A%20%22big%22%2C%20%22title%22%3A%20%22TSM%20and%20Deduplication%3A%204%20Reasons%20Why%20TSM%20Deduplication%20Ratios%20Suffer%22%20%7D);"></div>
<p>TSM presents unique deduplication challenges due to its progressive incremental backup strategy and architectural design. This contrasts with the traditional full/incremental model used by competing backup software vendors. The result is that TSM users will see smaller deduplication ratios than their counterparts using NetBackup, NetWorker or Data Protector. This post explores four key reasons why TSM is difficult to deduplicate.</p>
<p><span id="more-966"></span></p>
<p><strong>Progressive incremental model<br /></strong>TSM&#8217;s incremental only approach presents the biggest challenge to deduplication. By sending only changed data, TSM limits the amount of redundant information stored and hence deduplication benefits. This is in stark contrast to most other backup applications that rely on frequent full backups; it is these jobs that provide the best data reduction. However TSM is often configured to perform nightly full backups of data types such as Exchange, Oracle or SQL. These backups will provide similar deduplication benefits as seen in other applications, but typically represent a fraction of total data protected.<strong><br /></strong></p>
<p><strong>Data movement<br /></strong>Deduplication algorithms use various methods to recognize redundant information. They analyze data as it is written to the system to find the redundancies. In traditional backup applications, data is typically written to a disk or tape device and remains there until expiration. In contrast, TSM writes data to a given pool and then will run multiple processes which move the data such as reclamation. These processes create deduplication challenges because they force the appliance to constantly re-hydrate data and then re-deduplicate data. The frequent data movement creates inconsistent data patterns that can be difficult for deduplication mechanisms to recognize resulting in decreased data reduction.</p>
<p><strong>Data fragmentation</strong><br />TSM is designed to maximize performance by using multi-streaming. This means that a given job could backup to almost any disk or tape device and its data could be multiplexed or mixed with data from any other server. This can be a challenge because data becomes fragmented and will be written in many different physical tape or disk locations. Deduplication struggles because these segments can be small and the algorithm must effectively recognize the varying block sizes to achieve the best possible data reduction. In practice this reduces data reduction.</p>
<p><strong>Reclamation/Overwrites<br /></strong>Since TSM typically spreads a backup job across multiple pieces of media, data becomes fragmented. TSM has a process called reclamation which reduces fragmentation by moving data onto new cartridges and expiring the previous cartridges. Reclamation is designed to minimize the number of cartridges required in a TSM environment, but it can be I/O intensive. As the process it runs, it moves unexpired data onto new cartridges and expires the old cartridges. This can be a challenge for deduplication systems that utilize a batch cleaning process. These systems run a weekly or more frequent process to deleted expired cartridges; free space is not realized until the cleaning is completed. Deduplication ratios will decline since the expired cartridges will consume disk space. The other challenge is that the clean process can be I/O intensive and could negatively impact other TSM processes.</p>
<p>TSM is a very powerful application with a unique data protection model. Unfortunately, the same technology that reduces backup windows also creates challenges for deduplication. The result is that deduplication ratios decrease in TSM environments. However, even with smaller ratios, deduplication still provides a strong benefit for TSM environments particularly with deduplicated replication.</p>

<p>Related posts:<ol>
<li><a href='http://www.aboutrestore.com/2010/03/31/deduplication-ratios-and-their-impact-on-dr-cost-savings/' rel='bookmark' title='Deduplication ratios and their impact on DR cost savings'>Deduplication ratios and their impact on DR cost savings</a></li>
<li><a href='http://www.aboutrestore.com/2008/07/25/6-reasons-not-to-deduplicate-data/' rel='bookmark' title='6 Reasons not to Deduplicate Data'>6 Reasons not to Deduplicate Data</a></li>
<li><a href='http://www.aboutrestore.com/2009/02/18/tsm-deduplication/' rel='bookmark' title='TSM Deduplication'>TSM Deduplication</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.aboutrestore.com/2010/03/03/tsm-and-deduplication-4-reasons-why-tsm-deduplication-ratios-suffer/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>The Demise of the NearStore VTL: A historical perspective</title>
		<link>http://www.aboutrestore.com/2010/02/03/the-demise-of-the-nearstore-vtl-a-historical-perspective/</link>
		<comments>http://www.aboutrestore.com/2010/02/03/the-demise-of-the-nearstore-vtl-a-historical-perspective/#comments</comments>
		<pubDate>Wed, 03 Feb 2010 17:57:34 +0000</pubDate>
		<dc:creator>Jay Livens</dc:creator>
				<category><![CDATA[Deduplication]]></category>
		<category><![CDATA[Virtual Tape]]></category>
		<category><![CDATA[data domain]]></category>
		<category><![CDATA[ddup acquisition]]></category>
		<category><![CDATA[netapp]]></category>

		<guid isPermaLink="false">http://www.aboutrestore.com/?p=886</guid>
		<description><![CDATA[Rumors have been circulating for months about the demise of NetApp&#8217;s VTL offering. Today, Beth Pariseau from SearchDataBackup published the first public confirmation that development on the product has ceased. It is not a surprise, but makes for an interesting case study. NetApp acquired VTL technology with their purchase of Alacritus for $11 million back in [...]]]></description>
			<content:encoded><![CDATA[
<div class="topsy_widget_data topsy_theme_blue" style="float: right;margin-left: 0.75em; background: url(data:,%7B%20%22url%22%3A%20%22http%253A%252F%252Fwww.aboutrestore.com%252F2010%252F02%252F03%252Fthe-demise-of-the-nearstore-vtl-a-historical-perspective%252F%22%2C%20%22style%22%3A%20%22big%22%2C%20%22title%22%3A%20%22The%20Demise%20of%20the%20NearStore%20VTL%3A%20A%20historical%20perspective%22%20%7D);"></div>
<p>Rumors have been circulating for months about the demise of NetApp&#8217;s VTL offering. Today, Beth Pariseau from <a href="http://searchdatabackup.techtarget.com/" target="_blank">SearchDataBackup</a> published the <a href="http://searchdatabackup.techtarget.com/news/article/0,289142,sid187_gci1380567,00.html" target="_blank">first public confirmation</a> that development on the product has ceased. It is not a surprise, but makes for an interesting case study.</p>
<p>NetApp acquired VTL technology with their <a href="http://www.netapp.com/us/company/news/news-rel-20050407.html" target="_blank">purchase of Alacritus</a> for $11 million back in 2005. Alacritus provided a software only VTL solution that ran on a Linux platform. Their product specifications appeared impressive, but they had limited success in the US. Our partners in Asia saw them more frequently. For NetApp, the acquisition made sense because it represented a relatively cost-effective entry into the rapidly growing VTL market. However, as in most things, the difficulties were in the details.</p>
<p>NetApp&#8217;s core intellectual property is their ONTAP operating system and associated WAFL filesystem. These components provide the intelligence and value-added features of their arrays. The challenge for NetApp after acquiring Alacritus was the integration of the two technologies.</p>
<p><span id="more-886"></span></p>
<p>NetApp&#8217;s short term strategy after the purchase was to build a VTL using Linux, Alacritus, x86 servers and existing disk shelves, and they released their <a href="http://www.netapp.com/us/company/news/news_rel_20061017.html" target="_blank">first system in 2006</a>. This leveraged their hardware design and configuration, but bypassed their core software IP. Customers could reuse VTL disk shelves in their filers if they decommissioned the VTL, but there was no ability to share the same storage between filers and VTLs. This is your classic island of storage problem.</p>
<p>The next trend in the market was deduplication. NetApp recognized the importance of the technology and <a href="http://www.netapp.com/us/company/news/news_rel_20070515.html" target="_blank">announced</a> A-SIS deduplication for their filers in May of 2007. However, the Alacritus VTL did not support ONTAP/WAFL and so the technology could not be reused. NetApp then embarked on a project to develop a second deduplication engine (after A-SIS) for use on their VTL.  They said that the new deduplication system would be <a href="http://www.networkcomputing.com/data-center/netapp-de-dupes.php?type=article" target="_blank">available in June of the same year</a>, and I believe that they underestimated the difficulty of creating a new engine. The development took far longer than expected and they <a href="http://www.netapp.com/us/company/news/news-rel-20081028.html" target="_blank">announced the product</a> in October of 2008. To this day, we have seen few implementations of NetApp VTLs with deduplication, and I often wonder if this is because their new deduplication engine does not work effectively.  Given the limited success, it is not surprising that NetApp has officially ceased development.</p>
<p>In my opinion, NetApp&#8217;s VTL failed due to a lack of commitment. To be successful, they needed to port the Alacritus solution to ONTAP/WAFL to allow it to co-exist transparently with existing technology. I believe that NetApp was focused on snapshots and replication for backup and recovery and so the NearStore VTL was not given the attention it deserved. Had they integrated Alacritus and ONTAP, they had the potential to create a unified filer, VTL and deduplication appliance which would have been very competitive.  Instead, they chose to leave Alacritus as a separate product, and NetApp&#8217;s aggressive pursuit of Data Domain marked the end of the NearStore VTL.</p>
<p>The irony is that I believe NetApp had an opportunity to be an early player in the VTL space. They had the relationships and footprint to aggressively sell the solutions. Unfortunately, their critical missteps and lack of focus limited their success.</p>

<p>Related posts:<ol>
<li><a href='http://www.aboutrestore.com/2009/06/17/netapp-and-data-domain-%e2%80%98til-death-or-a-better-offer-from-emc-do-we-part/' rel='bookmark' title='NetApp and Data Domain: ‘Til death or a better offer from EMC do we part'>NetApp and Data Domain: ‘Til death or a better offer from EMC do we part</a></li>
<li><a href='http://www.aboutrestore.com/2008/12/03/surviving-a-down-economy-a-vendor-perspective/' rel='bookmark' title='Surviving A Down Economy &#8211; A vendor Perspective'>Surviving A Down Economy &#8211; A vendor Perspective</a></li>
<li><a href='http://www.aboutrestore.com/2009/07/08/its-final-emc-acquires-data-domain/' rel='bookmark' title='It&#8217;s final &#8211; EMC acquires Data Domain'>It&#8217;s final &#8211; EMC acquires Data Domain</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.aboutrestore.com/2010/02/03/the-demise-of-the-nearstore-vtl-a-historical-perspective/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Four Must Ask Questions About Metadata and Deduplication</title>
		<link>http://www.aboutrestore.com/2009/12/22/4-must-ask-questions-about-metadata-and-deduplication/</link>
		<comments>http://www.aboutrestore.com/2009/12/22/4-must-ask-questions-about-metadata-and-deduplication/#comments</comments>
		<pubDate>Tue, 22 Dec 2009 19:14:39 +0000</pubDate>
		<dc:creator>Jay Livens</dc:creator>
				<category><![CDATA[Deduplication]]></category>
		<category><![CDATA[dedupe]]></category>

		<guid isPermaLink="false">http://www.aboutrestore.com/?p=862</guid>
		<description><![CDATA[When backing up data to a deduplication system, two types of data are generated. The first comprises objects being protected such as the Word documents, databases or Exchange message stores. These files will be deduplicated and for simplicity I will call this “object storage”. The second type of data generated is metadata. This is information [...]]]></description>
			<content:encoded><![CDATA[
<div class="topsy_widget_data topsy_theme_blue" style="float: right;margin-left: 0.75em; background: url(data:,%7B%20%22url%22%3A%20%22http%253A%252F%252Fwww.aboutrestore.com%252F2009%252F12%252F22%252F4-must-ask-questions-about-metadata-and-deduplication%252F%22%2C%20%22style%22%3A%20%22big%22%2C%20%22title%22%3A%20%22Four%20Must%20Ask%20Questions%20About%20Metadata%20and%20Deduplication%22%20%7D);"></div>
<p>When backing up data to a deduplication system, two types of data are generated. The first comprises objects being protected such as the Word documents, databases or Exchange message stores. These files will be deduplicated and for simplicity I will call this “object storage”. The second type of data generated is metadata. This is information that is used by the deduplication software to recognize redundancies and potentially re-hydrate data in the case of restoration. These two types of data are critical and are typically required for writing the data to the system and potentially reading data. Here are four key questions that you should ask about protecting metadata.</p>
<p><span id="more-862"></span></p>
<ol>
<li><strong>How is metadata stored?</strong></li>
<p>It is important to understand how the storage system is configured to protect this data. Simple questions like “how much storage is allocated to metadata?” can be vital since running out of metadata space is essentially the equivalent of running out of object space. (e,g, if you run out of metadata capacity then you cannot add any more items to object storage since this creates metadata.) I have heard of numerous scenarios with competing systems where customers run out of metadata space while still having ample object space available. This results in inefficient space utilization.</p>
<li><strong>How is metadata protected?</strong></li>
<p>Metadata is a critical part of the deduplication process and its loss or corruption can be very problematic. You should understand how metadata is protected. Are backup copies made? If so, how are they created, stored and recovered.</p>
<li><strong>How does metadata storage requirements grow over time?</strong></li>
<p>As mentioned above, running out of metadata space results in poor capacity utilization. You should understand how metadata is created and its growth over time. In many implementations, the size depends on deduplication efficiency and so the more data you backup that cannot be deduplicated (e.g. compressed or encrypted data), the more metadata space you will use. Replication may also require meta space and you should ascertain this impact as well.</p>
<li><strong>What happens if the metadata it is corrupted?</strong></li>
<p>Since this data is critical for the operation of deduplication, you should understand what happens if you permanently lose the metadata repository due to corruption or other problem. In extreme cases this could result in a complete system loss while in other cases the impact may be a reduction in deduplication ratios. The impact will vary by deduplication algorithm.</p>
</ol>
<p>In summary, the creation management and handling of metadata is a key component of every deduplication algorithm. Many vendors prefer to gloss over the importance of metadata and focus on other areas such as deduplication ratios or post-process vs inline. The reality is that the management and creation of metadata is critical and each end user should evaluate how this effects various solutions and their business SLAs. In a future blog post, I will discuss how SEPATON addresses each of these four questions.</p>

<p>Related posts:<ol>
<li><a href='http://www.aboutrestore.com/2008/09/16/a-little-bit-off-topic-%e2%80%93-deduplication-and-primary-storage/' rel='bookmark' title='A little bit off topic – deduplication and primary storage'>A little bit off topic – deduplication and primary storage</a></li>
<li><a href='http://www.aboutrestore.com/2010/03/03/tsm-and-deduplication-4-reasons-why-tsm-deduplication-ratios-suffer/' rel='bookmark' title='TSM and Deduplication: 4 Reasons Why TSM Deduplication Ratios Suffer'>TSM and Deduplication: 4 Reasons Why TSM Deduplication Ratios Suffer</a></li>
<li><a href='http://www.aboutrestore.com/2008/10/07/inline-deduplication-what-your-mother-never-told-you/' rel='bookmark' title='Inline Deduplication: What Your Mother Never Told You'>Inline Deduplication: What Your Mother Never Told You</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.aboutrestore.com/2009/12/22/4-must-ask-questions-about-metadata-and-deduplication/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Bye, bye EDL/DL3D 1500/3000, it was nice knowing you</title>
		<link>http://www.aboutrestore.com/2009/11/12/bye-bye-edldl3d-15003000-it-was-nice-knowing-you/</link>
		<comments>http://www.aboutrestore.com/2009/11/12/bye-bye-edldl3d-15003000-it-was-nice-knowing-you/#comments</comments>
		<pubDate>Thu, 12 Nov 2009 14:35:07 +0000</pubDate>
		<dc:creator>Jay Livens</dc:creator>
				<category><![CDATA[Deduplication]]></category>
		<category><![CDATA[data domain]]></category>
		<category><![CDATA[dl3d]]></category>
		<category><![CDATA[edl]]></category>
		<category><![CDATA[emc]]></category>
		<category><![CDATA[quantum]]></category>

		<guid isPermaLink="false">http://www.aboutrestore.com/?p=845</guid>
		<description><![CDATA[The email below appeared in my inbox yesterday.  The EDL/DL3D 1500/3000 has officially been discontinued.  It was obvious from the moment EMC purchased Data Domain that the Quantum stuff was dead, but it took time for EMC to finally admit this.  The strongest statement came in Frank Slootman&#8217;s TechTarget interview.  Clearly the EMC/QTM relationship was a [...]]]></description>
			<content:encoded><![CDATA[
<div class="topsy_widget_data topsy_theme_blue" style="float: right;margin-left: 0.75em; background: url(data:,%7B%20%22url%22%3A%20%22http%253A%252F%252Fwww.aboutrestore.com%252F2009%252F11%252F12%252Fbye-bye-edldl3d-15003000-it-was-nice-knowing-you%252F%22%2C%20%22style%22%3A%20%22big%22%2C%20%22title%22%3A%20%22Bye%2C%20bye%20EDL%2FDL3D%201500%2F3000%2C%20it%20was%20nice%20knowing%20you%22%20%7D);"></div>
<p>The email below appeared in my inbox yesterday.  The EDL/DL3D 1500/3000 has officially been discontinued.  It was obvious from the moment EMC purchased Data Domain that the Quantum stuff was dead, but it took time for EMC to finally admit this.  The strongest statement came in <a href="http://searchdatabackup.techtarget.com/news/article/0,289142,sid187_gci1371305,00.html" target="_blank">Frank Slootman&#8217;s TechTarget interview</a>.  Clearly the EMC/QTM relationship was a rocky one from the beginning and so the outcome is not surprising.</p>
<p><span id="more-845"></span></p>
<p style="text-align: center;"><a href="http://www.aboutrestore.com/wp-content/uploads/2009/11/DL3DEOL.jpg" rel="lightbox[845]"><img class="aligncenter size-medium wp-image-846" style="border: 0pt none;" title="DL3D End-of-Life Email" src="http://www.aboutrestore.com/wp-content/uploads/2009/11/DL3DEOL-251x300.jpg" alt="DL3D End-of-Life Email" width="251" height="300" /></a><br />
 <span style="font-size: x-small;"><em>Click for larger image</em></span></p>
<p>I find it amusing that just over a year ago, EMC was promoting the Quantum technology and said that “<em>These offerings compete directly with Data Domain and are sharply differentiated by their superior, policy-based de-duplication capabilities, including de-duplication checks between local and remote sites, better scalability, Five-9s availability, and unmatched service and support.</em>” (<a href="http://www.aboutrestore.com/wp-content/uploads/2009/11/DL3DAnnounce.pdf" target="_blank">Local copy of the source</a>, <a href="http://www.emc-centera.com/pdf/Disk%20Library%20Documents/EMC%20Disk%20Library%20Changes%20the%20Economics%20of%20Disk%20Backup%20Vs%20Tape%20Backup.pdf" target="_blank">original source</a>.)  EMC has since completed the acquisition of Data Domain and Quantum is now the pariah and Data Domain is the best thing ever.  Frank Slootman, VP of EMC’s backup group, said in the interview linked above, “<em>We&#8217;re swapping a lot of those [Quantum-based] boxes out at zero revenue. We&#8217;ve taken out about a dozen and we&#8217;ll continue to take out a similar number this quarter. Customers don&#8217;t want it.”</em> Wow, so much for sharply differentiated products with superior technology!  I feel sorry for end users who bought into EMC’s DL3D story and now have to deal with a forklift upgrade.</p>

<p>Related posts:<ol>
<li><a href='http://www.aboutrestore.com/2008/08/01/dl3d-discussion/' rel='bookmark' title='DL3D Discussion'>DL3D Discussion</a></li>
<li><a href='http://www.aboutrestore.com/2009/07/07/emc-one-ups-netapp/' rel='bookmark' title='EMC one-ups NetApp'>EMC one-ups NetApp</a></li>
<li><a href='http://www.aboutrestore.com/2009/06/09/emc-and-data-domain-it-was-the-best-of-times-it-was-the-worst-of-times/' rel='bookmark' title='EMC and Data Domain: It was the best of times, it was the worst of times'>EMC and Data Domain: It was the best of times, it was the worst of times</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.aboutrestore.com/2009/11/12/bye-bye-edldl3d-15003000-it-was-nice-knowing-you/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Domain keynote at SNW &#8211; Slootman&#8217;s surprising response</title>
		<link>http://www.aboutrestore.com/2009/10/20/data-domain-keynote-snw/</link>
		<comments>http://www.aboutrestore.com/2009/10/20/data-domain-keynote-snw/#comments</comments>
		<pubDate>Tue, 20 Oct 2009 19:58:56 +0000</pubDate>
		<dc:creator>Jay Livens</dc:creator>
				<category><![CDATA[Deduplication]]></category>
		<category><![CDATA[Marketing]]></category>
		<category><![CDATA[data domain]]></category>
		<category><![CDATA[emc]]></category>
		<category><![CDATA[tradeshow]]></category>

		<guid isPermaLink="false">http://www.aboutrestore.com/?p=837</guid>
		<description><![CDATA[I attended multiple keynote and breakout sessions at SNW last week, but my busy meeting schedule conflicted with many of the morning sessions. I was able to attend to Data Domain’s talk given by Frank Slootman and wanted to provide some commentary. The bulk of the session was boring and included what appeared to be [...]]]></description>
			<content:encoded><![CDATA[
<div class="topsy_widget_data topsy_theme_blue" style="float: right;margin-left: 0.75em; background: url(data:,%7B%20%22url%22%3A%20%22http%253A%252F%252Fwww.aboutrestore.com%252F2009%252F10%252F20%252Fdata-domain-keynote-snw%252F%22%2C%20%22style%22%3A%20%22big%22%2C%20%22title%22%3A%20%22Data%20Domain%20keynote%20at%20SNW%20-%20Slootman%27s%20surprising%20response%22%20%7D);"></div>
<p>I attended multiple keynote and breakout sessions at SNW last week, but my busy meeting schedule conflicted with many of the morning sessions. I was able to attend to Data Domain’s talk given by Frank Slootman and wanted to provide some commentary.</p>
<p>The bulk of the session was boring and included what appeared to be a standard corporate slide deck which I am sure any salesperson could present in their sleep.  The presentation could be summarized with Data Domain’s usual message: <em>inline deduplication is good and everything else is bad, and, of course, Data Domain’s deduplication is the best</em>.  I was definitely hoping for something more interesting and was sorely disappointed; however, things changed when it came to the Q&amp;A.</p>
<p>Just to provide a bit of background, my experience with SNW is described <a href="http://www.aboutrestore.com/2009/10/16/snw-recap/">here</a>.  There were a large number of end users in attendance both at the expo and the keynote sessions and I estimate that many of the show’s 900 end users were in attendance for this talk.  At the end of the planned remarks, Slootman opened the floor to questions.</p>
<p><span id="more-837"></span></p>
<p>The first question came from a Data Domain end user who was familiar with Slootman.  The customer started by saying “Frank, we met when you visited our company.”  Slootman acknowledged the statement although it was not clear that he knew the person and things went downhill from there.</p>
<p>The customer was a Data Domain user and expressed frustration about the lack of basic capacity planning tools.  He was annoyed about his systems’ limited advanced warning when space was running low.  The result was that he had to fight with procurement to expedite the purchase of new shelves to ensure that he did not violate business SLAs.  His seemingly simple request was a tool that would allow him to forecast storage requirements for his Data Domain systems so he could plan purchases in advance.  Seems like a reasonable request, right?  Well, not according to Slootman.</p>
<p>First Slootman suggested that the end user pay attention to the automated capacity warnings.  The end user replied “Yes, but when those arrive it is too late.”  (I wonder if this can be adjusted.)  The customer pressed the point and said “I really need this and you promised it!” At this point Slootman became agitated and you would think that he would put on his “the customer is always right” hat and say something vague like “I will look into it and see if I can get it prioritized” or perhaps “I understand your frustration, let’s meet after the show and discuss it.”  Nope, none of the niceties from Slootman, he went straight for the jugular.  His answer was “Look, I have 50 other priorities just like this on the list and yours is on the bottom!”  Ouch!  How is that for customer service?  At this point, the end user’s microphone was disabled and questioning moved elsewhere.</p>
<p>The entire discussion was surprising.  My table comprised a mix of vendors and end users and everyone was looking at each other in wonder.  I even heard a couple of people say “can you believe he said that?”  I was equally surprised especially since everything Slootman had previously said was boring corporate fluff and I would never imagined that he would respond so aggressively to a customer.</p>
<p>In summary, the question about capacity planning was the highlight of the Slootman’s talk.  As the head of EMC’s Backup and Recovery division, you would think that Slootman would take a customer-centric approach to the situation.  Sure the end user was frustrated and perhaps even a bit aggressive, but I would have expected Slootman to take the high road and avoid alienating a customer in a public forum.  Clearly he did not.  Being the head of the division, it makes you wonder what values he is instilling in his own organization.  Is he suggesting that this is an appropriate way to treat customers?  Would it be acceptable for his reps to say the same thing to an agitated end user?  Clearly the answer should be no, but is it?</p>
<p>What do you think?  Is this an appropriate response to a question in a public forum?  Have you seen a similar attitude from others in his group?</p>

<p>Related posts:<ol>
<li><a href='http://www.aboutrestore.com/2009/06/29/poll-who-will-acquire-data-domain/' rel='bookmark' title='Poll: Who will acquire Data Domain?'>Poll: Who will acquire Data Domain?</a></li>
<li><a href='http://www.aboutrestore.com/2009/07/08/its-final-emc-acquires-data-domain/' rel='bookmark' title='It&#8217;s final &#8211; EMC acquires Data Domain'>It&#8217;s final &#8211; EMC acquires Data Domain</a></li>
<li><a href='http://www.aboutrestore.com/2010/04/14/data-domain-gda-%e2%80%93-bolt-on-to-the-rescue/' rel='bookmark' title='Data Domain &amp; GDA – Bolt-on to the rescue'>Data Domain &#038; GDA – Bolt-on to the rescue</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://www.aboutrestore.com/2009/10/20/data-domain-keynote-snw/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk: basic (Feed is rejected)
Page Caching using disk: enhanced (User agent is rejected)
Database Caching 1/94 queries in 0.067 seconds using disk: basic
Object Caching 2854/2973 objects using disk: basic

Served from: www.aboutrestore.com @ 2012-02-07 19:42:59 -->
