Deduplication is the process of removing duplicate files, or even duplicate blocks of data within files, to make more efficient use of storage space. Originally, the processing overhead involved relegated deduplication to backup systems. However, flash is now fast enough that deduplication can be used with primary storage architectures as well, and in the process, the effective capacity of SSD-based systems has increased, making them more competitive with HDD-based systems.

Getting Some Background

Deduplication as a concept was originally inspired by email. When someone sent a message with a file attached to hundreds of people, the email server would be swamped with identical copies. Instead, mail systems began storing a single copy of the attachment and sending each recipient a small message with a link to it, so the file was only downloaded when a recipient opened the message and double-clicked on the attachment.

Initial versions of deduplicated backups also worked at the file level. As files were backed up, the system looked for duplicate files, and any file that had already been backed up would not be backed up again. Steps were taken to ensure that the file was actually identical to the one already backed up, not just a file with the same name and size.
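
A minimal sketch of that file-level approach in Python may help make it concrete. It assumes the backup tool keeps an in-memory index of content hashes (the function names and hash-index design here are illustrative, not any particular vendor's implementation): a file is only copied if its SHA-256 digest has not been seen before, which catches true duplicates regardless of name, timestamp or size.

```python
import hashlib
import shutil
from pathlib import Path

def file_digest(path: Path) -> str:
    """Hash file contents so identity is based on the data, not the name or size."""
    h = hashlib.sha256()
    with path.open("rb") as f:
        for block in iter(lambda: f.read(1 << 20), b""):
            h.update(block)
    return h.hexdigest()

def backup_file(src: Path, dest_dir: Path, seen: dict[str, Path]) -> Path:
    """Copy src into dest_dir unless an identical file was already backed up."""
    digest = file_digest(src)
    if digest in seen:
        return seen[digest]          # duplicate: point at the existing copy
    dest = dest_dir / digest         # store the single copy under its content hash
    shutil.copy2(src, dest)
    seen[digest] = dest
    return dest
```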

An example that might jump to mind here is a file such as doc1.docx replicated in multiple users’ home directories, but deduplication reaches very high levels of efficiency when the full operating system and application directories of each PC are backed up. Many, or even most, of those files, amounting to dozens or hundreds of megabytes on every PC running the same version of Windows, will be identical from one system to the next. As a result, a full backup of 10 PCs will use only a little more space than a full backup of one.

Updating Hardware for In-Line Deduplication

Because of the time required to search through the data already stored on backup tapes or hard drives, the original deduplication systems used post-processing. Data was first backed up to a landing zone; the system then removed duplicates and rewrote the backup without the files that had already been stored.
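
A hedged sketch of that post-processing pass, assuming the landing zone is simply a directory of already-copied files (the index format and layout are illustrative assumptions): after the backup lands, everything is hashed, the first copy of each unique digest is kept, and later duplicates are dropped and recorded in an index so they can still be restored.

```python
import hashlib
import json
from pathlib import Path

def digest(path: Path) -> str:
    return hashlib.sha256(path.read_bytes()).hexdigest()

def dedupe_landing_zone(landing_zone: Path) -> None:
    """Post-process pass: remove duplicate files after the backup has landed."""
    first_copy: dict[str, str] = {}   # digest -> path of the retained copy
    duplicates: dict[str, str] = {}   # removed path -> retained path
    for path in sorted(p for p in landing_zone.rglob("*") if p.is_file()):
        d = digest(path)
        if d in first_copy:
            duplicates[str(path)] = first_copy[d]
            path.unlink()             # drop the redundant copy
        else:
            first_copy[d] = str(path)
    # Keep a map so deduplicated files can still be located on restore.
    (landing_zone / "dedupe-index.json").write_text(json.dumps(duplicates, indent=2))
```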

As systems became more efficient and processors became more powerful, in-line processing became possible, reducing the overall storage required for a landing zone. However, taking the deduplication process down to the block, or even sub-block, level, so that small pieces within files could be deduplicated, increased the processing load again.
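
Block-level deduplication extends the same idea below the file boundary. The sketch below, again illustrative rather than any product's actual on-disk format, splits incoming data into fixed 4 KiB blocks, stores each unique block once under its hash, and represents a file as an ordered list of block hashes, so two large files that differ in only a few blocks share almost all of their storage.

```python
import hashlib

BLOCK_SIZE = 4096  # fixed-size blocks; real systems may use variable-size chunking

class BlockStore:
    """Content-addressed store: each unique block is kept exactly once."""

    def __init__(self) -> None:
        self.blocks: dict[str, bytes] = {}

    def put(self, data: bytes) -> list[str]:
        """Split data into blocks and return the file's recipe (list of hashes)."""
        recipe = []
        for i in range(0, len(data), BLOCK_SIZE):
            block = data[i:i + BLOCK_SIZE]
            h = hashlib.sha256(block).hexdigest()
            self.blocks.setdefault(h, block)   # stored only if not already present
            recipe.append(h)
        return recipe

    def get(self, recipe: list[str]) -> bytes:
        """Reassemble a file from its recipe."""
        return b"".join(self.blocks[h] for h in recipe)
```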

The next big step in deduplication came with SSDs. SSDs are so much faster than hard drives or tape that in-line deduplication became practical not only for backups but for online data as well. This allows extra copies to be removed in real time, yielding the same kinds of savings in storage capacity for front-line systems. Even Windows Server 2016 now includes deduplication features.
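
In-line deduplication does that hashing on the write path itself, which is practical when the underlying media is fast enough, as it is with SSDs. A simplified sketch, assuming a volume that tracks reference counts per unique block (the class and method names are invented for illustration): a write whose data matches an existing block just bumps a counter instead of consuming new capacity.

```python
import hashlib

class InlineDedupVolume:
    """Toy primary-storage volume that deduplicates each block as it is written."""

    def __init__(self) -> None:
        self.store: dict[str, bytes] = {}   # hash -> unique physical block
        self.refs: dict[str, int] = {}      # hash -> reference count

    def write_block(self, data: bytes) -> str:
        """Write a logical block; duplicates cost only a reference, not capacity."""
        h = hashlib.sha256(data).hexdigest()
        if h not in self.store:
            self.store[h] = data            # new data: physically stored once
        self.refs[h] = self.refs.get(h, 0) + 1
        return h                            # logical address handed back to the caller

    def delete_block(self, h: str) -> None:
        """Release a logical block; space is freed when the last reference goes."""
        self.refs[h] -= 1
        if self.refs[h] == 0:
            del self.refs[h], self.store[h]
```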

Increasing Efficiencies

Some particular types of data can yield spectacular savings. As in the example above, operating system and application directories are largely the same from one system to the next, so virtual desktop infrastructure and server virtualization environments can host very large numbers of virtual systems that use very little more space than one system would.

On the other hand, systems that have very little in common from one dataset to the next, such as compressed graphics files or databases with fields encrypted throughout, will see relatively little benefit from deduplication, since compression and encryption make otherwise similar data look unique.

Still, in an age of ever-growing data, deduplication can provide a real-time way for organizations to keep data stores clean and easy to use, and to increase accessibility across the entire business.

Find the best storage solutions for your business by checking out our award-winning selection of SSDs for the enterprise.

Logan Harbaugh

Logan Harbaugh is an IT consultant and reviewer. He has worked in IT for over 20 years, and was a senior contributing editor with InfoWorld Labs as well as a senior technology editor at Information Week Labs. He has written reviews of enterprise IT products including storage, network switches, operating systems, and more for many publications and websites, including Storage Magazine, TechTarget.com, StateTech, Information Week, PC Magazine and Internet.com. He is the author of two books on network troubleshooting.
