DQ Alert: Easy Savings by Removing Dups

DataQualityEdge

According to TDWI surveys and studies, poor data quality costs more than $600 billion annually.

Whether you are a VP reviewing your latest sales figures, a secretary compiling a mailing list, a data quality analyst rummaging through data sets, a business analyst working on a data integration project, or an accountant going over the projected budget, duplicate records affect your work.

And whether you work in a large Fortune 500 company, in government, or in a small community association, you will benefit from removing them.

Duplicate records

Duplicate customer records lead to unhappy customers and redundant mail-outs. How do duplicate records come about?

  • Records are manually entered twice;
  • Processes create the same record twice;
  • Participants are registered under multiple names;
  • Participants are registered at multiple locations.

So before you send a mailing to your community members or potential program participants, whether for a marketing campaign or for sign-ups for this year’s sports, arts, sales, or membership drive seasons:

  1. Sort the list by name. Why? Because you may have the same child on the list two or three times.
  2. After sorting by name, sort by address. You may have the same household on the list multiple times because one of the parents registered under their own name, or because siblings are registered with your organization as well.
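
The two sorting passes above can also be sketched in Python. This is only a minimal illustration under stated assumptions: the sample records, the field names `name` and `address`, and the helpers `normalize` and `find_repeats` are all hypothetical. Matching is exact after lowercasing and trimming whitespace, so misspelled names or differently formatted addresses would still need a manual look.

```python
from collections import defaultdict

# Hypothetical mailing list; the field names are assumptions for illustration.
records = [
    {"name": "Sam Lee", "address": "12 Oak St"},
    {"name": "Ana Diaz", "address": "4 Elm Ave"},
    {"name": "sam lee ", "address": "12 Oak St"},
    {"name": "Kim Lee", "address": "12 Oak St"},
]

def normalize(value):
    """Lowercase and collapse whitespace so trivial variants compare equal."""
    return " ".join(value.lower().split())

def find_repeats(rows, field):
    """Sort rows by a normalized field and return the values that occur more than once."""
    groups = defaultdict(list)
    for row in sorted(rows, key=lambda r: normalize(r[field])):
        groups[normalize(row[field])].append(row)
    return {key: members for key, members in groups.items() if len(members) > 1}

# Pass 1: the same person listed more than once.
same_name = find_repeats(records, "name")
# Pass 2: the same household listed more than once.
same_address = find_repeats(records, "address")
```

Here `same_name` groups the two "Sam Lee" entries, while `same_address` groups all three records at the same address, which is the household case that sorting by name alone would miss.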

Don’t know how to sort? If you’re using Microsoft Excel, here’s a good way to start: highlight your records and click the Sort icon. It’s as easy as breaking eggs.

Or

If you are new to SQL-style programming, here’s a simple query that will identify duplicate records.

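-- Replace attribute1 ... attributeN with the columns that together identify a record.
SELECT attribute1, attribute2, attribute3, attributeN, COUNT(*) AS duplicate_count
FROM dbo.tablex
GROUP BY attribute1, attribute2, attribute3, attributeN
HAVING COUNT(*) > 1;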
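
To try the same GROUP BY and HAVING pattern without a database server, here is a small sketch using Python's built-in `sqlite3` module. The table `members`, its columns `name` and `address`, and the sample rows are assumptions made up for illustration. SQLite stands in for whatever database you actually use, so the `dbo.` schema prefix is dropped.

```python
import sqlite3

# In-memory database with a hypothetical members table (illustration only).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE members (name TEXT, address TEXT)")
conn.executemany(
    "INSERT INTO members VALUES (?, ?)",
    [
        ("Sam Lee", "12 Oak St"),
        ("Ana Diaz", "4 Elm Ave"),
        ("Sam Lee", "12 Oak St"),
    ],
)

# Group on the columns that define a duplicate and keep groups with more than one row.
duplicates = conn.execute(
    """
    SELECT name, address, COUNT(*) AS duplicate_count
    FROM members
    GROUP BY name, address
    HAVING COUNT(*) > 1
    """
).fetchall()
conn.close()
```

Each row in `duplicates` holds the grouped column values plus the number of times that combination appears, so a count of 2 means one extra copy to remove.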

I hope this helps anyone working with customer lists.
