Cookies help us display personalized product recommendations and ensure you have great shopping experience.

By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
SmartData CollectiveSmartData Collective
  • Analytics
    AnalyticsShow More
    business using business intelligence
    How to Use a Competitive Intelligence Dashboard to Turn Market Data Into Smarter Marketing Decisions 
    9 Min Read
    unusual trading activity
    Signal Or Noise? A Decision Tree For Evaluating Unusual Trading Activity
    3 Min Read
    software developer using ai
    How Data Analytics Helps Developers Deliver Better Tech Services
    8 Min Read
    ai for stock trading
    Can Data Analytics Help Investors Outperform Warren Buffett
    9 Min Read
    media monitoring
    Signals In The Noise: Using Media Monitoring To Manage Negative Publicity
    5 Min Read
  • Big Data
  • BI
  • Exclusive
  • IT
  • Marketing
  • Software
Search
© 2008-25 SmartData Collective. All Rights Reserved.
Reading: book review “Data Preparation For Data Mining”
Share
Notification
Font ResizerAa
SmartData CollectiveSmartData Collective
Font ResizerAa
Search
  • About
  • Help
  • Privacy
Follow US
© 2008-23 SmartData Collective. All Rights Reserved.
SmartData Collective > Big Data > Data Mining > book review “Data Preparation For Data Mining”
Business IntelligenceData Mining

book review “Data Preparation For Data Mining”

TimManns
TimManns
2 Min Read
SHARE

Just before Christmas I bought myself yet another data mining book (i have a few dozen). This one somehow slipped by me for 10 years but I’m glad I finally stumbled upon it. Originally published in 1999, Dorian Pyle wrote “Data Preparation For Data Mining” before Data Mining was less wide spread and ‘Predictive Analytics’ wasn’t the buzz word it is today.

The only few criticisms I could possibly raise are;
1) that everything has a statistical ba…


Just before Christmas I bought myself yet another data mining book (i have a few dozen). This one somehow slipped by me for 10 years but I’m glad I finally stumbled upon it. Originally published in 1999, Dorian Pyle wrote “Data Preparation For Data Mining” before Data Mining was less wide spread and ‘Predictive Analytics’ wasn’t the buzz word it is today.

The only few criticisms I could possibly raise are;
1) that everything has a statistical basis.
– For example one technique I use to redistribute heavily skewed data is simple binning by count. I work in telecommunications and the behavioural data is always extremely skewed. Log functions don’t work so I often use SQL to convert variables into 100 percentile bins (where each bin has the same number of rows (customers) in it). That type of insight isn’t in the book, but several statistically based alternatives are. I’m not convinced they would work with extremely skewed data, but they are well explained and useful insights.
2) no mention of SQL or step-by-step examples of data manipulation (nothing like ‘before and ‘after’ pictures). Ideas or examples for derived variables are lacking too.

More Read

CRM Paradigm Shift
That’s Sick! Text Mining and Words with Multiple Definitions
How BI and Data Analytics Pros Used Twitter in December
Is Big Data Winning or Losing?
Bob Gourley on the Ethics, Analytics and Future of Big Data

So far I’ve read through the first 275 pages and the odd additional chapter. Its surprisingly easy to read and explains the statistics well. Its definately a book I will refer to, and well worth buying.

– Tim
Link to original post

Share This Article
Facebook Pinterest LinkedIn
Share

Follow us on Facebook

Latest News

ai product development
Why Businesses Outsource AI Product Development Companies
Exclusive News
banking tools
The Fintech and Banking Tools Global Entrepreneurs Rely On
Fintech Infographic
business using business intelligence
How to Use a Competitive Intelligence Dashboard to Turn Market Data Into Smarter Marketing Decisions 
Analytics Big Data Exclusive Marketing
fda14abd c869 4da5 943c c036ad8efc2e
How Data-Driven Journalists Are Using API News Apps to Improve Reporting
Big Data Exclusive News

Stay Connected

1.2KFollowersLike
33.7KFollowersFollow
222FollowersPin

You Might also Like

Guy Kawasaki’s Alltop Announces Version 3.0

3 Min Read

Harnessing Collective Intelligence in Decision Making through Big Data Analytics

27 Min Read

Knowledge work and micro-processes

1 Min Read

Letter to Operations: no more spreadsheets for Purchasing

4 Min Read

SmartData Collective is one of the largest & trusted community covering technical content about Big Data, BI, Cloud, Analytics, Artificial Intelligence, IoT & more.

AI and chatbots
Chatbots and SEO: How Can Chatbots Improve Your SEO Ranking?
Artificial Intelligence Chatbots Exclusive
giveaway chatbots
How To Get An Award Winning Giveaway Bot
Big Data Chatbots Exclusive

Quick Link

  • About
  • Contact
  • Privacy
Follow US
© 2008-25 SmartData Collective. All Rights Reserved.
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?