SmartData Collective

Book review: “Data Preparation For Data Mining”

TimManns

Just before Christmas I bought myself yet another data mining book (I have a few dozen). This one somehow slipped by me for 10 years, but I’m glad I finally stumbled upon it. Dorian Pyle’s “Data Preparation For Data Mining” was originally published in 1999, when data mining was less widespread and ‘predictive analytics’ wasn’t the buzzword it is today.

The only criticisms I could raise are:
1) Everything has a statistical basis.
– For example, one technique I use to redistribute heavily skewed data is simple binning by count. I work in telecommunications, and the behavioural data is always extremely skewed. Log functions don’t work, so I often use SQL to convert variables into 100 percentile bins (where each bin holds the same number of rows, i.e. customers). That type of insight isn’t in the book, but several statistically based alternatives are. I’m not convinced they would work with extremely skewed data, but they are well explained and useful.
2) There is no mention of SQL and no step-by-step examples of data manipulation (nothing like ‘before’ and ‘after’ pictures). Ideas or examples for derived variables are lacking too.
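
As a rough illustration of the binning-by-count idea in point 1, here is a minimal sketch using Python's built-in sqlite3 module. The table name customer_usage, the column call_minutes, and the synthetic data are all hypothetical (the review gives no schema), and using the NTILE window function is an assumption about how this could be done, not necessarily the SQL used in practice. It needs SQLite 3.25 or later for window functions.

```python
import sqlite3

# Hypothetical table and column names; the review does not give a schema.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customer_usage (customer_id INTEGER PRIMARY KEY, call_minutes REAL)"
)

# Synthetic, heavily skewed data: 990 ordinary customers and 10 extreme ones.
rows = [(i, float(i % 50)) for i in range(1, 991)]
rows += [(i, 10000.0 * i) for i in range(991, 1001)]
conn.executemany("INSERT INTO customer_usage VALUES (?, ?)", rows)

# NTILE(100) sorts the rows by the variable and splits them into 100 groups
# of equal row count, so every bin holds the same number of customers.
# customer_id is added to the ORDER BY only to make ties deterministic.
binned = conn.execute(
    """
    SELECT customer_id,
           call_minutes,
           NTILE(100) OVER (ORDER BY call_minutes, customer_id) AS pct_bin
    FROM customer_usage
    """
).fetchall()

# Count how many customers landed in each bin.
bin_counts = {}
for _customer_id, _minutes, pct_bin in binned:
    bin_counts[pct_bin] = bin_counts.get(pct_bin, 0) + 1

# Look up the bin assigned to each customer.
bin_of = {customer_id: pct_bin for customer_id, _minutes, pct_bin in binned}
```

With these 1,000 synthetic rows every bin ends up with 10 customers, and the ten extreme customers simply occupy the top bin instead of stretching the scale. The bin number (1 to 100) then stands in for the raw value as the model input.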

So far I’ve read through the first 275 pages and the odd additional chapter. It’s surprisingly easy to read and explains the statistics well. It’s definitely a book I will refer to, and well worth buying.

– Tim