
Book review: “Data Preparation For Data Mining”

TimManns

Just before Christmas I bought myself yet another data mining book (I have a few dozen). This one somehow slipped by me for 10 years, but I’m glad I finally stumbled upon it. Dorian Pyle’s “Data Preparation For Data Mining” was originally published in 1999, when data mining was less widespread and ‘predictive analytics’ wasn’t the buzzword it is today.

The only criticisms I could possibly raise are:
1) Everything has a statistical basis.
– For example, one technique I use to redistribute heavily skewed data is simple binning by count. I work in telecommunications, and the behavioural data is always extremely skewed. Log functions don’t work, so I often use SQL to convert variables into 100 percentile bins, where each bin has the same number of rows (customers) in it. That type of insight isn’t in the book, but several statistically based alternatives are. I’m not convinced they would work with extremely skewed data, but they are well explained and useful insights.
2) There is no mention of SQL and there are no step-by-step examples of data manipulation (nothing like ‘before’ and ‘after’ pictures). Ideas or examples for derived variables are lacking too.
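
As a rough illustration of the binning-by-count idea in point 1, here is a minimal Python sketch. The review describes doing this in SQL; the function name `percentile_bins` and the sample `usage` data below are hypothetical and are not taken from the book or from any real customer data.

```python
from collections import Counter


def percentile_bins(values, n_bins=100):
    """Give each value a bin number from 1 to n_bins so that the bins
    hold (as nearly as possible) the same number of rows."""
    n = len(values)
    # Rank the rows by value; ties keep their original relative order.
    order = sorted(range(n), key=lambda i: values[i])
    bins = [0] * n
    for rank, i in enumerate(order):
        # Map rank 0..n-1 onto bins 1..n_bins in equal-sized slices.
        bins[i] = rank * n_bins // n + 1
    return bins


# Hypothetical, heavily skewed usage figures for 1,000 customers.
usage = [x ** 4 for x in range(1000)]
bins = percentile_bins(usage)
rows_per_bin = Counter(bins)  # every bin ends up with 10 customers here
```

Because the bins are assigned by rank rather than by value, the shape of the original distribution does not matter. One caveat of this sketch is that customers with identical values (for example, many zeros) can land in different bins.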

So far I’ve read through the first 275 pages and the odd additional chapter. It’s surprisingly easy to read and explains the statistics well. It’s definitely a book I will refer to, and well worth buying.

– Tim