SmartData Collective

Book review: “Data Preparation For Data Mining”

TimManns

Just before Christmas I bought myself yet another data mining book (I have a few dozen). This one somehow slipped by me for 10 years, but I’m glad I finally stumbled upon it. Originally published in 1999, “Data Preparation For Data Mining” was written by Dorian Pyle when Data Mining was less widespread and ‘Predictive Analytics’ wasn’t the buzzword it is today.

The only criticisms I could possibly raise are:
1) that everything has a statistical basis.
– For example, one technique I use to redistribute heavily skewed data is simple binning by count. I work in telecommunications, and the behavioural data is always extremely skewed. Log functions don’t work, so I often use SQL to convert variables into 100 percentile bins (where each bin has the same number of rows (customers) in it). That type of insight isn’t in the book, but several statistically based alternatives are. I’m not convinced they would work with extremely skewed data, but they are well explained and offer useful insights.
2) that there is no mention of SQL and no step-by-step examples of data manipulation (nothing like ‘before’ and ‘after’ pictures). Ideas or examples for derived variables are lacking too.
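
As a rough illustration of the binning-by-count idea in the first point, here is a minimal sketch in Python rather than SQL. The function name `equal_count_bins` and the sample figures are hypothetical and only show the principle: rank the rows by value, then cut the ranking into bins that each hold (nearly) the same number of rows. Many SQL dialects offer an `NTILE` window function that behaves similarly, though the exact syntax depends on the database.

```python
def equal_count_bins(values, n_bins=100):
    """Assign each value a bin number from 1 to n_bins so that
    every bin holds (nearly) the same number of rows."""
    n = len(values)
    # Sort row positions by value; ties keep their original order.
    order = sorted(range(n), key=lambda i: values[i])
    bins = [0] * n
    for rank, i in enumerate(order):
        # Rows are split by rank, not by value, so skew has no effect.
        bins[i] = rank * n_bins // n + 1
    return bins


# Hypothetical, heavily skewed usage figures (one row per customer).
usage = [0, 0, 1, 1, 2, 3, 5, 8, 40, 5000]
print(equal_count_bins(usage, n_bins=5))  # [1, 1, 2, 2, 3, 3, 4, 4, 5, 5]
```

Because the split is made on rank, the extreme value 5000 simply lands in the top bin alongside 40. One side effect of this sketch is that tied values can fall into neighbouring bins when a bin boundary cuts through a run of identical values.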

So far I’ve read through the first 275 pages and the odd additional chapter. It’s surprisingly easy to read and explains the statistics well. It’s definitely a book I will refer to, and well worth buying.

– Tim