SmartData Collective
© 2008-25 SmartData Collective. All Rights Reserved.

How to Improve Predictive Accuracy? (Part 1)

Editor SDC
“Prediction is difficult, especially about the future.” – Yogi Berra (a baseball catcher)

One model or multiple models?

Many articles and blog posts have been written about Predictive Analytics and its role in improving business processes, reducing operational costs and increasing revenue, among other things (see, for example, Eric Siegel’s article “Predictive Analytics with Data Mining: How It Works”). Despite its widespread use and popularity, we often hear, “Past performance is no guarantee of future results…” A question therefore arises naturally: how can we improve predictive accuracy and so make predictions more reliable? This post discusses one possible solution.

To understand the logic behind the solution, consider the following scenario. Suppose your Business Intelligence software has developed a suitable regression model to forecast Sales Volume for the next quarter, scrupulously following every step of the model-building process. Suppose further that the model has been validated with one of the standard validation procedures, such as cross-validation or the bootstrap. Your model (the “Expert”) is now ready for deployment.

Let us compare this strategy with a real-life scenario: when a Board of Directors has to make a critical decision, it consults several experts instead of just one. If that is the case, then when a critical revenue-generation or cost-cutting plan is about to be launched, why not base the decision on multiple models instead of relying on a single one, as planned above? That is precisely what we are going to do here.

Bootstrap Aggregating (Bagging)

This technique was proposed by Breiman (1996) to improve the predictive reliability of unstable models such as Decision Trees. Bagging and Boosting are two widely used strategies for increasing predictive accuracy. This post discusses the Bagging technique.

Traditionally, a predictive model, say a regression model or a Decision Tree, is developed using a given training set D. In the Bagging method, k new training sets Di (i = 1, 2, 3, …, k, where k is some suitable number) are drawn from D, each of the same size as D. Each Di is generated by random sampling with replacement from the original set D, a procedure called Bootstrap sampling, so some records of D appear more than once in a given Di and others not at all. A predictive model Mi is then developed on each bootstrap sample Di. This gives an ensemble of k models, as shown in the figure below:

[Figure: ensemble of k models built from the bootstrap samples of D]
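The sampling and model-building steps described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the tool used in this post: the helper names `bootstrap_samples` and `fit_line`, the choice of a straight-line regression as the base model, and the small data set are all assumptions made for the example.

```python
import random

def bootstrap_samples(D, k, seed=0):
    """Return k bootstrap samples, each drawn with replacement and the same size as D."""
    rng = random.Random(seed)
    return [rng.choices(D, k=len(D)) for _ in range(k)]

def fit_line(sample):
    """Ordinary least squares for y = a + b*x on a list of (x, y) pairs."""
    n = len(sample)
    mean_x = sum(x for x, _ in sample) / n
    mean_y = sum(y for _, y in sample) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in sample)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in sample)
    slope = sxy / sxx if sxx else 0.0  # guard: every x in the sample is identical
    return mean_y - slope * mean_x, slope  # (intercept, slope)

# Made-up (x, y) training pairs, for illustration only.
D = [(1, 2.1), (2, 3.9), (3, 6.2), (4, 7.8), (5, 10.1), (6, 12.2)]

samples = bootstrap_samples(D, k=25)     # D1 ... Dk
models = [fit_line(s) for s in samples]  # M1 ... Mk, one model per sample
```

Because the draws are made with replacement, each sample has as many records as D but usually contains duplicates, which is what makes the k fitted models differ from one another.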

If your goal is to predict a value, say Sales Volume for the next quarter, then the Bagging rule is to use each model Mi to predict future sales and take the average of the k predicted values as the final prediction. If your goal is to build a classifier, say to identify a churner or a loan defaulter, then use each of the k models to classify the customer and base your final decision on a ‘majority vote’.

The bagged predictor or classifier is often more accurate than a single model or classifier built on the data D. This happens because the aggregation process averages out the instability (variance) present in any single model. The following case illustrates the advantage of Bagging:
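The two aggregation rules, averaging for prediction and majority vote for classification, might look like this in Python. The function names and the three stand-in "models" (plain lambdas with made-up behaviour) are hypothetical; in practice each one would be a model fitted on its own bootstrap sample.

```python
from collections import Counter
from statistics import mean

def bagged_prediction(models, x):
    """Regression rule: average the predictions of all k models."""
    return mean(m(x) for m in models)

def bagged_class(models, x):
    """Classification rule: return the label that most models vote for."""
    votes = Counter(m(x) for m in models)
    return votes.most_common(1)[0][0]

# Hypothetical stand-ins for fitted models (illustration only).
regressors = [lambda x: 2.0 * x, lambda x: 2.5 * x, lambda x: 1.5 * x]
classifiers = [lambda x: "churner", lambda x: "loyal", lambda x: "churner"]

avg = bagged_prediction(regressors, 10)   # mean of 20.0, 25.0 and 15.0
label = bagged_class(classifiers, None)   # two of the three votes say "churner"
```

For a two-class problem, choosing an odd k is a simple way to rule out tied votes.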


Sales Forecast

Let us fit a regression model to predict Sales Volume from the amount spent on Advertising. Once the model is fitted, we apply the Bagging procedure to obtain the predictions. The bar chart above displays the Sales forecasts before Bagging (blue bars) and after Bagging (green bars), along with their confidence levels. As the chart shows, the Sales forecasts after Bagging are far more reliable.
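A comparable experiment can be sketched in Python as follows. The data here are synthetic and the numbers it prints are not the ones behind the chart: the advertising range, the noise level, the budget `new_spend` and the ensemble size `k` are all assumptions chosen for illustration.

```python
import random
from statistics import mean, stdev

def fit_line(pairs):
    """Least-squares fit of sales = a + b * advertising."""
    n = len(pairs)
    mx = sum(x for x, _ in pairs) / n
    my = sum(y for _, y in pairs) / n
    sxx = sum((x - mx) ** 2 for x, _ in pairs)
    sxy = sum((x - mx) * (y - my) for x, y in pairs)
    b = sxy / sxx if sxx else 0.0  # guard: every x in the sample is identical
    return my - b * mx, b

rng = random.Random(42)
# Synthetic data (an assumption, not the article's): advertising spend and noisy sales.
data = [(adv, 50 + 3 * adv + rng.gauss(0, 20)) for adv in range(10, 110, 5)]

new_spend = 60  # hypothetical advertising budget for the next quarter

# Forecast from a single model fitted on all of the data.
a, b = fit_line(data)
single_forecast = a + b * new_spend

# Forecasts from k models, each fitted on its own bootstrap sample.
k = 200
forecasts = []
for _ in range(k):
    sample = rng.choices(data, k=len(data))
    a_i, b_i = fit_line(sample)
    forecasts.append(a_i + b_i * new_spend)

bagged_forecast = mean(forecasts)  # the Bagging rule for regression
spread = stdev(forecasts)          # how much the individual models disagree

print(f"single model: {single_forecast:.1f}")
print(f"bagged (k={k}): {bagged_forecast:.1f} +/- {spread:.1f}")
```

The spread of the k individual forecasts gives a rough sense of how uncertain the forecast is. A straight-line regression is a fairly stable model, so the gain from Bagging in a sketch like this is usually modest; it tends to be larger for unstable models such as Decision Trees.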

In my next post, I will discuss the Boosting technique for improving predictive accuracy.
