By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
SmartData Collective
  • Analytics
    AnalyticsShow More
    predictive analytics in dropshipping
    Predictive Analytics Helps New Dropshipping Businesses Thrive
    12 Min Read
    data-driven approach in healthcare
    The Importance of Data-Driven Approaches to Improving Healthcare in Rural Areas
    6 Min Read
    analytics for tax compliance
    Analytics Changes the Calculus of Business Tax Compliance
    8 Min Read
    big data analytics in gaming
    The Role of Big Data Analytics in Gaming
    10 Min Read
    analyst,women,looking,at,kpi,data,on,computer,screen
    Promising Benefits of Predictive Analytics in Asset Management
    11 Min Read
  • Big Data
  • BI
  • Exclusive
  • IT
  • Marketing
  • Software
Search
© 2008-23 SmartData Collective. All Rights Reserved.
Reading: More Ways to get a Scoring Model wrong
Share
Notification Show More
Latest News
ai digital marketing tools
Top Five AI-Driven Digital Marketing Tools in 2023
Artificial Intelligence
ai-generated content
Is AI-Generated Content a Net Positive for Businesses?
Artificial Intelligence
predictive analytics in dropshipping
Predictive Analytics Helps New Dropshipping Businesses Thrive
Predictive Analytics
cloud data security in 2023
Top Tools for Your Cloud Data Security Stack in 2023
Cloud Computing
become a data scientist
Boosting Your Chances for Landing a Job as a Data Scientist
Jobs
Aa
SmartData Collective
Aa
Search
  • About
  • Help
  • Privacy
Follow US
© 2008-23 SmartData Collective. All Rights Reserved.
SmartData Collective > Big Data > Data Mining > More Ways to get a Scoring Model wrong
Data Mining

More Ways to get a Scoring Model wrong

Editor SDC
Last updated: 2009/03/11 at 10:19 AM
Editor SDC
5 Min Read
SHARE

I got the following answer from Linkedin groups**

 

on my Ten Ways to get a Scoring Model Wrong.

  1.  Typo 
  2. Refuse to use central tendency to patch missing values. Instead, assign highest response rate because WOE says so 
  3. Marketing people tell me to force the variable into the model 
  4.  Selection bias 
  5.  Forgot to segment 
  6. Solely rely on data to segment without consulting the biz side 
  7.  Just delete observations with missing values, OK, without studying geometricl boundaries 
  8.  Using oversampling, but refuse to weight it back. That boosts lift, right? Let us do 50-50 
  9. Insist random sampling is sufficient, while stratified sampling is critical 
  10. Binning too much, or two little 
  11. Selecting variables without repeated sampling 
  12. Forgot to exclude numeric customer id from the candidate variables. AND,it pops….Well, both Unica and Kxen accepted it, So I see no problem 
  13. When the same variable is sourced by different vendors, did not look up the scales under the same name. Just combine them 
  14.  Well, SAS Enterprise Miner gave me this mode…

More Read

Interview: Françoise Soulie Fogelman, KXEN

Physicists, models, and the credit crisis
Ten ways to build a wrong scoring model

I got the following answer from Linkedin groups**

 

on my Ten Ways to get a Scoring Model Wrong.

  1.  Typo 
  2. Refuse to use central tendency to patch missing values. Instead, assign highest response rate because WOE says so 
  3. Marketing people tell me to force the variable into the model 
  4.  Selection bias 
  5.  Forgot to segment 
  6. Solely rely on data to segment without consulting the biz side 
  7.  Just delete observations with missing values, OK, without studying geometricl boundaries 
  8.  Using oversampling, but refuse to weight it back. That boosts lift, right? Let us do 50-50 
  9. Insist random sampling is sufficient, while stratified sampling is critical 
  10. Binning too much, or two little 
  11. Selecting variables without repeated sampling 
  12. Forgot to exclude numeric customer id from the candidate variables. AND,it pops….Well, both Unica and Kxen accepted it, So I see no problem 
  13. When the same variable is sourced by different vendors, did not look up the scales under the same name. Just combine them 
  14.  Well, SAS Enterprise Miner gave me this model yesterday 
  15. The binary variable is statistically significant, but there are only 27 event=1, out of ~1mm, since only 27 made some purchases.. 
  16. Well, I only have 250 events=1. But I think I can use exact logistic to make it up, all right? I got a PHD in Statistics, Trust me, my professor is OK with it. I just called her. 
  17.  Build two-stage model without Heckman adjustment 
  18. Use global mean over the WHOLE customer base to replace missing value on a much smaller universe/subset. So average networth of a high networth client group has 22% worth only 225K 
  19. I just spent the past two days boosting R-square. Now it is 92. Great. 
  20. Forgot to set descending option in proc logistic in SAS 
  21. I think we should hold out missing values when conducting EDA. 
  22. Without proper separation of ‘treatment and control 
  23. Treat business entities and individuals as equal and mix them in the same universe
  24. Runing clustering without validation 
  25. Running discriminant model without validation. So correct classification rate on development is 89% and that over validation is …35%.(no wonder you finished it in two hours and came here to ask me for a raise) 
  26. Disregard link function in multi-nomil models 
  27. I think this is a better variable: xnew=y*y*y*. It is the top variable dominating others. 
  28. Use standardized coefficient to calculate relative importance, because many people are doing and marketing loves it. 
  29. I tried Goolge Analtyics last Friday. It recommends this variable: click stream density over Thanksgivning weekend, on my web portal, on this item 
  30.  Let us treat this matrix as unary so we can apply Euclidean, since that runs faster and has a lot of optimal properties. It makes our life easier 
  31. Let us use score from that model to boost this model and use score from this model to boost it back. Is that what they call neural nets, Jia? 

Enough?

 

31 Ways to get a model wrong – and Hats off to a fellow mate in suffering -Jia**

 http://www.linkedin.com/groupAnswers?viewQuestionAndAnswers=&gid=53432&discussionID=1946379&commentID=2213879&goback=.mgr_false_0_DATE.mgr_true_1_DATE.mid_1066685320#commentID_2213879

Coming up – One Way to get a scoring model correct

Share/Save/Bookmark

TAGGED: modelling, scoring models
Editor SDC March 11, 2009
Share this Article
Facebook Twitter Pinterest LinkedIn
Share

Follow us on Facebook

Latest News

ai digital marketing tools
Top Five AI-Driven Digital Marketing Tools in 2023
Artificial Intelligence
ai-generated content
Is AI-Generated Content a Net Positive for Businesses?
Artificial Intelligence
predictive analytics in dropshipping
Predictive Analytics Helps New Dropshipping Businesses Thrive
Predictive Analytics
cloud data security in 2023
Top Tools for Your Cloud Data Security Stack in 2023
Cloud Computing

Stay Connected

1.2k Followers Like
33.7k Followers Follow
222 Followers Pin

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

[mc4wp_form id=”1616″]

You Might also Like

Interview: Françoise Soulie Fogelman, KXEN

11 Min Read

Physicists, models, and the credit crisis

3 Min Read

Ten ways to build a wrong scoring model

3 Min Read

SmartData Collective is one of the largest & trusted community covering technical content about Big Data, BI, Cloud, Analytics, Artificial Intelligence, IoT & more.

AI and chatbots
Chatbots and SEO: How Can Chatbots Improve Your SEO Ranking?
Artificial Intelligence Chatbots Exclusive
AI chatbots
AI Chatbots Can Help Retailers Convert Live Broadcast Viewers into Sales!
Chatbots

Quick Link

  • About
  • Contact
  • Privacy
Follow US

© 2008-23 SmartData Collective. All Rights Reserved.

Removed from reading list

Undo
Go to mobile version
Welcome Back!

Sign in to your account

Lost your password?