Cookies help us display personalized product recommendations and ensure you have great shopping experience.

By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
SmartData CollectiveSmartData Collective
  • Analytics
    AnalyticsShow More
    unusual trading activity
    Signal Or Noise? A Decision Tree For Evaluating Unusual Trading Activity
    3 Min Read
    software developer using ai
    How Data Analytics Helps Developers Deliver Better Tech Services
    8 Min Read
    ai for stock trading
    Can Data Analytics Help Investors Outperform Warren Buffett
    9 Min Read
    media monitoring
    Signals In The Noise: Using Media Monitoring To Manage Negative Publicity
    5 Min Read
    data analytics
    How Data Analytics Can Help You Construct A Financial Weather Map
    4 Min Read
  • Big Data
  • BI
  • Exclusive
  • IT
  • Marketing
  • Software
Search
© 2008-25 SmartData Collective. All Rights Reserved.
Reading: RulesFest 2011 – Kenny Shi: Scalability in a Real-Time Decision Platform
Share
Notification
Font ResizerAa
SmartData CollectiveSmartData Collective
Font ResizerAa
Search
  • About
  • Help
  • Privacy
Follow US
© 2008-23 SmartData Collective. All Rights Reserved.
SmartData Collective > Business Intelligence > Business Rules > RulesFest 2011 – Kenny Shi: Scalability in a Real-Time Decision Platform
Business Rules

RulesFest 2011 – Kenny Shi: Scalability in a Real-Time Decision Platform

CMatignon
CMatignon
5 Min Read
SHARE

My friend Kenny has decided to also talk about performance.  Given the magnitude of the traffic on eBay and the number of rules they need to activate, I can’t think of a best person to talk about scalability issues.  Disclaimer, Kenny will be my co-presenter next week at Business Rules Forum 😉

My friend Kenny has decided to also talk about performance.  Given the magnitude of the traffic on eBay and the number of rules they need to activate, I can’t think of a best person to talk about scalability issues.  Disclaimer, Kenny will be my co-presenter next week at Business Rules Forum 😉

Kenny ShiSo many ways to fraud…  We know quite a bit about credit card and debit card fraud, and a little about insurance fraud, from our FICO days.  Fraud for online e-commerce is quite a bit more diverse.  Those fraudsters are incredibly creative when it comes to finding weaknesses in the system.  Kenny’s examples of fraud always impressed me as I do not believe I could have come up with nearly half of those scams!

The fraud detection architecture applies rules and models as a service, invoked either synchronously (SOAP interface to the application) or asynchronously (via a message bus).  The presentation focuses on making faster and more scalable real-time decisions.

From a developer / architect perspective, it would be more comfortable to preocess the cases offline of course but the nature of the business requires more timely detection – both in terms of revenue protection and customer satisfaction.

Fraud detection is the result of expert rules and data-driven analytics.  Neural nets can detect trends that are buried in past transactions.  Rules address flash fraud detection, e.g. patterns identified by fraud experts before models have had a chance to learn about those.  As expected both of them require a ton of data to make good decisions, including some sophisticated computations.  Data retrieval and normalization takes most of the transaction time.  In the end, neural net processing represents only 2% of the time, and rules 3%.

Logically, many performance efforts by Kenny’s team have been focused on data.  Data access is one area that Kenny covered.  More interestingly, he analyzed other related avenues that relate more directly to decisioning “optimizations”:

  • Unnecessary variables
  • Excessive logic
  • Conflicting logic

Independently, optimization techniques:

  • Pre-fetch — although it goes against the very flexible and dynamic nature of rules
  • Lazy load: only when needed, with smart ordering of conditions from most discriminating / cheaper to least — not always easy to estimate though
  • Pre-aggregation of data — freshness can be addressed by aggregating up to “yesterday” for example and then look at real-time data for the current day only

Distributed locality could be addressed with distributed hash tables and other architecture tactics that Kenny quickly went through.

Horizontal scability with server clusters, etc.  Database partition can help, based on data access patterns — per seller, per IP address.

Scalability for Rule Authoring

Sharing variables implies srong capabilities to catalog and describe variables so that all modelers and rules writers can safely and conveniently reuse those characteristics.

Great plug for Sparkling Logic — Thanks!  But seriously, Kenny really appreciated the fluid metaphors we came up with, which help address that very set of problems.

Scalability for Rule Deployment

The continuous integration of rules allows to detect problems earlier and more frequently.  As an added bonus the deployment is ready when it is time to deploy.

Statistical testing is similar to Champion Challenger in the sense that it allows the experts to compare the Production rules with the new Testing rules.  Kenny went a little fast

Other techniques like smoketests and incremental deployment allow to test without impacting the entire workload.

Dog-food approach: Kenny also uses rules to monitor rules.  Smart.

As you can imagine, constant monitoring allows to ensure that they are on-target with their performance objectives.

Closing remark: Fraudsters are very creative so catching them is like a game of Catcha Mouse!

 

Wow…  What a fast speaker!

Check out the abstract here.

 

Powered By WizardRSS.com | Full Text RSS Feed | Amazon Plugin | Settlement Statement

Share This Article
Facebook Pinterest LinkedIn
Share

Follow us on Facebook

Latest News

fda14abd c869 4da5 943c c036ad8efc2e
How Data-Driven Journalists Are Using API News Apps to Improve Reporting
Big Data Exclusive News
0622cae5 f7d7 4f74 84b5 eabd1a823dca
How Data-Driven Grocery Recommendations Help Shoppers Eat Better With Less Effort
Big Data Exclusive
business recovering from data loss
How Data-Driven Businesses Protect MySQL Databases from Shutdown
Big Data Exclusive
ai driven task management
Reducing “Work About Work” with AI Task Managers
Artificial Intelligence Exclusive

Stay Connected

1.2KFollowersLike
33.7KFollowersFollow
222FollowersPin

You Might also Like

Big Data
Best PracticesBig DataBusiness Rules

Is Complexity Strangling the Real-World Benefits of Big Data for SMB’s?

6 Min Read
Market Trends
AnalyticsBig DataBusiness IntelligenceBusiness RulesData QualityPredictive AnalyticsWeb Analytics

In a World Full of Data, Can Analytics See the Market Trends?

4 Min Read
china zip code business intelligence
Business IntelligenceBusiness RulesExclusive

How China’s Zipcode System Fuels Business Intelligence

7 Min Read

New White Paper: Decisions at the Heart of Your Processes

2 Min Read

SmartData Collective is one of the largest & trusted community covering technical content about Big Data, BI, Cloud, Analytics, Artificial Intelligence, IoT & more.

AI and chatbots
Chatbots and SEO: How Can Chatbots Improve Your SEO Ranking?
Artificial Intelligence Chatbots Exclusive
giveaway chatbots
How To Get An Award Winning Giveaway Bot
Big Data Chatbots Exclusive

Quick Link

  • About
  • Contact
  • Privacy
Follow US
© 2008-25 SmartData Collective. All Rights Reserved.
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?