Cookies help us display personalized product recommendations and ensure you have great shopping experience.

By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
SmartData CollectiveSmartData Collective
  • Analytics
    AnalyticsShow More
    image fx (67)
    Improving LinkedIn Ad Strategies with Data Analytics
    9 Min Read
    big data and remote work
    Data Helps Speech-Language Pathologists Deliver Better Results
    6 Min Read
    data driven insights
    How Data-Driven Insights Are Addressing Gaps in Patient Communication and Equity
    8 Min Read
    pexels pavel danilyuk 8112119
    Data Analytics Is Revolutionizing Medical Credentialing
    8 Min Read
    data and seo
    Maximize SEO Success with Powerful Data Analytics Insights
    8 Min Read
  • Big Data
  • BI
  • Exclusive
  • IT
  • Marketing
  • Software
Search
© 2008-25 SmartData Collective. All Rights Reserved.
Reading: How to Score 300,000,000 Customer Records for $3
Share
Notification
Font ResizerAa
SmartData CollectiveSmartData Collective
Font ResizerAa
Search
  • About
  • Help
  • Privacy
Follow US
© 2008-23 SmartData Collective. All Rights Reserved.
SmartData Collective > Analytics > Predictive Analytics > How to Score 300,000,000 Customer Records for $3
Predictive Analytics

How to Score 300,000,000 Customer Records for $3

MichaelZeller
MichaelZeller
5 Min Read
SHARE
Cloud computing promises lower cost and higher scalability. Translating this to a real-world practical application for predictive analytics, here is what this means for you in simple facts and numbers. With the Zementis ADAPA scoring engine on the Amazon Elastic Compute Cloud, you can score over 300 million (!) records for about $3, all in less than one hour.

Performance and scalability have been key design principles for ADAPA, in addition to open standards and Service Oriented Architecture (SOA). To illustrate this in a real-world benchmark, we measured the batch scoring performance for different Amazon EC2 instance types. Because computational efforts vary across different model types, we report only the average numbers measured for a collection of ten (10) different predictive models, each based on processing a data file containing 10 million records.

Figure: Average number of records processed per hour for each Amazon EC2 instance type. The average is based on 10 different PMML models, with the fastest instance scoring over 300 million records per hour.

We used ten different predictive models, including various regression models, neural network, clustering and decision tree …

More Read

Text Mining and Regular Expressions
A conversation with Jay Kreps about Project Voldemort
R, the FDA, and clinical trials
First Look: FICO Decision Optimizer
The Stakeholders

Cloud computing promises lower cost and higher scalability. Translating this to a real-world practical application for predictive analytics, here is what this means for you in simple facts and numbers. With the Zementis ADAPA scoring engine on the Amazon Elastic Compute Cloud, you can score over 300 million (!) records for about $3, all in less than one hour.

Performance and scalability have been key design principles for ADAPA, in addition to open standards and Service Oriented Architecture (SOA). To illustrate this in a real-world benchmark, we measured the batch scoring performance for different Amazon EC2 instance types. Because computational efforts vary across different model types, we report only the average numbers measured for a collection of ten (10) different predictive models, each based on processing a data file containing 10 million records.

Figure: Average number of records processed per hour for each Amazon EC2 instance type. The average is based on 10 different PMML models, with the fastest instance scoring over 300 million records per hour.

We used ten different predictive models, including various regression models, neural network, clustering and decision tree algorithms which were created in several statistical tools and then exported in the Predictive Model Markup Language (PMML) standard. The PMML models subsequently were deployed and executed in the ADAPA Predictive Analytics Edition on Amazon EC2.

The fastest instance (Amazon type High-CPU XL), ADAPA scored on average over 300 million records in one hour. One hour of the High-CPU XL instance costs US$2.49 (two dollars and forty nine cents), plus a few cents for the data transfer; all in all, it adds up to less than $3 for the task.

In addition to raw processing performance for scoring data, note that ADAPA remarkable accelerates the speed of deployment and integration for predictive analytics. While it is possible to scale processing speed with additional hardware, deployment and integration are the real bottlenecks for projects. Only a framework that leverages open standards for interoperability provides the necessary agility required for proper management and deployment of predictive models.

With cloud computing and Software as a Service (SaaS), ADAPA delivers an unprecedented cost/performance ratio for implementing predictive analytics across the enterprise. Sign up for ADAPA on Amazon EC2 instantly and start using it in just a few minutes! Starting at $1 per hour for a small instance and no long-term commitment required, experience for yourself what ADAPA does for your predictive models without breaking the bank. Use your own models or try ADAPA with our PMML model examples.

Comprehensive blog featuring topics related to predictive analytics with an emphasis on open standards, Predictive Model Markup Language (PMML), cloud computing, as well as the deployment and integration of predictive models in any business process.

Link to original post

TAGGED:adapaamazon ec2cloudpmml
Share This Article
Facebook Pinterest LinkedIn
Share

Follow us on Facebook

Latest News

image fx (2)
Monitoring Data Without Turning into Big Brother
Big Data Exclusive
image fx (71)
The Power of AI for Personalization in Email
Artificial Intelligence Exclusive Marketing
image fx (67)
Improving LinkedIn Ad Strategies with Data Analytics
Analytics Big Data Exclusive Software
big data and remote work
Data Helps Speech-Language Pathologists Deliver Better Results
Analytics Big Data Exclusive

Stay Connected

1.2kFollowersLike
33.7kFollowersFollow
222FollowersPin

You Might also Like

Scorecards in PMML: A Primer

8 Min Read
Cloud Computing
Cloud ComputingIT

5 Reasons You Should Be Using Cloud Computing in 2018

5 Min Read

Big Data Is More Valuable with Kapow

6 Min Read

Componentizing Software

8 Min Read

SmartData Collective is one of the largest & trusted community covering technical content about Big Data, BI, Cloud, Analytics, Artificial Intelligence, IoT & more.

AI chatbots
AI Chatbots Can Help Retailers Convert Live Broadcast Viewers into Sales!
Chatbots
ai in ecommerce
Artificial Intelligence for eCommerce: A Closer Look
Artificial Intelligence

Quick Link

  • About
  • Contact
  • Privacy
Follow US
© 2008-25 SmartData Collective. All Rights Reserved.
Go to mobile version
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?