Cookies help us display personalized product recommendations and ensure you have great shopping experience.

By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
SmartData CollectiveSmartData Collective
  • Analytics
    AnalyticsShow More
    media monitoring
    Signals In The Noise: Using Media Monitoring To Manage Negative Publicity
    5 Min Read
    data analytics
    How Data Analytics Can Help You Construct A Financial Weather Map
    4 Min Read
    financial analytics
    Financial Analytics Shows The Hidden Cost Of Not Switching Systems
    4 Min Read
    warehouse accidents
    Data Analytics and the Future of Warehouse Safety
    10 Min Read
    stock investing and data analytics
    How Data Analytics Supports Smarter Stock Trading Strategies
    4 Min Read
  • Big Data
  • BI
  • Exclusive
  • IT
  • Marketing
  • Software
Search
© 2008-25 SmartData Collective. All Rights Reserved.
Reading: How to Score 300,000,000 Customer Records for $3
Share
Notification
Font ResizerAa
SmartData CollectiveSmartData Collective
Font ResizerAa
Search
  • About
  • Help
  • Privacy
Follow US
© 2008-23 SmartData Collective. All Rights Reserved.
SmartData Collective > Analytics > Predictive Analytics > How to Score 300,000,000 Customer Records for $3
Predictive Analytics

How to Score 300,000,000 Customer Records for $3

MichaelZeller
MichaelZeller
5 Min Read
SHARE
Cloud computing promises lower cost and higher scalability. Translating this to a real-world practical application for predictive analytics, here is what this means for you in simple facts and numbers. With the Zementis ADAPA scoring engine on the Amazon Elastic Compute Cloud, you can score over 300 million (!) records for about $3, all in less than one hour.

Performance and scalability have been key design principles for ADAPA, in addition to open standards and Service Oriented Architecture (SOA). To illustrate this in a real-world benchmark, we measured the batch scoring performance for different Amazon EC2 instance types. Because computational efforts vary across different model types, we report only the average numbers measured for a collection of ten (10) different predictive models, each based on processing a data file containing 10 million records.

Figure: Average number of records processed per hour for each Amazon EC2 instance type. The average is based on 10 different PMML models, with the fastest instance scoring over 300 million records per hour.

We used ten different predictive models, including various regression models, neural network, clustering and decision tree …

More Read

Prescriptive Analytics – A Step Beyond Predictive Analytics
Can the Future of Mobile Be Found in Social? CI & CNBC Use Social Media Analytics to Find Out
Federated Clouds
Is There One “Right” Strategy to Implement Business Intelligence?
Is the Relational Database Doomed?

Cloud computing promises lower cost and higher scalability. Translating this to a real-world practical application for predictive analytics, here is what this means for you in simple facts and numbers. With the Zementis ADAPA scoring engine on the Amazon Elastic Compute Cloud, you can score over 300 million (!) records for about $3, all in less than one hour.

Performance and scalability have been key design principles for ADAPA, in addition to open standards and Service Oriented Architecture (SOA). To illustrate this in a real-world benchmark, we measured the batch scoring performance for different Amazon EC2 instance types. Because computational efforts vary across different model types, we report only the average numbers measured for a collection of ten (10) different predictive models, each based on processing a data file containing 10 million records.

Figure: Average number of records processed per hour for each Amazon EC2 instance type. The average is based on 10 different PMML models, with the fastest instance scoring over 300 million records per hour.

We used ten different predictive models, including various regression models, neural network, clustering and decision tree algorithms which were created in several statistical tools and then exported in the Predictive Model Markup Language (PMML) standard. The PMML models subsequently were deployed and executed in the ADAPA Predictive Analytics Edition on Amazon EC2.

The fastest instance (Amazon type High-CPU XL), ADAPA scored on average over 300 million records in one hour. One hour of the High-CPU XL instance costs US$2.49 (two dollars and forty nine cents), plus a few cents for the data transfer; all in all, it adds up to less than $3 for the task.

In addition to raw processing performance for scoring data, note that ADAPA remarkable accelerates the speed of deployment and integration for predictive analytics. While it is possible to scale processing speed with additional hardware, deployment and integration are the real bottlenecks for projects. Only a framework that leverages open standards for interoperability provides the necessary agility required for proper management and deployment of predictive models.

With cloud computing and Software as a Service (SaaS), ADAPA delivers an unprecedented cost/performance ratio for implementing predictive analytics across the enterprise. Sign up for ADAPA on Amazon EC2 instantly and start using it in just a few minutes! Starting at $1 per hour for a small instance and no long-term commitment required, experience for yourself what ADAPA does for your predictive models without breaking the bank. Use your own models or try ADAPA with our PMML model examples.

Comprehensive blog featuring topics related to predictive analytics with an emphasis on open standards, Predictive Model Markup Language (PMML), cloud computing, as well as the deployment and integration of predictive models in any business process.

Link to original post

TAGGED:adapaamazon ec2cloudpmml
Share This Article
Facebook Pinterest LinkedIn
Share

Follow us on Facebook

Latest News

data science professor
The Power of Warm-Ups: Setting the Stage for Learning
Exclusive News
cloud dataops for metering
Taming the IoT Firehose: How Utilities Are Scaling Cloud DataOps for Smart Metering
Cloud Computing Exclusive Internet of Things IT
ai in video game development
Machine Learning Is Changing iGaming Software Development
Exclusive Machine Learning News
media monitoring
Signals In The Noise: Using Media Monitoring To Manage Negative Publicity
Analytics Exclusive Infographic

Stay Connected

1.2KFollowersLike
33.7KFollowersFollow
222FollowersPin

You Might also Like

public cloud computing
Cloud Computing

Moving to the Public Cloud? Do the Math First

4 Min Read

Predictive Analytics in the Cloud Research on SmartData Collective

2 Min Read

Open Source Analytics Reaches Main Street (and Some Other Trends in Analytics)

8 Min Read

When Crisis Hits, Technology is Dumped, and Lizard Brains Take Over

4 Min Read

SmartData Collective is one of the largest & trusted community covering technical content about Big Data, BI, Cloud, Analytics, Artificial Intelligence, IoT & more.

ai is improving the safety of cars
From Bolts to Bots: How AI Is Fortifying the Automotive Industry
Artificial Intelligence
AI and chatbots
Chatbots and SEO: How Can Chatbots Improve Your SEO Ranking?
Artificial Intelligence Chatbots Exclusive

Quick Link

  • About
  • Contact
  • Privacy
Follow US
© 2008-25 SmartData Collective. All Rights Reserved.
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?