How to Score 300,000,000 Customer Records for $3

MichaelZeller
Cloud computing promises lower cost and higher scalability. Here is what that means for a practical predictive analytics application, in simple facts and numbers: with the Zementis ADAPA scoring engine on the Amazon Elastic Compute Cloud (EC2), you can score over 300 million (!) records for about $3, all in less than one hour.

Performance and scalability have been key design principles for ADAPA, in addition to open standards and Service Oriented Architecture (SOA). To illustrate this in a real-world benchmark, we measured the batch scoring performance for different Amazon EC2 instance types. Because computational effort varies across model types, we report only the average throughput measured across a collection of ten (10) different predictive models, each processing a data file containing 10 million records.

Figure: Average number of records processed per hour for each Amazon EC2 instance type. The average is based on 10 different PMML models, with the fastest instance scoring over 300 million records per hour.
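
To make the metric concrete, here is a minimal sketch of how a records-per-hour figure could be derived from batch timings. It assumes the reported average is a plain arithmetic mean of per-model throughput, which the benchmark description does not confirm. The function names and the three timings are hypothetical illustrations, not measured results.

```python
from statistics import mean


def records_per_hour(records: int, elapsed_seconds: float) -> float:
    """Convert one batch run into an hourly throughput figure."""
    return records * 3600 / elapsed_seconds


def average_throughput(records_per_file: int, elapsed_by_model: dict[str, float]) -> float:
    """Arithmetic mean of per-model throughput (an assumed averaging method)."""
    return mean(
        records_per_hour(records_per_file, seconds)
        for seconds in elapsed_by_model.values()
    )


# Hypothetical timings in seconds for a 10-million-record file.
# These are illustrative values, NOT numbers from the benchmark.
example_timings = {
    "regression": 100.0,
    "neural_network": 150.0,
    "decision_tree": 120.0,
}

avg = average_throughput(10_000_000, example_timings)
print(f"Average throughput: {avg:,.0f} records/hour")
```

At a rate of 300 million records per hour, a single 10-million-record file takes about two minutes to score, since `records_per_hour(10_000_000, 120)` works out to 300 million.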

We used ten different predictive models, including various regression, neural network, clustering, and decision tree models, which were created in several statistical tools and then exported in the Predictive Model Markup Language (PMML) standard. The PMML models were subsequently deployed and executed in the ADAPA Predictive Analytics Edition on Amazon EC2.

On the fastest instance (Amazon type High-CPU XL), ADAPA scored on average over 300 million records in one hour. One hour of the High-CPU XL instance costs US$2.49 (two dollars and forty-nine cents), plus a few cents for data transfer; all in all, it adds up to less than $3 for the task.
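
The arithmetic behind that claim can be sketched in a few lines. The hourly rate, the throughput, and the $3 ceiling come from the figures above; the function and constant names are hypothetical, and because the exact data transfer charge is not stated, the sketch brackets the cost between the compute-only rate and the $3 upper bound.

```python
def cost_per_million(hourly_cost_usd: float, records_per_hour: float) -> float:
    """Cost in US dollars to score one million records at a given throughput."""
    return hourly_cost_usd / records_per_hour * 1_000_000


INSTANCE_RATE_USD = 2.49         # High-CPU XL hourly price quoted in the article
RECORDS_PER_HOUR = 300_000_000   # average throughput quoted in the article (lower bound)
TOTAL_CEILING_USD = 3.00         # "less than $3" including data transfer

# Compute-only cost, ignoring the unspecified data transfer charge.
compute_only = cost_per_million(INSTANCE_RATE_USD, RECORDS_PER_HOUR)

# Upper bound using the article's "less than $3" total.
upper_bound = cost_per_million(TOTAL_CEILING_USD, RECORDS_PER_HOUR)

print(f"Compute only: ${compute_only:.4f} per million records")
print(f"Upper bound:  ${upper_bound:.4f} per million records")
```

Either way, the cost comes to roughly one cent or less per million records scored.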

In addition to raw processing performance for scoring data, note that ADAPA remarkably accelerates the deployment and integration of predictive analytics. While it is possible to scale processing speed with additional hardware, deployment and integration are the real bottlenecks for projects. Only a framework that leverages open standards for interoperability provides the agility required for proper management and deployment of predictive models.

With cloud computing and Software as a Service (SaaS), ADAPA delivers an unprecedented cost/performance ratio for implementing predictive analytics across the enterprise. Sign up for ADAPA on Amazon EC2 instantly and start using it in just a few minutes! Starting at $1 per hour for a small instance, with no long-term commitment required, you can experience for yourself what ADAPA does for your predictive models without breaking the bank. Use your own models or try ADAPA with our PMML model examples.

Comprehensive blog featuring topics related to predictive analytics with an emphasis on open standards, Predictive Model Markup Language (PMML), cloud computing, as well as the deployment and integration of predictive models in any business process.
