Cookies help us display personalized product recommendations and ensure you have great shopping experience.

By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
SmartData CollectiveSmartData Collective
  • Analytics
    AnalyticsShow More
    data driven insights
    How Data-Driven Insights Are Addressing Gaps in Patient Communication and Equity
    8 Min Read
    pexels pavel danilyuk 8112119
    Data Analytics Is Revolutionizing Medical Credentialing
    8 Min Read
    data and seo
    Maximize SEO Success with Powerful Data Analytics Insights
    8 Min Read
    data analytics for trademark registration
    Optimizing Trademark Registration with Data Analytics
    6 Min Read
    data analytics for finding zip codes
    Unlocking Zip Code Insights with Data Analytics
    6 Min Read
  • Big Data
  • BI
  • Exclusive
  • IT
  • Marketing
  • Software
Search
© 2008-25 SmartData Collective. All Rights Reserved.
Reading: How to Score 300,000,000 Customer Records for $3
Share
Notification
Font ResizerAa
SmartData CollectiveSmartData Collective
Font ResizerAa
Search
  • About
  • Help
  • Privacy
Follow US
© 2008-23 SmartData Collective. All Rights Reserved.
SmartData Collective > Analytics > Predictive Analytics > How to Score 300,000,000 Customer Records for $3
Predictive Analytics

How to Score 300,000,000 Customer Records for $3

MichaelZeller
MichaelZeller
5 Min Read
SHARE
Cloud computing promises lower cost and higher scalability. Translating this to a real-world practical application for predictive analytics, here is what this means for you in simple facts and numbers. With the Zementis ADAPA scoring engine on the Amazon Elastic Compute Cloud, you can score over 300 million (!) records for about $3, all in less than one hour.

Performance and scalability have been key design principles for ADAPA, in addition to open standards and Service Oriented Architecture (SOA). To illustrate this in a real-world benchmark, we measured the batch scoring performance for different Amazon EC2 instance types. Because computational efforts vary across different model types, we report only the average numbers measured for a collection of ten (10) different predictive models, each based on processing a data file containing 10 million records.

Figure: Average number of records processed per hour for each Amazon EC2 instance type. The average is based on 10 different PMML models, with the fastest instance scoring over 300 million records per hour.

We used ten different predictive models, including various regression models, neural network, clustering and decision tree …

More Read

A video introduction to R for Excel users
A Shortcut Guide to Machine Learning and AI in The Enterprise
Advanced analytics, particularly predictive and statistical…
You Say Tomato, I Say Protocol
Integrating Predictive Analytics and BRM to Improve Health Plan Member Experience

Cloud computing promises lower cost and higher scalability. Translating this to a real-world practical application for predictive analytics, here is what this means for you in simple facts and numbers. With the Zementis ADAPA scoring engine on the Amazon Elastic Compute Cloud, you can score over 300 million (!) records for about $3, all in less than one hour.

Performance and scalability have been key design principles for ADAPA, in addition to open standards and Service Oriented Architecture (SOA). To illustrate this in a real-world benchmark, we measured the batch scoring performance for different Amazon EC2 instance types. Because computational efforts vary across different model types, we report only the average numbers measured for a collection of ten (10) different predictive models, each based on processing a data file containing 10 million records.

Figure: Average number of records processed per hour for each Amazon EC2 instance type. The average is based on 10 different PMML models, with the fastest instance scoring over 300 million records per hour.

We used ten different predictive models, including various regression models, neural network, clustering and decision tree algorithms which were created in several statistical tools and then exported in the Predictive Model Markup Language (PMML) standard. The PMML models subsequently were deployed and executed in the ADAPA Predictive Analytics Edition on Amazon EC2.

The fastest instance (Amazon type High-CPU XL), ADAPA scored on average over 300 million records in one hour. One hour of the High-CPU XL instance costs US$2.49 (two dollars and forty nine cents), plus a few cents for the data transfer; all in all, it adds up to less than $3 for the task.

In addition to raw processing performance for scoring data, note that ADAPA remarkable accelerates the speed of deployment and integration for predictive analytics. While it is possible to scale processing speed with additional hardware, deployment and integration are the real bottlenecks for projects. Only a framework that leverages open standards for interoperability provides the necessary agility required for proper management and deployment of predictive models.

With cloud computing and Software as a Service (SaaS), ADAPA delivers an unprecedented cost/performance ratio for implementing predictive analytics across the enterprise. Sign up for ADAPA on Amazon EC2 instantly and start using it in just a few minutes! Starting at $1 per hour for a small instance and no long-term commitment required, experience for yourself what ADAPA does for your predictive models without breaking the bank. Use your own models or try ADAPA with our PMML model examples.

Comprehensive blog featuring topics related to predictive analytics with an emphasis on open standards, Predictive Model Markup Language (PMML), cloud computing, as well as the deployment and integration of predictive models in any business process.

Link to original post

TAGGED:adapaamazon ec2cloudpmml
Share This Article
Facebook Pinterest LinkedIn
Share

Follow us on Facebook

Latest News

langgraph and genai
LangGraph Orchestrator Agents: Streamlining AI Workflow Automation
Artificial Intelligence Exclusive
ai fitness app
Will AI Replace Personal Trainers? A Data-Driven Look at the Future of Fitness Careers
Artificial Intelligence Big Data Exclusive
crypto marketing
How a Crypto Marketing Agency Can Use AI to Create Powerful Native Advertising Strategies
Blockchain Exclusive Marketing
data driven insights
How Data-Driven Insights Are Addressing Gaps in Patient Communication and Equity
Analytics Big Data Exclusive

Stay Connected

1.2kFollowersLike
33.7kFollowersFollow
222FollowersPin

You Might also Like

The latest ACM SIGKDD Explorations Newsletter is out. Focus on open source analytics and PMML

3 Min Read

Big Data Without Integration Is Broken

7 Min Read
big data trends 2013
AnalyticsBig Data

Four Off-the-Beaten-Path Big Data Predictions for 2013

8 Min Read

Scorecards in PMML: A Primer

8 Min Read

SmartData Collective is one of the largest & trusted community covering technical content about Big Data, BI, Cloud, Analytics, Artificial Intelligence, IoT & more.

AI and chatbots
Chatbots and SEO: How Can Chatbots Improve Your SEO Ranking?
Artificial Intelligence Chatbots Exclusive
ai chatbot
The Art of Conversation: Enhancing Chatbots with Advanced AI Prompts
Chatbots

Quick Link

  • About
  • Contact
  • Privacy
Follow US
© 2008-25 SmartData Collective. All Rights Reserved.
Go to mobile version
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?