Cookies help us display personalized product recommendations and ensure you have great shopping experience.

By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
SmartData CollectiveSmartData Collective
  • Analytics
    AnalyticsShow More
    How Data Analytics Is Reshaping Patient Financing Decisions
    How Data Analytics Is Reshaping Patient Financing Decisions
    13 Min Read
    business using business intelligence
    How to Use a Competitive Intelligence Dashboard to Turn Market Data Into Smarter Marketing Decisions 
    9 Min Read
    unusual trading activity
    Signal Or Noise? A Decision Tree For Evaluating Unusual Trading Activity
    3 Min Read
    software developer using ai
    How Data Analytics Helps Developers Deliver Better Tech Services
    8 Min Read
    ai for stock trading
    Can Data Analytics Help Investors Outperform Warren Buffett
    9 Min Read
  • Big Data
  • BI
  • Exclusive
  • IT
  • Marketing
  • Software
Search
© 2008-25 SmartData Collective. All Rights Reserved.
Reading: HadoopDB discussion with Daniel Abadi
Share
Notification
Font ResizerAa
SmartData CollectiveSmartData Collective
Font ResizerAa
Search
  • About
  • Help
  • Privacy
Follow US
© 2008-23 SmartData Collective. All Rights Reserved.
SmartData Collective > Big Data > Data Warehousing > HadoopDB discussion with Daniel Abadi
Data Warehousing

HadoopDB discussion with Daniel Abadi

TonyBain
TonyBain
4 Min Read
SHARE


I spoke to Daniel Abadi a few days ago about his HadoopDB announcement that came out recently. I am sure this has been a busy time for Daniel and his team over in Yale as HadoopDB has been getting a lot of interest which I am sure will continue to build.

Some notes from our discussion:

  • HadoopDB is primarily focused on high scalability and the required availability at scale. Daniel questions current MPP’s ability to truly scale past 100 nodes whereas Hadoop has real examples on 3000+ nodes.
  • HadoopDB like many MPP analytical database platforms uses shared nothing relational database as processing units. HadoopDB uses Postgres. Unlike other MPP databases, HadoopDB uses Hadoop as the distributed mechanism.
  • I am ad libbing here, but I understand that Daniel doesn’t dispute DeWitt & Stonebrakers (and his) paper which claims Map/Reduce underperforms when compared to current MPP DBMS. HadoopDB, however, is focused on massive scale, hundreds or thousands of nodes.  Currently the largest MPP database we know of is 96 nodes.
  • Early benchmarking shows HadoopDB outperforms Hadoop but is slower than current MPP databases under normal circumstances. However, when …

I spoke to Daniel Abadi a few days ago about his HadoopDB announcement that came out recently. I am sure this has been a busy time for Daniel and his team over in Yale as HadoopDB has been getting a lot of interest which I am sure will continue to build.

Some notes from our discussion:

  • HadoopDB is primarily focused on high scalability and the required availability at scale. Daniel questions current MPP’s ability to truly scale past 100 nodes whereas Hadoop has real examples on 3000+ nodes.
  • HadoopDB like many MPP analytical database platforms uses shared nothing relational database as processing units. HadoopDB uses Postgres. Unlike other MPP databases, HadoopDB uses Hadoop as the distributed mechanism.
  • I am ad libbing here, but I understand that Daniel doesn’t dispute DeWitt & Stonebrakers (and his) paper which claims Map/Reduce underperforms when compared to current MPP DBMS. HadoopDB, however, is focused on massive scale, hundreds or thousands of nodes.  Currently the largest MPP database we know of is 96 nodes.
  • Early benchmarking shows HadoopDB outperforms Hadoop but is slower than current MPP databases under normal circumstances. However, when simulating node failure mid query HadoopDB outperformed current MPP databases significantly.
  • The higher the scalability the higher the possibility of node failure mid query. Very large Hadoop deployments may experience at least 1 node failure per query (job).
  • HadoopDB is usable today, but should not be considered an “out of the box” solution. HadoopDB is an outcome from a database research initiative, not a commercial venture.  Anyone planning to use HapoopDB will require the appropriate systems & development skills to effectively deploy.

HadoopDB is an innovative approach to the scalability challenges that continue to push the architecture of the modern database forward.

Related articles by Zemanta
  • Researchers Create Database-Hadoop Hybrid (tech.slashdot.org)
  • Yale researchers create database-Hadoop hybrid (computerworld.com)


Link to original post

TAGGED:hadoop
Share This Article
Facebook Pinterest LinkedIn
Share

Follow us on Facebook

Latest News

How Data Analytics Is Reshaping Patient Financing Decisions
How Data Analytics Is Reshaping Patient Financing Decisions
Analytics Big Data Exclusive
AI driven big data company
How AI-Driven Workflows Are Changing the Way Companies Think About Data Risk
Artificial Intelligence Data Management Exclusive Risk Management
ai product development
Why Businesses Outsource AI Product Development Companies
Exclusive News
banking tools
The Fintech and Banking Tools Global Entrepreneurs Rely On
Fintech Infographic

Stay Connected

1.2KFollowersLike
33.7KFollowersFollow
222FollowersPin

You Might also Like

hadoop big data
Big Data

Demystifying Hadoop: Not All Problems Are Hadoop-able

6 Min Read
Hadoop Tools SAP Hana
Big DataComputingHadoopNewsSoftware

How Hadoop Tools Shape SAP Hana’s Big Data Platform

5 Min Read

Are Public Clouds Complex Environments?

4 Min Read

Lots of Data Does Not Equal “Big Data”

7 Min Read

SmartData Collective is one of the largest & trusted community covering technical content about Big Data, BI, Cloud, Analytics, Artificial Intelligence, IoT & more.

ai in ecommerce
Artificial Intelligence for eCommerce: A Closer Look
Artificial Intelligence
ai chatbot
The Art of Conversation: Enhancing Chatbots with Advanced AI Prompts
Chatbots

Quick Link

  • About
  • Contact
  • Privacy
Follow US
© 2008-25 SmartData Collective. All Rights Reserved.
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?