By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
SmartData Collective
  • Analytics
    AnalyticsShow More
    data science anayst
    Growing Demand for Data Science & Data Analyst Roles
    6 Min Read
    predictive analytics in dropshipping
    Predictive Analytics Helps New Dropshipping Businesses Thrive
    12 Min Read
    data-driven approach in healthcare
    The Importance of Data-Driven Approaches to Improving Healthcare in Rural Areas
    6 Min Read
    analytics for tax compliance
    Analytics Changes the Calculus of Business Tax Compliance
    8 Min Read
    big data analytics in gaming
    The Role of Big Data Analytics in Gaming
    10 Min Read
  • Big Data
  • BI
  • Exclusive
  • IT
  • Marketing
  • Software
Search
© 2008-23 SmartData Collective. All Rights Reserved.
Reading: Hadoop: A Storage Platform as Well as Analysis Tool?
Share
Notification Show More
Latest News
ai in automotive industry
AI Is Changing the Automotive Industry Forever
Artificial Intelligence
SMEs Use AI-Driven Financial Software for Greater Efficiency
Artificial Intelligence
data security in big data age
6 Reasons to Boost Data Security Plan in the Age of Big Data
Big Data
data science anayst
Growing Demand for Data Science & Data Analyst Roles
Data Science
ai software development
Key Strategies to Develop AI Software Cost-Effectively
Artificial Intelligence
Aa
SmartData Collective
Aa
Search
  • About
  • Help
  • Privacy
Follow US
© 2008-23 SmartData Collective. All Rights Reserved.
SmartData Collective > Software > Hadoop > Hadoop: A Storage Platform as Well as Analysis Tool?
Big DataData ManagementHadoopITMapReduce

Hadoop: A Storage Platform as Well as Analysis Tool?

MicheleNemschoff
Last updated: 2013/12/11 at 5:17 PM
MicheleNemschoff
4 Min Read
Image
SHARE

ImageData warehouses are a critical component for enterprises seeking to gain insights from the data they collect, but as the volume of data businesses collect continues to grow, the traditional data warehouse is increasingly becoming too expensive to maintain.

ImageData warehouses are a critical component for enterprises seeking to gain insights from the data they collect, but as the volume of data businesses collect continues to grow, the traditional data warehouse is increasingly becoming too expensive to maintain. On top of this, the majority of data being created today is unstructured data, which a traditional database is unable to collect and store unless the data is converted into a structured form. Due to this, many businesses have turned to Apache Hadoop as a long-term storage and ETL tool. While many articles have been written acknowledging Hadoop’s value as an analysis platform for Big Data, it is also worthy of consideration as a storage platform, i.e. a data hub. Here’s why.

A More Affordable Option

As mentioned above, storing large amounts of data in a traditional database becomes increasingly too expensive. The average data warehouse requires $50k- $100k per TB of data. With data from tweets, emails, and Facebook posts as well as Machine-generated log files, sensor and clickstream data pouring in at an exponential rate, the cost of transforming and storing this data is huge, not to mention the cost of expanding the warehouse’s hardware to increase capacity.

More Read

data security in big data age

6 Reasons to Boost Data Security Plan in the Age of Big Data

Growing Demand for Data Science & Data Analyst Roles
How Big Data Is Transforming the Maritime Industry
Top Tools for Your Cloud Data Security Stack in 2023
Boosting Your Chances for Landing a Job as a Data Scientist

To reduce storage costs, many companies store only samples of their raw data based on pre-determined assumptions or priorities. This means that as priorities change or new business questions come up the raw data is no longer available to analyze, leaving room for costly mistakes and missed opportunities that the data could make visible.

Hadoop, on the other hand, stores data at less than $1K per TB, a cost savings of 50x-100x. In addition, Hadoop’s scalability keeps the size of the data storage system in check, further reducing the cost of data storage.

Data Pre-Processing

In discussion about Hadoop, some have started to refer to Hadoop as the big data hub. In other words, Hadoop is seen as a common repository for all types of data: structured, semi-structured and unstructured. Hadoop is used as a landing zone of sorts to collect all of the data being produced before it is sorted into the traditional database or left as is for long-term storage. This is incredibly valuable as there is a wealth of business insights within multi-structured data that Hadoop now allows companies to extract.

Flexibility 

Hadoop is an open source project, but some vendors, like MapR have started adding innovations on top of the core which has made Hadoop completely enterprise-ready and appropriate to use for a repository for all its data. The enterprise-grade version of Hadoop offers enhanced security, full data protection and disaster recovery, as well as rolling upgrades.   

As Hadoop becomes increasingly more sophisticated, businesses will have hard time looking past its ability to save the company money and to store, analyze and compute all types of data as they consider their data storage options. It seems Hadoop may have not only rocked the boat when it comes to data analytics but also how that data is stored in the first place.


MicheleNemschoff December 11, 2013
Share this Article
Facebook Twitter Pinterest LinkedIn
Share

Follow us on Facebook

Latest News

ai in automotive industry
AI Is Changing the Automotive Industry Forever
Artificial Intelligence
SMEs Use AI-Driven Financial Software for Greater Efficiency
Artificial Intelligence
data security in big data age
6 Reasons to Boost Data Security Plan in the Age of Big Data
Big Data
data science anayst
Growing Demand for Data Science & Data Analyst Roles
Data Science

Stay Connected

1.2k Followers Like
33.7k Followers Follow
222 Followers Pin

You Might also Like

data security in big data age
Big Data

6 Reasons to Boost Data Security Plan in the Age of Big Data

7 Min Read
data science anayst
Data Science

Growing Demand for Data Science & Data Analyst Roles

6 Min Read
How Big Data Is Transforming the Maritime Industry
Big Data

How Big Data Is Transforming the Maritime Industry

8 Min Read
cloud data security in 2023
Cloud Computing

Top Tools for Your Cloud Data Security Stack in 2023

7 Min Read

SmartData Collective is one of the largest & trusted community covering technical content about Big Data, BI, Cloud, Analytics, Artificial Intelligence, IoT & more.

data-driven web design
5 Great Tips for Using Data Analytics for Website UX
Big Data
ai in ecommerce
Artificial Intelligence for eCommerce: A Closer Look
Artificial Intelligence

Quick Link

  • About
  • Contact
  • Privacy
Follow US

© 2008-23 SmartData Collective. All Rights Reserved.

Removed from reading list

Undo
Go to mobile version
Welcome Back!

Sign in to your account

Lost your password?