Cookies help us display personalized product recommendations and ensure you have great shopping experience.

By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
SmartData CollectiveSmartData Collective
  • Analytics
    AnalyticsShow More
    image fx (60)
    Data Analytics Driving the Modern E-commerce Warehouse
    13 Min Read
    big data analytics in transporation
    Turning Data Into Decisions: How Analytics Improves Transportation Strategy
    3 Min Read
    sales and data analytics
    How Data Analytics Improves Lead Management and Sales Results
    9 Min Read
    data analytics and truck accident claims
    How Data Analytics Reduces Truck Accidents and Speeds Up Claims
    7 Min Read
    predictive analytics for interior designers
    Interior Designers Boost Profits with Predictive Analytics
    8 Min Read
  • Big Data
  • BI
  • Exclusive
  • IT
  • Marketing
  • Software
Search
© 2008-25 SmartData Collective. All Rights Reserved.
Reading: Considering the Data Diet
Share
Notification
Font ResizerAa
SmartData CollectiveSmartData Collective
Font ResizerAa
Search
  • About
  • Help
  • Privacy
Follow US
© 2008-23 SmartData Collective. All Rights Reserved.
SmartData Collective > Big Data > Data Mining > Considering the Data Diet
Data MiningData Warehousing

Considering the Data Diet

TeradataExperts
TeradataExperts
6 Min Read
SHARE

I had the unique pleasure of meeting with Pete Fader and Eric Bradlow from the Wharton Interactive Media Initiative last week in Philadelphia. And while it wasn’t the point of the meeting I was really interested in their work about companies going on a data diet. While this is somewhat antithetical to the historical position of data warehousing – where every bit of data should be kept for as long as possible, I think they are doing important work in this space.

Not because I think companies need less data, but because they are going to be getting more and more data. With the emerging Web Squared environment where world wide web meshes with devices, sensors, geospatial and temporal data and the devices interact with this mesh to transform not just the consumer experience but the entire value chain. The volume of data that can be analyzed is not going to be nearly as important as the critical data to the problem you are trying to solve.

As Melyssa Plunket-Gomez from Crimson Hexagon showed me with Social Media data, a lot of what needs to be analyzed is what you exclude. When looking at blog traffic for sentiment about a company you have exclude stuff like job postings because they . …

I had the unique pleasure of meeting with Pete Fader and Eric Bradlow from the Wharton Interactive Media Initiative last week in Philadelphia. And while it wasn’t the point of the meeting I was really interested in their work about companies going on a data diet. While this is somewhat antithetical to the historical position of data warehousing – where every bit of data should be kept for as long as possible, I think they are doing important work in this space.

More Read

Software Dependence & Model Accuracy
Business Intelligence or Business Analytics?
Austinites Really Love Music & Kevin Durant is Kind of a Big Deal, So Says Data Science
What Trends Are In Store For Data Storage?
Predictions 2009 – John Battelle’s Searchblog

Not because I think companies need less data, but because they are going to be getting more and more data. With the emerging Web Squared environment where world wide web meshes with devices, sensors, geospatial and temporal data and the devices interact with this mesh to transform not just the consumer experience but the entire value chain. The volume of data that can be analyzed is not going to be nearly as important as the critical data to the problem you are trying to solve.

As Melyssa Plunket-Gomez from Crimson Hexagon showed me with Social Media data, a lot of what needs to be analyzed is what you exclude. When looking at blog traffic for sentiment about a company you have exclude stuff like job postings because they are irrelevant to the analysis even though they have all the right keywords. I think this is a perfect example of watching what you analyze – a critical part of any data diet.

And while I think the data diet statement can be too simple, I think the concept has an important role to play in how companies utilize tools like Teradata’s multi-temperature data warehousing solutions. Which data is hot and needs high performance disks and chips and which data is not needed all the time and can be moved to lower cost storage? A data diet approach can be useful in determining how to invest in data.

One of the challenges I see for data geeks in the coming years is how to decide what data you need to answer the questions that need to be answered tomorrow? I don’t don’t know what that question is and I don’t know what data I need. There is a big cost to not having the information you need when you need it. A data diet risks summarizing the value out of the data and eliminating data you need, but you don’t know it yet. Sometimes the most expensive data is the data you threw away.

Paul Barrett

—————
A couple follow-ups to earlier posts:

Consumers Taking Control > The district manager of the book store that didn’t get my money invited me to come to the store and meet and have a coffee. I am looking forward to it – I will probably make him spring for an expensive mocha instead of my usual value priced Iced Double Espresso – under $2 almost anywhere.

Millenials have never lived in a Batch World > I am getting ready for my presentation “Customer Time is Now Real-Time” at the Teradata Partners conference, coming up in October. This presentation is about how the confluence of the real-time web and a young generation of consumers are changing the rules for businesses. If you have comments or thoughts on Millenials or the real-time web and how it is impacting your business, please drop me a note at paul.barrett@teradata.com.  

Link to original post

Share This Article
Facebook Pinterest LinkedIn
Share

Follow us on Facebook

Latest News

Why the AI Race Is Being Decided at the Dataset Level
Why the AI Race Is Being Decided at the Dataset Level
Artificial Intelligence Big Data Exclusive
image fx (60)
Data Analytics Driving the Modern E-commerce Warehouse
Analytics Big Data Exclusive
ai for building crypto banks
Building Your Own Crypto Bank with AI
Blockchain Exclusive
julia taubitz vn5s g5spky unsplash
Benefits of AI in Nursing Education Amid Medicaid Cuts
Artificial Intelligence Exclusive News

Stay Connected

1.2kFollowersLike
33.7kFollowersFollow
222FollowersPin

You Might also Like

Information Management Technology Revolution and Research Agenda for 2012

7 Min Read

When the customer knocks you need decision management not operational BI

5 Min Read

Nice article on EDM

2 Min Read

The dictatorship of the analysts

13 Min Read

SmartData Collective is one of the largest & trusted community covering technical content about Big Data, BI, Cloud, Analytics, Artificial Intelligence, IoT & more.

ai in ecommerce
Artificial Intelligence for eCommerce: A Closer Look
Artificial Intelligence
ai chatbot
The Art of Conversation: Enhancing Chatbots with Advanced AI Prompts
Chatbots

Quick Link

  • About
  • Contact
  • Privacy
Follow US
© 2008-25 SmartData Collective. All Rights Reserved.
Go to mobile version
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?