Cookies help us display personalized product recommendations and ensure you have great shopping experience.

By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
SmartData CollectiveSmartData Collective
  • Analytics
    AnalyticsShow More
    image fx (67)
    Improving LinkedIn Ad Strategies with Data Analytics
    9 Min Read
    big data and remote work
    Data Helps Speech-Language Pathologists Deliver Better Results
    6 Min Read
    data driven insights
    How Data-Driven Insights Are Addressing Gaps in Patient Communication and Equity
    8 Min Read
    pexels pavel danilyuk 8112119
    Data Analytics Is Revolutionizing Medical Credentialing
    8 Min Read
    data and seo
    Maximize SEO Success with Powerful Data Analytics Insights
    8 Min Read
  • Big Data
  • BI
  • Exclusive
  • IT
  • Marketing
  • Software
Search
© 2008-25 SmartData Collective. All Rights Reserved.
Reading: Big Data: Smaller is Better
Share
Notification
Font ResizerAa
SmartData CollectiveSmartData Collective
Font ResizerAa
Search
  • About
  • Help
  • Privacy
Follow US
© 2008-23 SmartData Collective. All Rights Reserved.
SmartData Collective > Big Data > Data Warehousing > Big Data: Smaller is Better
AnalyticsData WarehousingUnstructured Data

Big Data: Smaller is Better

Brett Stupakevich
Brett Stupakevich
4 Min Read
SHARE

Big data: keep it small, stupid.

Big data: keep it small, stupid.

That’s the advice of lead Forrester advanced analytics analyst James Kobielus (@jameskobielus), who says that as data scientists move deeper into big data territory, they have to be sure they don’t drown in too much useless information. If you’re a data scientist take heed: it’s easier to make sense out of all that data, if you keep your data sample small and manageable.

More Read

It’s Just a Little More Disk Space
Using Data Analysis to Improve and Verify the Customer Experience and Bad Reviews
Worst Practices While Deploying a Predictive Model
Never Stop Expecting More from Your Unstructured Data
The Role of Big Data Analytics in Gaming

In the past, data scientists have had to be satisfied with analyzing “mere samples.” They haven’t been able to collect “petabytes” of data on “every relevant variable of every entity in the population under study.”

Until now.

Thanks to the big data revolution these limitations no longer exist. Data scientists now have access to more comprehensive data sets, enabling them to more quickly determine the answers to business questions that require detailed, interactive, multidimensional statistical analysis.

Kobielus says to think of this new model as “whole-population analytics,” rather than just the ability to pivot, drill, and crunch into larger data sets.

“Over time, as the world evolves toward massively parallel approaches such as Hadoop, we will be able to do true 360-degree analysis,” he says.

For instance, as people around the world continue to engage in social networking and conduct more of their lives in public online forums, data scientists will have access to more comprehensive, current, and detailed market intelligence on every possible demographic.

But beware: big data can mean big trouble if you’re not careful about how you approach it.

For one thing, as your company’s analytics initiatives rapidly grow, you’re going to max out your IT budget on storage if you don’t keep the data as compact, compressed, and storage-efficient as possible, Kobielus says.

Not only that, but your users will be overwhelmed by the massive amounts of information they have to wade through if you don’t deliver the information they need to their tablets, smartphones, and other devices so they can act on it quickly.

So all you data scientists out there, listen to Kobielus and don’t give in to the temptation to throw more data at every analytic challenge. More often than not, you only need tiny, representative samples to find the most relevant patterns.

In fact, sometimes, you only need that one crucial observation or one piece of data to deliver the key insight. And quite often all you’ll need is gut feel, instinct, or intuition to solve some really difficult problem.

“New data may be redundant at best, or a distraction at worst, when you’re trying to collect your thoughts,” Kobielus says.

So it’s worth repeating—when it comes to big data: keep it small, stupid (no offense).

Image Courtesy of Idaho National Laboratory via Flickr

—

Author: Linda Rosencrance
Spotfire blogger *

TAGGED:big data
Share This Article
Facebook Pinterest LinkedIn
Share

Follow us on Facebook

Latest News

image fx (2)
Monitoring Data Without Turning into Big Brother
Big Data Exclusive
image fx (71)
The Power of AI for Personalization in Email
Artificial Intelligence Exclusive Marketing
image fx (67)
Improving LinkedIn Ad Strategies with Data Analytics
Analytics Big Data Exclusive Software
big data and remote work
Data Helps Speech-Language Pathologists Deliver Better Results
Analytics Big Data Exclusive

Stay Connected

1.2kFollowersLike
33.7kFollowersFollow
222FollowersPin

You Might also Like

From BI to Enterprise IT Integration

5 Min Read
future of franchises
Big DataInfographic

Will Big Data Change The Future Of Franchises Forever?

5 Min Read
conversion rate optimization strategies
Big DataExclusive

3 Data-Driven Elements Of Conversion Rate Optimization Strategies

6 Min Read

Digital Reasoning’s Synthesys

3 Min Read

SmartData Collective is one of the largest & trusted community covering technical content about Big Data, BI, Cloud, Analytics, Artificial Intelligence, IoT & more.

data-driven web design
5 Great Tips for Using Data Analytics for Website UX
Big Data
AI chatbots
AI Chatbots Can Help Retailers Convert Live Broadcast Viewers into Sales!
Chatbots

Quick Link

  • About
  • Contact
  • Privacy
Follow US
© 2008-25 SmartData Collective. All Rights Reserved.
Go to mobile version
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?