© 2008-25 SmartData Collective. All Rights Reserved.
Big Data: Smaller is Better

Brett Stupakevich

Big data: keep it small, stupid.

That’s the advice of lead Forrester advanced analytics analyst James Kobielus (@jameskobielus), who says that as data scientists move deeper into big data territory, they have to be sure they don’t drown in too much useless information. If you’re a data scientist, take heed: it’s easier to make sense of all that data if you keep your data sample small and manageable.

In the past, data scientists have had to be satisfied with analyzing “mere samples.” They haven’t been able to collect “petabytes” of data on “every relevant variable of every entity in the population under study.”

Until now.

Thanks to the big data revolution, these limitations no longer exist. Data scientists now have access to more comprehensive data sets, enabling them to more quickly determine the answers to business questions that require detailed, interactive, multidimensional statistical analysis.

Kobielus says to think of this new model as “whole-population analytics,” rather than just the ability to pivot, drill, and crunch into larger data sets.

“Over time, as the world evolves toward massively parallel approaches such as Hadoop, we will be able to do true 360-degree analysis,” he says.

For instance, as people around the world continue to engage in social networking and conduct more of their lives in public online forums, data scientists will have access to more comprehensive, current, and detailed market intelligence on every possible demographic.

But beware: big data can mean big trouble if you’re not careful about how you approach it.

For one thing, as your company’s analytics initiatives rapidly grow, you’re going to max out your IT budget on storage if you don’t keep the data as compact, compressed, and storage-efficient as possible, Kobielus says.

Not only that, but your users will be overwhelmed by the sheer volume of information they have to wade through unless you deliver just the information they need to their tablets, smartphones, and other devices, so they can act on it quickly.

So all you data scientists out there, listen to Kobielus and don’t give in to the temptation to throw more data at every analytic challenge. More often than not, you only need tiny, representative samples to find the most relevant patterns.
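To make the sampling point concrete, here is a minimal Python sketch, not anything Kobielus describes himself. It assumes a synthetic "population" of 200,000 hypothetical order values and a helper named `sample_mean_error` (both invented for illustration), and it compares the average of a 1% simple random sample with the average of the full data set.

```python
import random
import statistics


def sample_mean_error(population, sample_size, seed=0):
    """Return (population_mean, sample_mean, relative_error) for a simple random sample."""
    rng = random.Random(seed)
    # Draw rows without replacement so every row is equally likely to be picked.
    sample = rng.sample(population, sample_size)
    pop_mean = statistics.fmean(population)
    samp_mean = statistics.fmean(sample)
    rel_error = abs(samp_mean - pop_mean) / abs(pop_mean)
    return pop_mean, samp_mean, rel_error


# Synthetic population: 200,000 hypothetical order values (assumed, not real data).
data_rng = random.Random(42)
population = [data_rng.gauss(100.0, 20.0) for _ in range(200_000)]

# Analyze only 2,000 rows, which is 1% of the population.
pop_mean, samp_mean, rel_error = sample_mean_error(population, sample_size=2_000)
print(f"population mean: {pop_mean:.2f}")
print(f"sample mean (1% of rows): {samp_mean:.2f}")
print(f"relative error: {rel_error:.2%}")
```

For a simple aggregate like an average, the small sample should land very close to the full-population figure at a fraction of the storage and compute cost. Rare events or narrow segments may still call for larger or stratified samples, so treat this as an illustration of the idea rather than a rule.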

In fact, sometimes, you only need that one crucial observation or one piece of data to deliver the key insight. And quite often all you’ll need is gut feel, instinct, or intuition to solve some really difficult problem.

“New data may be redundant at best, or a distraction at worst, when you’re trying to collect your thoughts,” Kobielus says.

So it’s worth repeating: when it comes to big data, keep it small, stupid (no offense).

Image Courtesy of Idaho National Laboratory via Flickr

—

Author: Linda Rosencrance
Spotfire blogger *
