Yes, you need more than just R for Big Data Analytics.

DavidMSmith

Douglas Merrill, former CIO/VP of Engineering at Google, writes in Forbes about using the R language for data analysis:

Most folks with math-oriented graduate degrees will have written something in R, a non-commercial option for your big data analysis.  So, great graduates from great graduate schools know great tools.

His post is titled ‘R Is Not Enough For “Big Data”’, and you might be surprised to learn that I agree with that title, although for a different reason. Douglas’s point (and it’s a valid one) is that simply pumping data through any software tool, without an understanding of the problem you’re trying to solve and how statistical models apply to it, can lead to getting the wrong answers to the wrong questions:

If you ask the wrong question, you will be able to find statistics that give answers that are simply wrong (or, at best, misleading).

On net, having a degree in math, economics, AI, etc., isn’t enough. Tool expertise isn’t enough. You need experience in solving real world problems, because there are a lot of important limitations to the statistics that you learned in school. Big data isn’t about bits, it’s about talent.

This is a great illustration of why the data science process is a valuable one for extracting information from Big Data, because it combines tool expertise with statistical expertise and the domain expertise required to understand the problem and the data applicable to it. He’s right that you need data science talent and software to solve problems with Big Data … and having software like R that supports the exploratory nature of the data science process is also critical.

But I also agree with the title for a different, technical reason: the R software is just one piece of a software ecosystem (an analytics stack, if you will) of tools used to analyze Big Data. For one thing, R isn’t a data store in its own right: you also need a data layer where R can access structured and unstructured data for analysis. (For example, see how you can use R to extract data from Hadoop in the slides from today’s webinar by Antonio Piccolboni.) At the analytics layer, you need statistical algorithms that work with Big Data, like those in Revolution R Enterprise. And at the presentation layer, you need the ability to embed the results of the analysis in reports, BI tools, or data apps.
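To make the three layers concrete, here is a minimal sketch of the separation, not a description of any particular product. Everything in it is an assumption made for illustration: Python stands in for the analytics language, an in-memory SQLite table with a made-up `sales` schema stands in for the data layer, a chunk-at-a-time mean stands in for an algorithm that does not need the whole dataset in memory, and a plain-text table stands in for the presentation layer.

```python
import sqlite3
from typing import Dict, Iterable, Iterator, List, Tuple

Row = Tuple[str, float]


# Data layer: a hypothetical in-memory SQLite table standing in for a real data store.
def make_store() -> sqlite3.Connection:
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
    rows = [("east", 10.0), ("east", 14.0), ("west", 7.0), ("west", 9.0), ("west", 11.0)]
    conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)
    return conn


def read_chunks(conn: sqlite3.Connection, chunk_size: int = 2) -> Iterator[List[Row]]:
    """Yield rows a few at a time so the analysis never loads the full table."""
    cur = conn.execute("SELECT region, amount FROM sales")
    while True:
        chunk = cur.fetchmany(chunk_size)
        if not chunk:
            break
        yield chunk


# Analytics layer: a per-region mean computed from running counts and sums.
def mean_by_region(chunks: Iterable[List[Row]]) -> Dict[str, float]:
    totals: Dict[str, Tuple[int, float]] = {}
    for chunk in chunks:
        for region, amount in chunk:
            count, total = totals.get(region, (0, 0.0))
            totals[region] = (count + 1, total + amount)
    return {region: total / count for region, (count, total) in totals.items()}


# Presentation layer: render the result as a small plain-text report.
def render_report(means: Dict[str, float]) -> str:
    lines = ["region  mean_amount"]
    for region in sorted(means):
        lines.append(f"{region:<6}  {means[region]:.2f}")
    return "\n".join(lines)


if __name__ == "__main__":
    store = make_store()
    print(render_report(mean_by_region(read_chunks(store))))
```

The point of the sketch is only the division of labor: each function could be swapped for a real component (a Hadoop extract, a Big Data regression routine, a BI dashboard) without changing the other two.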

So yes, Douglas is right: you need more than just R for Big Data. You also need a data layer, an analytics layer, and a presentation layer (all of which support Big Data) … and you need Data Science skills to make sure you’re asking the right questions and getting appropriate answers.

Forbes: R Is Not Enough For “Big Data”
