SmartData Collective

What Is a Data Scientist (and What Isn’t)?

nraden

The perception among organizations over the past five years is that more quantitative methods, with or without Big Data, are critical to success. The problem is that most commercial organizations have little to no depth in these disciplines. On the other hand, businesses where data and data products are their primary revenue stream have an abundance of talent in this area. Some, like Google or Amazon, employ hundreds of applied mathematicians and statisticians, in the same way that manufacturing companies employ mechanical or electrical engineers. Medical informatics, genomics, even intelligence and defense groups work on the bleeding edge of research into methods for classification, prediction and optimization. Because this work is rather unique, involving massive data volumes, unruly data formats and sources that are beyond the typical enterprise data flows, coupled with a broader understanding of the business or organization, a name for these professionals emerged: “Data Scientist.”

But the term “Data Scientist” is an over-reaching title.

Let’s look at how this actually plays out. The work is clearly divided between true scientists, those who research and create algorithms and methods, publish papers and actively participate in their discipline’s communications, and those who understand and employ quantitative methods, design, test and deploy models but do not create new science. I refer to these two as Type I and Type II respectively (in a forthcoming research report from Constellation Research, I go into much more detail and describe Types III and IV also). The former are truly scientists, the latter are not, though this is the group typically referred to as data scientists. There will be very few “data scientists” in commercial organizations. Data scientists work in research, academia and organizations where the production of new methods and algorithms is the core of the enterprise. Google, Amazon, Wall Street, etc. – these are companies whose scientists produce new methods in quantitative science and publish in peer-reviewed journals.

Although there is a prejudice for employing PhDs as Type IIs, it isn’t necessary.

Despite the unfortunate name given to this growing class of professionals (scientists they are not, in general), it does represent a new sort of role in organizations. Finding people to fill this role is difficult for all of the following reasons:

  • Varied types of data available and the resultant multitude of analyses that can be employed
  • A skill set that includes programming capability, quantitative methods, investigative and modeling orientation
  • The ability to understand what is meaningful and what is not
  • Sufficient domain knowledge, rather than being a quant-for-hire
  • The ability to communicate complex subjects to others who lack the background in the tools and methods employed

I mentioned engineers above.  Engineers come to work with a solid grounding in the area of their choice, but no real practical experience, and typically no experience at all in the business of their employer. They learn as they go. In fact, there is even a professional designation for engineers that demonstrates they have the skill, training and practitioner’s experience to be a senior engineer – Professional Engineer (usually abbreviated as PE).

Another model for recruiting and nurturing professionals for this role, instead of competing for a small pool of PhDs who may be overqualified and unfulfilled with the work, is the way insurance companies grow their own actuaries (full disclosure, I have an actuarial background). There are two major actuarial organizations, The Society of Actuaries and the Casualty Actuarial Society. Both organizations administer comprehensive (actually, sort of grueling) certification programs that start with most of an undergraduate math degree and proceed to all aspects of probability, statistics and the insurance business itself. The series of exams can take 5-10 years to complete, and most insurance companies offer time off for study as well as on-the-job mentoring. Two things about this are key: a Fellow in either society demonstrates not only a thorough grounding in quantitative methods but also, and perhaps even more importantly, a true understanding of the workings of the enterprise as well as the entire industry.

There are tons of gimmicky professional “certifications,” but actuarial fellowship, Professional Engineer certification, even CPA, are all rigorous, practitioner-oriented programs. Analytics is looming in importance and is deserving of something similar.

Companies can’t expect universities to provide this kind of education. It’s obvious that skill with data and analytics is central to the success of most, if not all, organizations. It’s time to get serious about it. Call them data scientists if you will, but you have to participate in their learning. They don’t grow on trees.

Some sort of legitimate professional certification is needed. But until then, companies need to take grooming and nurturing these professionals seriously.

Certified Data Scientist? I don’t think so. Fellow of Quantitative Analysts Society? Better. Data Alchemist? I like that, but it’s not really comprehensive enough. Let me have your suggestions.

This discussion continues on Twitter. Follow me there @neilraden
