By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
SmartData Collective
  • Analytics
    AnalyticsShow More
    predictive analytics in dropshipping
    Predictive Analytics Helps New Dropshipping Businesses Thrive
    12 Min Read
    data-driven approach in healthcare
    The Importance of Data-Driven Approaches to Improving Healthcare in Rural Areas
    6 Min Read
    analytics for tax compliance
    Analytics Changes the Calculus of Business Tax Compliance
    8 Min Read
    big data analytics in gaming
    The Role of Big Data Analytics in Gaming
    10 Min Read
    analyst,women,looking,at,kpi,data,on,computer,screen
    Promising Benefits of Predictive Analytics in Asset Management
    11 Min Read
  • Big Data
  • BI
  • Exclusive
  • IT
  • Marketing
  • Software
Search
© 2008-23 SmartData Collective. All Rights Reserved.
Reading: The Nature of Big Data and the Skills of Data Scientists
Share
Notification Show More
Latest News
ai digital marketing tools
Top Five AI-Driven Digital Marketing Tools in 2023
Artificial Intelligence
ai-generated content
Is AI-Generated Content a Net Positive for Businesses?
Artificial Intelligence
predictive analytics in dropshipping
Predictive Analytics Helps New Dropshipping Businesses Thrive
Predictive Analytics
cloud data security in 2023
Top Tools for Your Cloud Data Security Stack in 2023
Cloud Computing
become a data scientist
Boosting Your Chances for Landing a Job as a Data Scientist
Jobs
Aa
SmartData Collective
Aa
Search
  • About
  • Help
  • Privacy
Follow US
© 2008-23 SmartData Collective. All Rights Reserved.
SmartData Collective > Big Data > Data Mining > The Nature of Big Data and the Skills of Data Scientists
AnalyticsBig DataData MiningJobs

The Nature of Big Data and the Skills of Data Scientists

Ling Zhang
Last updated: 2012/12/05 at 12:05 PM
Ling Zhang
7 Min Read
SHARE

The job title Data Scientist was invented by DJ Patil and Jeff Hammerbacher when  they tried to name people in their data team who work on big data and they did not want to limit people’s functions because of improper job title like business analyst or research scientist Building Data Science Teams

The job title Data Scientist was invented by DJ Patil and Jeff Hammerbacher when  they tried to name people in their data team who work on big data and they did not want to limit people’s functions because of improper job title like business analyst or research scientist Building Data Science Teams

Ever since, the data scientist is becoming more and more popular with the big data becoming more critical to drive a successful business. However, some organizations still do not quite understand the roles that data scientists play and their responsibilities. It’s just like sometimes organizations do not know how to draw values from big data even though they are well convinced there are nuggets behind – their vision in using big data has actually blurred.

The nature of big data is defined by three Vs – Volume, Variety and Velocity. The roles and responsibilities of data scientists should be naturally determined by the nature of big data. First, as big data wears many hats, so does a data scientist who works on it. That means a data scientist has multiple roles and takes multiple responsibilities in an organization.

More Read

predictive analytics in dropshipping

Predictive Analytics Helps New Dropshipping Businesses Thrive

Boosting Your Chances for Landing a Job as a Data Scientist
Utilizing Data to Discover Shortcomings Within Your Business Model
Small Businesses Use Big Data to Offset Risk During Economic Uncertainty
The Importance of Data-Driven Approaches to Improving Healthcare in Rural Areas
  •  Experise in Diverse Technologies

In order to tackle the big volume of data, a big data platform such as Apache Hadoop or LexisNexis HPPC is required to process big data. A data scientist should have a package of knowledge around a big data platform so that they can proficiently tackle the big data on its platform. A data scientist should

1) Have a thorough understanding about the framework of a big data platform like DFS and MapReduce programming framework to deliver robust application designs. That means a data scientist should also have the knowledge about software architecture, compoent and design.

2) Be proficient with several programming languages supported by a big data platform like Java, Python, C++, or ECL, etc.

3) Have a good understanding about database technologies, especially, NoSQL database like HBase, CouchDB, etc.  Because a big data platform is usually communicating with databases to store variety of data format.

4) Be good expertise in math/statistics, machine learning and data mining fields.

The success of a business is not driven by the amount of data but rather driven by successfully finding and extracting interesting and novel patterns and relationship among data and use those gold values to develop the gold products – statistics, machine learning and data mining are great technologies used to understand data and dig out the nuggets from data. Naturally a data scientist must have the expertise in those fields for success. Skills to use some data mining tools or platform like R, Excel, SPSS and SAS is very critical, see Top Analytics and big data software tools

5) Be good at Natural Language Processing (NLP) software or tools – as most the content from big data are text based, news, social media and reports and comments, etc. Knowledge and master one or more NLP software or tools is very critical to the success as a data scientist.

6) Be skillful to one or more data visualization tools. In order to effectively demo the patterns and relationship mined from big data, be able to use some good visualization tools is definitely a plus to a data scientist. Here is a link of top 20 visualization tools.

  • Innovation – curiosity

As the velocity of data change is so fast, constantly there are new findings and problems, a data scientist should be sensitive to those changes, be curiosity to new findings and creative to tackle new problems. He or she should also be passionate to communicate them in a timely manner, explore new product ideas and solutions with the new findings and become a driver for product innovation.

  • Business Skills

First, the nature of wearing multiple hats as a data scientist drives the need for stronger communication skill. A data scientist has to communicate with diverse people in an organization that includes communicating and understanding business requirements, application requirements and interpret the patterns and relationships mined from data to people in marketing group, product development teams, and corporate executives. Effective communication is the key for a business to timely act on the new findings from big data. A data scientist should be a great collaborator and the hook of all.

Second, a data scientist needs great planning and organization skills so that he/she can skillfully handle multiple tasks and set up right priorities and guarantee timely delivery.

Third, a data scientist should have persuasive power, passion and story-telling skill to influence people to make the right decisions based on fact found in data and convince people the value of new findings. A data scientist in this sense is a leader to drive product innovation.

Overall the nature of big data defines the skills of data scientists and their roles in an organization.

Ling Zhang December 5, 2012
Share this Article
Facebook Twitter Pinterest LinkedIn
Share

Follow us on Facebook

Latest News

ai digital marketing tools
Top Five AI-Driven Digital Marketing Tools in 2023
Artificial Intelligence
ai-generated content
Is AI-Generated Content a Net Positive for Businesses?
Artificial Intelligence
predictive analytics in dropshipping
Predictive Analytics Helps New Dropshipping Businesses Thrive
Predictive Analytics
cloud data security in 2023
Top Tools for Your Cloud Data Security Stack in 2023
Cloud Computing

Stay Connected

1.2k Followers Like
33.7k Followers Follow
222 Followers Pin

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

[mc4wp_form id=”1616″]

You Might also Like

predictive analytics in dropshipping
Predictive Analytics

Predictive Analytics Helps New Dropshipping Businesses Thrive

12 Min Read
become a data scientist
Jobs

Boosting Your Chances for Landing a Job as a Data Scientist

9 Min Read
utlizing big data for business model
Big Data

Utilizing Data to Discover Shortcomings Within Your Business Model

6 Min Read
big data use in small businesses
Big Data

Small Businesses Use Big Data to Offset Risk During Economic Uncertainty

7 Min Read

SmartData Collective is one of the largest & trusted community covering technical content about Big Data, BI, Cloud, Analytics, Artificial Intelligence, IoT & more.

ai in ecommerce
Artificial Intelligence for eCommerce: A Closer Look
Artificial Intelligence
giveaway chatbots
How To Get An Award Winning Giveaway Bot
Big Data Chatbots Exclusive

Quick Link

  • About
  • Contact
  • Privacy
Follow US

© 2008-23 SmartData Collective. All Rights Reserved.

Removed from reading list

Undo
Go to mobile version
Welcome Back!

Sign in to your account

Lost your password?