Cookies help us display personalized product recommendations and ensure you have great shopping experience.

By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
SmartData CollectiveSmartData Collective
  • Analytics
    AnalyticsShow More
    predictive analytics risk management
    How Predictive Analytics Is Redefining Risk Management Across Industries
    7 Min Read
    data analytics and gold trading
    Data Analytics and the New Era of Gold Trading
    9 Min Read
    composable analytics
    How Composable Analytics Unlocks Modular Agility for Data Teams
    9 Min Read
    data mining to find the right poly bag makers
    Using Data Analytics to Choose the Best Poly Mailer Bags
    12 Min Read
    data analytics for pharmacy trends
    How Data Analytics Is Tracking Trends in the Pharmacy Industry
    5 Min Read
  • Big Data
  • BI
  • Exclusive
  • IT
  • Marketing
  • Software
Search
© 2008-25 SmartData Collective. All Rights Reserved.
Reading: The Nature of Big Data and the Skills of Data Scientists
Share
Notification
Font ResizerAa
SmartData CollectiveSmartData Collective
Font ResizerAa
Search
  • About
  • Help
  • Privacy
Follow US
© 2008-23 SmartData Collective. All Rights Reserved.
SmartData Collective > Big Data > Data Mining > The Nature of Big Data and the Skills of Data Scientists
AnalyticsBig DataData MiningJobs

The Nature of Big Data and the Skills of Data Scientists

Ling Zhang
Ling Zhang
7 Min Read
SHARE

The job title Data Scientist was invented by DJ Patil and Jeff Hammerbacher when  they tried to name people in their data team who work on big data and they did not want to limit people’s functions because of improper job title like business analyst or research scientist Building Data Science Teams

The job title Data Scientist was invented by DJ Patil and Jeff Hammerbacher when  they tried to name people in their data team who work on big data and they did not want to limit people’s functions because of improper job title like business analyst or research scientist Building Data Science Teams

Ever since, the data scientist is becoming more and more popular with the big data becoming more critical to drive a successful business. However, some organizations still do not quite understand the roles that data scientists play and their responsibilities. It’s just like sometimes organizations do not know how to draw values from big data even though they are well convinced there are nuggets behind – their vision in using big data has actually blurred.

The nature of big data is defined by three Vs – Volume, Variety and Velocity. The roles and responsibilities of data scientists should be naturally determined by the nature of big data. First, as big data wears many hats, so does a data scientist who works on it. That means a data scientist has multiple roles and takes multiple responsibilities in an organization.

More Read

Report from the 2012 Hadoop Summit
Basking in a dashboard’s warm glow
5 Data-Driven Amazon Ads Ideas to Skyrocket Sales
Predictive Analytic Strategies to Out-Predict the Competition
5 Ways Your Retail Banks Can Use Data to Better Serve Digital Natives
  •  Experise in Diverse Technologies

In order to tackle the big volume of data, a big data platform such as Apache Hadoop or LexisNexis HPPC is required to process big data. A data scientist should have a package of knowledge around a big data platform so that they can proficiently tackle the big data on its platform. A data scientist should

1) Have a thorough understanding about the framework of a big data platform like DFS and MapReduce programming framework to deliver robust application designs. That means a data scientist should also have the knowledge about software architecture, compoent and design.

2) Be proficient with several programming languages supported by a big data platform like Java, Python, C++, or ECL, etc.

3) Have a good understanding about database technologies, especially, NoSQL database like HBase, CouchDB, etc.  Because a big data platform is usually communicating with databases to store variety of data format.

4) Be good expertise in math/statistics, machine learning and data mining fields.

The success of a business is not driven by the amount of data but rather driven by successfully finding and extracting interesting and novel patterns and relationship among data and use those gold values to develop the gold products – statistics, machine learning and data mining are great technologies used to understand data and dig out the nuggets from data. Naturally a data scientist must have the expertise in those fields for success. Skills to use some data mining tools or platform like R, Excel, SPSS and SAS is very critical, see Top Analytics and big data software tools

5) Be good at Natural Language Processing (NLP) software or tools – as most the content from big data are text based, news, social media and reports and comments, etc. Knowledge and master one or more NLP software or tools is very critical to the success as a data scientist.

6) Be skillful to one or more data visualization tools. In order to effectively demo the patterns and relationship mined from big data, be able to use some good visualization tools is definitely a plus to a data scientist. Here is a link of top 20 visualization tools.

  • Innovation – curiosity

As the velocity of data change is so fast, constantly there are new findings and problems, a data scientist should be sensitive to those changes, be curiosity to new findings and creative to tackle new problems. He or she should also be passionate to communicate them in a timely manner, explore new product ideas and solutions with the new findings and become a driver for product innovation.

  • Business Skills

First, the nature of wearing multiple hats as a data scientist drives the need for stronger communication skill. A data scientist has to communicate with diverse people in an organization that includes communicating and understanding business requirements, application requirements and interpret the patterns and relationships mined from data to people in marketing group, product development teams, and corporate executives. Effective communication is the key for a business to timely act on the new findings from big data. A data scientist should be a great collaborator and the hook of all.

Second, a data scientist needs great planning and organization skills so that he/she can skillfully handle multiple tasks and set up right priorities and guarantee timely delivery.

Third, a data scientist should have persuasive power, passion and story-telling skill to influence people to make the right decisions based on fact found in data and convince people the value of new findings. A data scientist in this sense is a leader to drive product innovation.

Overall the nature of big data defines the skills of data scientists and their roles in an organization.

Share This Article
Facebook Pinterest LinkedIn
Share

Follow us on Facebook

Latest News

street address database
Why Data-Driven Companies Rely on Accurate Street Address Databases
Big Data Exclusive
predictive analytics risk management
How Predictive Analytics Is Redefining Risk Management Across Industries
Analytics Exclusive Predictive Analytics
data analytics and gold trading
Data Analytics and the New Era of Gold Trading
Analytics Big Data Exclusive
student learning AI
Advanced Degrees Still Matter in an AI-Driven Job Market
Artificial Intelligence Exclusive

Stay Connected

1.2kFollowersLike
33.7kFollowersFollow
222FollowersPin

You Might also Like

Counteracting Our Obsession with Speed

4 Min Read
data science use advertising
Big DataMarketing

The Low-Down On Using Data Science And Statistics In Advertising

6 Min Read

The Environmental Performance Index, Visualized with R

2 Min Read

Anti-terror software glitches?

4 Min Read

SmartData Collective is one of the largest & trusted community covering technical content about Big Data, BI, Cloud, Analytics, Artificial Intelligence, IoT & more.

ai in ecommerce
Artificial Intelligence for eCommerce: A Closer Look
Artificial Intelligence
AI and chatbots
Chatbots and SEO: How Can Chatbots Improve Your SEO Ranking?
Artificial Intelligence Chatbots Exclusive

Quick Link

  • About
  • Contact
  • Privacy
Follow US
© 2008-25 SmartData Collective. All Rights Reserved.
Go to mobile version
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?