Cookies help us display personalized product recommendations and ensure you have great shopping experience.

By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
SmartData CollectiveSmartData Collective
  • Analytics
    AnalyticsShow More
    unusual trading activity
    Signal Or Noise? A Decision Tree For Evaluating Unusual Trading Activity
    3 Min Read
    software developer using ai
    How Data Analytics Helps Developers Deliver Better Tech Services
    8 Min Read
    ai for stock trading
    Can Data Analytics Help Investors Outperform Warren Buffett
    9 Min Read
    media monitoring
    Signals In The Noise: Using Media Monitoring To Manage Negative Publicity
    5 Min Read
    data analytics
    How Data Analytics Can Help You Construct A Financial Weather Map
    4 Min Read
  • Big Data
  • BI
  • Exclusive
  • IT
  • Marketing
  • Software
Search
© 2008-25 SmartData Collective. All Rights Reserved.
Reading: The Nature of Big Data and the Skills of Data Scientists
Share
Notification
Font ResizerAa
SmartData CollectiveSmartData Collective
Font ResizerAa
Search
  • About
  • Help
  • Privacy
Follow US
© 2008-23 SmartData Collective. All Rights Reserved.
SmartData Collective > Big Data > Data Mining > The Nature of Big Data and the Skills of Data Scientists
AnalyticsBig DataData MiningJobs

The Nature of Big Data and the Skills of Data Scientists

Ling Zhang
Ling Zhang
7 Min Read
SHARE

The job title Data Scientist was invented by DJ Patil and Jeff Hammerbacher when  they tried to name people in their data team who work on big data and they did not want to limit people’s functions because of improper job title like business analyst or research scientist Building Data Science Teams

The job title Data Scientist was invented by DJ Patil and Jeff Hammerbacher when  they tried to name people in their data team who work on big data and they did not want to limit people’s functions because of improper job title like business analyst or research scientist Building Data Science Teams

Ever since, the data scientist is becoming more and more popular with the big data becoming more critical to drive a successful business. However, some organizations still do not quite understand the roles that data scientists play and their responsibilities. It’s just like sometimes organizations do not know how to draw values from big data even though they are well convinced there are nuggets behind – their vision in using big data has actually blurred.

The nature of big data is defined by three Vs – Volume, Variety and Velocity. The roles and responsibilities of data scientists should be naturally determined by the nature of big data. First, as big data wears many hats, so does a data scientist who works on it. That means a data scientist has multiple roles and takes multiple responsibilities in an organization.

More Read

Hadoop in government
How Governments Can (and Should) Use Hadoop
Privacy in Behavioral Targeting: Poll Results
Crucial Advantages of Investing in Big Data Management Solutions
Podcast: BI tools vs. Microsoft Excel Spreadsheets
Expert Panel on Challenges and Solutions
  •  Experise in Diverse Technologies

In order to tackle the big volume of data, a big data platform such as Apache Hadoop or LexisNexis HPPC is required to process big data. A data scientist should have a package of knowledge around a big data platform so that they can proficiently tackle the big data on its platform. A data scientist should

1) Have a thorough understanding about the framework of a big data platform like DFS and MapReduce programming framework to deliver robust application designs. That means a data scientist should also have the knowledge about software architecture, compoent and design.

2) Be proficient with several programming languages supported by a big data platform like Java, Python, C++, or ECL, etc.

3) Have a good understanding about database technologies, especially, NoSQL database like HBase, CouchDB, etc.  Because a big data platform is usually communicating with databases to store variety of data format.

4) Be good expertise in math/statistics, machine learning and data mining fields.

The success of a business is not driven by the amount of data but rather driven by successfully finding and extracting interesting and novel patterns and relationship among data and use those gold values to develop the gold products – statistics, machine learning and data mining are great technologies used to understand data and dig out the nuggets from data. Naturally a data scientist must have the expertise in those fields for success. Skills to use some data mining tools or platform like R, Excel, SPSS and SAS is very critical, see Top Analytics and big data software tools

5) Be good at Natural Language Processing (NLP) software or tools – as most the content from big data are text based, news, social media and reports and comments, etc. Knowledge and master one or more NLP software or tools is very critical to the success as a data scientist.

6) Be skillful to one or more data visualization tools. In order to effectively demo the patterns and relationship mined from big data, be able to use some good visualization tools is definitely a plus to a data scientist. Here is a link of top 20 visualization tools.

  • Innovation – curiosity

As the velocity of data change is so fast, constantly there are new findings and problems, a data scientist should be sensitive to those changes, be curiosity to new findings and creative to tackle new problems. He or she should also be passionate to communicate them in a timely manner, explore new product ideas and solutions with the new findings and become a driver for product innovation.

  • Business Skills

First, the nature of wearing multiple hats as a data scientist drives the need for stronger communication skill. A data scientist has to communicate with diverse people in an organization that includes communicating and understanding business requirements, application requirements and interpret the patterns and relationships mined from data to people in marketing group, product development teams, and corporate executives. Effective communication is the key for a business to timely act on the new findings from big data. A data scientist should be a great collaborator and the hook of all.

Second, a data scientist needs great planning and organization skills so that he/she can skillfully handle multiple tasks and set up right priorities and guarantee timely delivery.

Third, a data scientist should have persuasive power, passion and story-telling skill to influence people to make the right decisions based on fact found in data and convince people the value of new findings. A data scientist in this sense is a leader to drive product innovation.

Overall the nature of big data defines the skills of data scientists and their roles in an organization.

Share This Article
Facebook Pinterest LinkedIn
Share

Follow us on Facebook

Latest News

data migration risk prevention
Best Approach to Risk Management for Data Migration in Data-Driven Businesses
Big Data Data Management Exclusive Risk Management
AI in branding
How Data Analytics and Data Mining Strengthen Brand Identity Services
Big Data Exclusive
Hidden AI, a risk?
Hidden AI, Real Risk: A Governance Roadmap For Mid-Market Organizations
Artificial Intelligence Exclusive Infographic
unusual trading activity
Signal Or Noise? A Decision Tree For Evaluating Unusual Trading Activity
Analytics Exclusive Infographic

Stay Connected

1.2KFollowersLike
33.7KFollowersFollow
222FollowersPin

You Might also Like

The Emerging Role of the Analyst – Posts from the Blogosphere

1 Min Read

Business Intelligence & Data Warehousing – Too Much Hype: 11 for 2011

7 Min Read

How Big Data Is Helping Law Enforcement

5 Min Read

Embrace Analytic Athleticism

7 Min Read

SmartData Collective is one of the largest & trusted community covering technical content about Big Data, BI, Cloud, Analytics, Artificial Intelligence, IoT & more.

data-driven web design
5 Great Tips for Using Data Analytics for Website UX
Big Data
ai in ecommerce
Artificial Intelligence for eCommerce: A Closer Look
Artificial Intelligence

Quick Link

  • About
  • Contact
  • Privacy
Follow US
© 2008-25 SmartData Collective. All Rights Reserved.
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?