SmartData Collective

Data Mining Fundamentals: Terms You Must Know

metabrown

If anyone tells you there is a firm definition for the term “data mining,” that person is either misinformed or flat-out lying. While I’m pleased to report that I haven’t encountered many conscious liars in this realm, misinformation is very common, even among professionals who oughta know better. Just as common is use of unnecessarily complex language, to the point that I have difficulty understanding many of my colleagues, so heaven help the novice. No wonder so many business people still think of analytics as fancy stuff they don’t need!

Analytics doesn’t have to be incomprehensible to be good. In fact, if you can’t understand it, it’s probably doing you no good at all. The best analysts communicate in plain business language!

Want a better understanding of data mining basics and a better ability to see through analytics mumbo jumbo? Start by getting a solid understanding of some of the most common terminology.

analysis

Any method, formal or informal, of summarizing data into a (comparatively) brief, informative form. (At least, it is informative in the view of the analyst.)

analytics

Another very general term, but unlike “analysis,” “analytics” always implies that there is math involved in the process. Some people use this term to refer to simple reports (usually data summarized as totals and averages of historical data), while others are referring strictly to more sophisticated analysis, such as inferential statistics.

data mining

This term came into widespread use in the 1990s to describe techniques and tools geared to enabling business users (people knowledgeable about their business, but not trained in statistical analysis) to independently identify meaningful patterns in data and develop predictive models, with only a moderate amount of training in the use of the data mining toolset. Some of the distinctive elements associated with data mining include empowerment of business users, emphasis on visualization (graphs), and speed and simplicity of discovery and model development.

The vision of empowering business users has yet to become a widespread reality. Today, users of data mining tools usually have significant training or experience in traditional statistical analysis, and the tools have been expanded to offer a wide variety of sampling and traditional statistical modeling techniques.

Because of the rise in popularity of data mining, many vendors and analysts have taken to describing whatever they offer as “data mining.” What’s more, many analysts have taken to advising clients that data miners must have expertise in SQL, programming and/or a variety of other skills that are not necessary to obtaining useful business insight from data. If it’s not geared to speedy discovery of meaningful patterns and predictive models from business data, or if using the tool requires lengthy formal training in statistics, programming or anything else, it’s not data mining.

predictive analytics

Development and use of mathematical models that make predictions about specific events, such as whether an individual will buy a product or repay a loan. These predictions are usually in the form of probabilities. Both traditional statistics and data mining can be used for predictive analytics.
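To make "predictions in the form of probabilities" concrete, here is a minimal sketch of a logistic model that scores one customer. The function name, the two input variables and every coefficient are invented for illustration; a real model would estimate its coefficients from historical data.

```python
import math

# Hypothetical coefficients, made up for this illustration only.
# In practice they would be estimated from historical customer data.
INTERCEPT = -2.0
WEIGHT_PRIOR_PURCHASES = 0.8
WEIGHT_DAYS_SINCE_VISIT = -0.05


def purchase_probability(prior_purchases: int, days_since_visit: int) -> float:
    """Return the modeled probability (between 0 and 1) that a customer buys."""
    score = (
        INTERCEPT
        + WEIGHT_PRIOR_PURCHASES * prior_purchases
        + WEIGHT_DAYS_SINCE_VISIT * days_since_visit
    )
    # The logistic function maps any score onto the 0-to-1 probability scale.
    return 1.0 / (1.0 + math.exp(-score))


if __name__ == "__main__":
    for purchases, days in [(0, 30), (3, 2)]:
        p = purchase_probability(purchases, days)
        print(f"prior purchases={purchases}, days since visit={days}: {p:.2f}")
```

The output is a probability rather than a yes-or-no answer, so the business decides where to draw the line, for example by mailing an offer only to customers above some chosen threshold.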

statistics

At the most basic level, “statistics” can refer to simple summaries, such as totals and averages. More sophisticated (and revealing) statistical analysis is based on testing hypotheses about data. This type of analysis is known as “inferential statistics” or “hypothesis testing.” Many formal procedures have been developed for inferential statistics to suit a wide variety of uses.

Libraries could be filled with the many and varied books written on statistical procedures. That said, most businesses that use inferential statistics make use of just a small selection of widely used procedures, and good explanations of these procedures can be found in most any current text used for introductory college statistics courses.
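The difference between a simple summary and a hypothesis test can be shown in a few lines. The sketch below first computes averages, then runs a permutation test, which is one of many inferential procedures and is chosen here only because it needs no special library. The store names and order values are made-up sample data.

```python
import random


def average(values):
    """Simple summary: the arithmetic mean of a list of numbers."""
    return sum(values) / len(values)


def permutation_p_value(group_a, group_b, n_shuffles=10_000, seed=0):
    """Estimate how often randomly relabeled data shows a difference in
    averages at least as large as the one actually observed."""
    rng = random.Random(seed)  # fixed seed so the result is repeatable
    observed = abs(average(group_a) - average(group_b))
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    hits = 0
    for _ in range(n_shuffles):
        rng.shuffle(pooled)
        diff = abs(average(pooled[:n_a]) - average(pooled[n_a:]))
        # Small tolerance guards against floating-point rounding.
        if diff >= observed - 1e-12:
            hits += 1
    return hits / n_shuffles


if __name__ == "__main__":
    # Hypothetical order values from two stores.
    store_a = [52, 48, 61, 55, 47, 58, 50, 53]
    store_b = [60, 66, 59, 71, 63, 68, 62, 65]
    print("average A:", average(store_a))
    print("average B:", average(store_b))
    print("p-value:", permutation_p_value(store_a, store_b))
```

The averages describe what happened. The p-value addresses a different question: if the store made no difference, how often would a gap this large appear by chance alone?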

text mining (or, text analysis)

This is data mining, when the data is text, such as responses to open-ended survey questions, social media posts or comments on warranty claims. In this context, text is often described as “unstructured data.” Text mining is a developing field, not yet used by many businesses. Text mining is the most challenging area of data analysis today.

web analytics

Analytics based on data describing events occurring on the Internet, or some other similar network. In practice, most web analytics are simple summaries, counts of events such as page downloads, referrals from specific sites and so on. However, more sophisticated analytics can also be applied to web data. Inferential statistics applied to web page performance (in sales or some other desired behavior) is “A/B” or “multivariate” testing. Data mining techniques used to study user movement through a web site are “sequence analysis.”
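As an illustration of an “A/B” test, the sketch below compares the conversion rates of two page versions with a two-proportion z-test. This is one common way to analyze such a test, not the only one, and the visitor and sale counts are invented.

```python
import math


def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Compare two conversion rates.

    Returns (z, p_value), where p_value is two-sided and relies on the
    normal approximation, so it is only reasonable for large samples.
    """
    rate_a = conv_a / n_a
    rate_b = conv_b / n_b
    # Pooled rate under the hypothesis that both pages perform the same.
    pooled = (conv_a + conv_b) / (n_a + n_b)
    std_error = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if std_error == 0:
        raise ValueError("test is undefined when no one or everyone converts")
    z = (rate_b - rate_a) / std_error
    # Two-sided tail probability of the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


if __name__ == "__main__":
    # Hypothetical counts: sales and visitors for page versions A and B.
    z, p = two_proportion_z_test(conv_a=100, n_a=1000, conv_b=150, n_b=1000)
    print(f"z = {z:.2f}, p-value = {p:.4f}")
```

A small p-value suggests the difference between the two pages is unlikely to be chance alone. Whether the difference is large enough to matter is still a business judgment.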

Not fancy or incomprehensible, is it? And there is no reason why it should be.

 

©2011 Meta S. Brown
