SmartData Collective

Big Data converging Data and Content

Barry Devlin

Whatever about my concerns (and those of others) that the term big data defies sensible definition, there is little doubt that it has been the big news of 2011 and looks set to continue in the same role next year. One interesting trend that I see is a convergence of interests and approach between vendors from the once separate worlds of data and content. The thinking springs from a recent conversation with Laurent Simoneau, President and CTO of Coveo. But first, let’s make sure we all mean the same thing by data and content.

As discussed elsewhere, I find it useful to consider big data in four separate categories: (1) machine-generated event data, (2) computer-generated log data, (3) user-generated text and (4) user-generated audio, image and video.  These categories are based largely on the level of structure within the data and the differing technologies required to store and process it, leading to category boundaries that are somewhat flexible.  For completeness, and to include what we might call small data, let me add here category (0) which includes the traditional transaction and status data that we’ve stored and manipulated by computer for more than half a century.  We seem to have a tendency as humans to categorize the world in a binary fashion—good or bad, short or tall, black or white and so on.  You are no doubt familiar with the binary classification of structured vs. unstructured information which splits the above categories somewhere around (2).  I, myself, prefer the equivalent terms hard vs. soft information, simply because “unstructured information” is an oxymoron; information, by definition, has structure. 

The other common binary classification is data vs. content, where content starts at category (3) above. The data/content division arises from an old technological boundary between databases and content stores. Databases distinguish between pieces of information depending on their meaning and store them in separate records and fields; access is via query. Content stores do not see or impose lower-level structuring of information; access is via search. This distinction is oversimplified, of course. Databases have been adding features to support large text fields and blobs (binary large objects) for years, and content stores do support field structures. However, such non-core add-ons have tended to be treated as second-class citizens on both sides.
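To make the query vs. search distinction concrete, here is a minimal Python sketch. It is an illustration only: the `orders` table, the sample documents, and the helper names (`total_for_customer`, `build_index`, `search`) are all hypothetical, and the tiny word index stands in for a real content store's search engine.

```python
import sqlite3
from collections import defaultdict

# Database side: meaning is captured in named fields; access is via query.
# The table and its rows are made-up sample data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT, product TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [("acme", "widget", 120.0), ("acme", "gadget", 80.0), ("zenith", "widget", 45.0)],
)

def total_for_customer(customer):
    """Sum the amount field for one customer using a structured query."""
    row = conn.execute(
        "SELECT SUM(amount) FROM orders WHERE customer = ?", (customer,)
    ).fetchone()
    return row[0]

# Content side: no field structure is imposed on the text; access is via search.
documents = {
    "mail-1": "Acme asked about a late widget delivery",
    "note-7": "Zenith praised the new gadget",
}

def build_index(docs):
    """Map each lowercased word to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in text.lower().split():
            index[word].add(doc_id)
    return index

def search(index, *terms):
    """Return ids of documents that contain every search term."""
    hits = [index.get(term.lower(), set()) for term in terms]
    return sorted(set.intersection(*hits)) if hits else []

index = build_index(documents)
```

The query can answer "how much did Acme order?" because the amount lives in its own field. The search can only say which documents mention Acme and widgets, but it needs no schema to do so.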

Recent business developments and technological advances have caused vendors on both sides of the fence to look to the other side.  My most recent white paper, sponsored by NeutrinoBI, examined how search technology could be effectively applied to corporate data.  An earlier white paper with Attivio came at the convergence from the content side, extending the use of inverted indexes from the content world to more structured data.
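As a rough illustration of what extending an inverted index to structured data can look like, the Python sketch below indexes the field values of a few records so they can be found by keyword. It is an assumption-laden toy, not the approach described in either white paper: the sample rows and the functions `index_rows` and `search_rows` are hypothetical.

```python
from collections import defaultdict

# Made-up structured records, as might come from a relational table.
rows = [
    {"id": 1, "customer": "Acme Corp", "region": "North", "status": "late"},
    {"id": 2, "customer": "Zenith Ltd", "region": "South", "status": "shipped"},
    {"id": 3, "customer": "Acme Retail", "region": "South", "status": "late"},
]

def index_rows(records, key="id"):
    """Map each lowercased token to the (row key, field name) pairs where it occurs."""
    postings = defaultdict(set)
    for record in records:
        for field, value in record.items():
            if field == key:
                continue  # the key identifies the row; it is not indexed as content
            for token in str(value).lower().split():
                postings[token].add((record[key], field))
    return postings

def search_rows(postings, *terms):
    """Return keys of rows in which every term appears in at least one field."""
    per_term = [
        {row_key for row_key, _field in postings.get(term.lower(), set())}
        for term in terms
    ]
    return sorted(set.intersection(*per_term)) if per_term else []

postings = index_rows(rows)
```

A user can then type `search_rows(postings, "acme", "late")` without knowing which column holds which value, while the field name kept in each posting preserves the structure for later filtering.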

Coveo have always had a large number of connectors to a wide variety of content from websites, e-mails and content stores as well as ODBC connectivity to relational databases.  Version 7.0 adds Twitter as a source.  Their Enterprise Search 2.0 concept—stop moving data and start accessing knowledge—will make perfect sense to anybody following the push for data virtualization in the BI world by vendors such as Composite Software and Denodo.

From a business point of view, the important point here is to recognize that business users need integrated access to both data and content in order to understand what’s going on and predict what to do next.  The volumes and varieties of big data make it very clear that this need cannot be satisfied by trying to push everything into or through one large information store, whether that be a database or a content store.  There are, and will continue to be, optimal storage and processing technologies for different types of information and different purposes.  Providing equal access to these different stores and equal priority to different access methods will be key.
