SmartData Collective

Why XML is incompatible with big data

DavidMSmith

Mike Driscoll lays out his misadventures using XML for managing large amounts of data: it’s too verbose and slows down translation operations, and despite the goals of the XML standard, tags are opaque and cumbersome for humans to deal with. He concludes that there must be a better way: the simple, delimited text files we’ve been using since the fifties. He offers an analogy with LaTeX and MathML:

Spoken languages are strengthened by usage, not by imperial fiat, and data formats are no different. Far better to evolve and adapt the standards we already have (as JSON and SQLite’s file format do), than to fabricate new ones from whole cloth.

XML offers some nice advantages for interoperability when managing “human-sized” documents, but when it comes to truly large datasets I have to agree these benefits are outweighed by the overheads detailed in Mike’s article.
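
To make the verbosity overhead concrete, here is a minimal sketch that serializes the same records once as XML and once as tab-delimited text, then compares the byte counts. The field names, values, and row count are hypothetical and chosen only for illustration; they do not come from Mike's article, and the actual ratio will vary with tag names and data.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical sample data: field names and values are made up for illustration.
FIELDS = ["id", "city", "temperature"]
ROWS = [(i, "Springfield", 20.5 + i % 10) for i in range(1000)]


def to_xml(rows):
    """Serialize rows as XML, one <record> element per row."""
    root = ET.Element("records")
    for row in rows:
        record = ET.SubElement(root, "record")
        for name, value in zip(FIELDS, row):
            # Every value is wrapped in its own opening and closing tag.
            ET.SubElement(record, name).text = str(value)
    return ET.tostring(root, encoding="utf-8")


def to_tsv(rows):
    """Serialize rows as tab-delimited text with a single header line."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    writer.writerow(FIELDS)  # Field names appear once, not once per value.
    writer.writerows(rows)
    return buffer.getvalue().encode("utf-8")


xml_bytes = to_xml(ROWS)
tsv_bytes = to_tsv(ROWS)
ratio = len(xml_bytes) / len(tsv_bytes)
print(f"XML: {len(xml_bytes)} bytes, TSV: {len(tsv_bytes)} bytes, ratio: {ratio:.1f}x")
```

The gap comes from where the field names live: the delimited file states them once in the header, while the XML repeats each name twice per value.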

Dataspora Blog: How XML Threatens Big Data

