« Many Different Forms of XML | Main | National Information Exchange Model (NIEM) »

Many Different Data Formats, Why XML?

This is a series of posts where I am providing information relating to figuring out what the best data format to use and why. Basically, when is XML better, when is XBRL better, and when is RDF/OWL better.

If you have been working with computers as long as I have you have likely run across many different data formats.  Here is a list of the common data exchange formats (other than database file formats):

Each of those files expresses the same information in a different way.  Another way of saying this is that each file format has a different syntax.  There are lots of other file formats (see http://www.fileinfo.com/filetypes/data).  Imagine having to write sorftware to parse and use these different data formats.

To make a very long story short, XML is becoming quite popular.  XML is a meta language, a language for building languages.  XML is a syntax.  The XML syntax is becoming very popular for the following reasons:

  1. It works well over the Web.
  2. It is platform independent. (Meaning, it is just a text file, pretty much any computer can read XML.)
  3. It can express complex information structures well. (For example, CSV cannot be used to expess a hierearchy.)
  4. It is readable by both humans and by computers.
  5. It is self-describing.  (Meaning, the information which describe the information, called metadata, is availalbe with the information.  A contra example is that there is no standard way to describe the contents of say a a CSV file.)
  6. There are lots of free or low-cost XML parsers available.  There are many XML editors available.  There are many XML schema editors available.
  7. XML is easy to create by hand or generated by a computer application.

On the one hand, every file format has its pros and cons.  (For example, JSON is far less verbose than XML and easy for many Java developers to use.)  On the other hand, agreeing on one format has its advantages. 

Bottom line: Lots of people are agreeing on XML.  This saves time and money.

Posted on Sunday, April 18, 2010 at 09:11AM by Registered CommenterCharlie in , | CommentsPost a Comment

PrintView Printer Friendly Version

EmailEmail Article to Friend

Reader Comments

There are no comments for this journal entry. To create a new comment, use the form below.

PostPost a New Comment

Enter your information below to add a new comment.

My response is on my own website »
Author Email (optional):
Author URL (optional):
All HTML will be escaped. Hyperlinks will be created for URLs automatically.