Information-Driven Business
How to Manage Data and Information for Maximum Advantage

Robert Hillard

Metadata versus Taxonomy
by Robert Hillard

I’ve advocated for many years that Information Management should be a superset of related disciplines including data warehousing, document management, library science, enterprise search et cetera.  While this is an easy statement to make, it is really hard to execute.

The problem is that practitioners from the different technical backgrounds have radically different approaches to handling information in all of its forms.  While the technologies are different (using solutions as diverse as relational databases, file systems and even physical shelving) this is not the real reason why the disciplines are so hard to bring together.

Practitioners coming from unstructured and structured data backgrounds use subtly different definitions of metadata and I argue that it is these differences that cause most of the angst that comes through in disparate repositories, governance and a lack of integrated business solutions.

Unstructured data came first, and its filing is primarily treated as a problem of taxonomy.  The most famous approach is, of course, the Dewey Decimal System.  When unstructured data practitioners talk of metadata they include the taxonomy and attributes of the data itself such as the author, publication date, copyright and other core attributes (best defined by Dublin Core).

Structured data practitioners have, for the past forty years, relied on relational database theory as the foundation of their information management practices.  Relational data generally includes as data, rather than definition, the key elements of people, place and time.  Such an approach is very neat, with metadata being literally data about data and being restricted to data structures and the definition of the data elements themselves.  As a result, the metadata for structured data is much more succinct.

While succinct is a good thing for computer programmers, it seldom translates well for the rest of society.  As a result, structured database metadata has seldom found its way out of technical departments within large organisations.  At the same time, the need to understand who authored a record, who it was about and how it relates to other events in a timeline remain as important as ever.  As a result, we now have “master data”.

Perhaps the solution is for all Information Management practitioners to concede that Metadata should encompass both the metadata that structured data practitioners advocate and the master data that the unstructured data practitioners have long advocated as being essential.  We just have to get over our fixation on the titles.  I’ve tried to define an approach that does this in my new book, Information-Driven Business.

comments powered by Disqus


blogThe Information-Driven Business blog is published monthly:

   2018   2017   2016   2015   2014
   2013   2012   2011   2010


Also featured from the Information-Driven Business blog:

Sometimes it’s lonely being a robot
I’m committed to be a global citizen but, living in Australia, I simply can’t get to as many meetings around the world as my role would ideally involve. To deal with this, I find other ways to participate. The myriad … Continue reading

Information-driven work
I’ve recently spoken to several executives who have more than two thousand unread emails. They all said roughly the same thing: “If someone really wants me they’ll keep trying”. Others have said the opposite, they are keen to be easy … Continue reading

Opportunities beyond startups
Is it just me or has the world gone mad for startups and writing software? Don’t get me wrong, I am a big fan of startups and all that they bring to the economy. However, if you read the business … Continue reading

Email works too well
Everyone who regularly feels overwhelmed by their email would agree that there is a problem.  The hundreds of articles about the issue typically make the same assumption and are wrong. Writer after writer bemoans email as inefficient and an obstacle … Continue reading

The Internet was a mistake, now let’s fix it
Each generation over the last century has seen new technologies that become so embedded in their lives that its absence would be unimaginable. Early in the 20th century it was radio, which quickly become the entertainment of choice, then television, … Continue reading




© 2010-2018 Robert Hillard