The problem of dirty data
More than 25 percent of critical data in Fortune 1,000 companies is poor quality because organizations have little experience in managing content.
Gartner Research believes that very few organizations are making progress towards professionally managing their most important data. Research Vice-President Andreas Bitterer said that “dirty data” is often overlooked as a business imperative.
“There is not a company on the planet that does not have a data quality problem,” said Bitterer. “And where a company does recognize it has a problem, it often underestimates the size of it.”
Why is this? Why do most organizations do such a poor job of managing their content? The reason is that most organizations have little experience in managing content. In the past, they have only administered it.
Before the Web, content was rarely seen as critical to the future of the organization. It was important, but only in the same way that an invoice or a product manual is important.
How many organizations are run by people who started their careers as librarians or journalists? Not many. Organizations tend to be run by people with an accounting or sales background.
The Web has changed the role of content. Content can now make the sale, deliver the service and build the brand. Slowly, organizations are beginning to realize that content is a business asset.
How do we professionally manage content? We don’t. We shouldn’t manage content in the same way that we shouldn’t manage technology. Content and technology are merely a means to an end. What is the end? The end is the task the customer wishes to complete. That is what we should manage.
I once reviewed a section of a human resources intranet that was dealing with dental care for staff. The homepage started off explaining why it’s important to look after your teeth. Who cares?
Staff came to this page for the following reasons:
- To find out what dental plans were available
- To compare plans
- To calculate how much per month it would cost to join a particular plan
- To find out where the nearest approved dentist was located
I asked the person responsible for the section where this information was. She informed me that it was in a PDF. And where was this PDF? At the bottom of the page.
This person was at best an administrator-a put-it-upper. She saw her job as getting content up on the website as quickly and easily as possible. That’s how she was measured.
What is the task of your customer? What are they trying to do? Your job is not to put content up on your website. Your job is to help your customers complete common tasks quickly and easily. How quickly can they find that local dentist? That’s how you measure success.
This is the age of the Web. This is the age of content. If commerce is selling with people, then ecommerce is selling with content. If government is serving with people, then egovernment is serving with content.
Most organizations are only beginning to realize the fabulous potential that content has. If you manage the task-not the content or the technology-you can help your organization tap that potential. Doing this will help your organization and significantly enhance your career potential.

Lee Fleming says:
Added on April 23rd, 2007 at 2:44 pmHow did you make the leap from “data” to “content”? You are talking about two different things here. Content is finding information on a website or in a product manual or training materials etc. etc. Data is business intelligence, and, if used properly, can drive business processes and decisions without ever being seen by human eyes. A bottleneck in, say, the delivery of power over the grid can be resolved by correctly using data applied to a business rule which sparks a decision — but this is something a human might discover only after the fact when he or she looks at the logs. Content may be a subset of data, but it is not data. Maybe you should read a bit more about business intelligence before jumping into topics like business data.
Peter Cullen says:
Added on April 24th, 2007 at 11:27 amI’d have to agree that Gartner are talking about something different. They are talking about ‘poor quality customer data’ rather than efficient usability on a company website. I can see a link for sure between managing data correctly to understand what content customers want.
The Gartner press release is quite specific in its definition of ‘data quality’;
* Existence (whether the organisation has the data)
* Validity (whether the data values fall within an acceptable range or domain).
* Consistency (for example, whether the same piece of data stored in multiple locations contains the same values)
* Integrity (the completeness of relationships between data elements and across data sets)
* Accuracy (whether the data describes the properties of the object it is meant to model)
* Relevance (whether the data is the appropriate data to support the business objectives)
Full press release @
http://www.gartner.com/it/page.jsp?id=501733
Susanne says:
Added on May 3rd, 2007 at 7:24 pmI have to agree with Mr. Fleming that Mr. McGovern is making quite a leap from ‘data’ to content or the elements on a web page. Content includes: text, graphics, art, photos, videos, podcasts, wav files, hyperlinks, code - anything that a visitor might benefit from. In CS - there is a distinction between data and information. Data is a measurement that can be organzied - analyzed - manipualated - when it becomes organized some way - it becomes information. BUT - web content is more than data. So making a point by ‘manipulating’ Gartner Research is really stretching it - don’t you think Gerry?
Gerry McGovern (blog author) says:
Added on May 4th, 2007 at 6:25 pmPoint taken. Loud and clear.