How to manage out of date content

On the Web, nothing is more damaging to your organization’s reputation and brand than out of date content.

On Monday September 8, 2008, a story about a UAL bankruptcy began circulating on the Web. (UAL is the parent company of United Airlines.) Within hours of the story’s being released, UAL’s shares had dropped by 76 percent.

The story was 6 years old.

For some reason, the story had been added to the “Most Viewed” link section on the homepage of a Florida newspaper. From there, Google News picked it up, and the rest, as they say, is hysteria.

It’s easy to put content up on a public website or intranet. In fact, the content management software industry has made distributed publishing a key selling point. Basically, the more published, the merrier.

This is a totally unprofessional approach to website management. But the unmanaged distributed publishing model is attractive to organizations that do not value content. Such organizations want to find the cheapest and fastest possible way to deal with content. The cheapest way is to buy some software, give people basic training in how to use it, and then let them at it.

The easiest decision of all is to publish everything you have. Take all that print stuff and just PDF it. Take anything that’s digital and put it up on that great big website. It’s a “have gigabytes must fill” mentality. If you publish everything, nobody can blame you for leaving something out. It’s just that nobody can find anything.

It’s very hard to review and remove. Not alone does it take time, it also takes skill and authority. Anybody who can use a computer can quickly learn to publish a piece of content on a website. It takes real skill to review.

Organizations are in urgent need of professional review processes for their intranets and public websites. Out of date content is growing year by year, and there are many more UAL-type stories waiting to happen. It’s time for management to get serious and professionally manage their websites.

One of the first tasks is to stop free-for-all distributed publishing. It doesn’t work. We need some sort of basic editorial control that decides what gets published and what doesn’t.

Anything that does get published must have an identifiable owner. That owner must commit to regularly (every six months at least) checking their published content. It is absolutely no excuse for them to say they don’t have time. Don’t let them publish if they don’t have time to review and remove.

Some out of date content should be immediately deleted, some needs to be retained for legal or research reasons. This content should be placed in an archive. Archived content should be very difficult to get to. In particular, it should not appear in normal search results.

What the UAL story teaches us is that archives have the potential to do tremendous harm, whereas their value is often questionable. What it also teaches us is that professional content management requires real people and real skill.

At the bottom of the Google News homepage is a statement: “The selection and placement of stories on this page were determined automatically by a computer program”. I’m sure that gave great comfort to UAL investors as they watched their investments lose 76 percent of their value.

 

6 responses


  1. I agree that most organisations should have much stronger controls around who can publish, and that most of what has been published on many sites belongs in the bin, but I disagree that archives should be very hard to get to. Archives can be the most valuable part of a site: contemporary news reports, support for older products, company history. And two-stage searches suck — I think you’ve over-reached going from ’should not be in latest news’ to ’should not be in search results’.


  2. Hi Brian

    Sorry, I disagree, or perhaps it’s semantics.

    Archived material as defined in the article is ‘out of date content’. To me that means the content is not maintained, and it therefore does not belong on an website.

    To use one of your examples - support for older products: if this content is maintained, the fact it refers to a older product does not make it archive material, and it can therefore still live on the site.

    If the content is not maintained - and is therefore out of date - but ‘needs to be retained for legal or research reasons’, then it needs to be removed from the website and stored elsewhere. This is especially true where each version of the content needs to be kept.


  3. I agree about “updating” content as a regular basis, but as Brian says:

    “but I disagree that archives should be very hard to get to. Archives can be the most valuable part of a site: contemporary news reports, support for older products, company history”

    I think we have a “right” to know. But it is clear that it is not the first time, I see “old archive” appearing in Google “news”. One thing that is missing here, is that Google should improve his “automatically” news search, to really verify, when the news was published, and if it is an “old” news, that should appear big…


  4. The essential element that is all too often missing is a date.

    If a date is prominent on the page it is very difficult to mistake dated information for current stuff.

    As someone who is often looking for stats or recent references online, I am constantly surprised how many sites offer content without a prominent date. And I don’t mean general site content that wouldn’t need date information. I mean, for instance, informational content referencing “recent studies”.

    As others have noted, archive content is often valuable. The secret: it should be archived carefully so that it is clear when it was created. Date seems an obvious data field to have in a content database. Simple, one would think. But it seems not…


  5. I would go one step further and put the archives on a different site altogether. This inserts an extra step into viewing them.

    Yes, this could be made invisible as one page could look just like another site; however, a large banner at the top of the page or right above the article could announce Archived - Oct. 20, 2004.

    Abundant links could direct he reader back to main site where it is made plain that this site contains “Current info” with a date the site was updated. Archives printed out would get a header that read ‘Archived article date published.’

    Regarding some of the other points made, sites that are packed full of every kind of data can be a means of CYA (cover-your-ass)behavior.

    Reviewing content for its relevancy would entail a company having a content strategy and an understanding that what’s on their site affects their customers’ and-or the public perception of them.

    Content Management Systems tend to trivialize content. While it serves as a facilitator for uploading new stuff onto the site, it gives those doing so that the program is more important than the words themselves. What gets overlooked is that the content is a strategic resource in itself.

    Computer programs do not provide judgment and discretion like a content manager does.


  6. Brian, I think you may have a good point in relation to putting archives on another site. The more I come across archives the less value they actually have and the more cost they have. I think archives have a lot of potential to do major harm, and they clutter and confuse the site, as the content in them tends to grow like weeds.

    Yes, it is time to manage content. Take it seriously. Or else there will be consequences.

5 trackbacks

Leave a Reply