Low value content is destroying your website

Low-value content is destroying the usefulness of intranets and public websites. It needs to be stored separately.

Andrew Leung is a computer science researcher at the University of California. His team analyzed a large data/content environment over a three month period. Their findings included the following:

  • More than 90 percent of the files were never accessed.
  • Of those files accessed, 65 percent were only opened once.
  • Most of the rest were opened five or fewer times
  • About a dozen files were opened 100,000 times or more.

The study also found that the ratio of content/data being accessed in the system versus new content/data being published was about 2-1. In previous studies this ratio was 4-1 or higher. This means that the rate at which we are publishing new stuff versus the rate at which we are accessing already published stuff continues to grow.

Recently, I’ve been testing the quality of a search engine for a commercial organization’s public website. This organization sells a wide range of products, and its customers’ search behavior reflects this.

However, what I find when searching for some of this company’s most popular products is that the search results are full of links to the press archive and other old, out-of date content. Some of the content is misleading and wrong, talking about, for example, a feature for a product that has long since been replaced.

Poor quality, low grade, minor-interest content is choking the usefulness of the search engine. I find this happens again and again and again. It happens on intranets, many of which have become dumping grounds for low quality content.

One reason intranets have become such dumping grounds is because a great many organizations have no clear strategy in relation to how they manage their content/data. Because there is no other place to put “stuff”, many people simply store it on the intranet, which of course bulges and bulges and bulges.

Governments have a particularly severe problem when it comes to managing content. A key reason is because the Freedom of Information Act suffers from the law of unintended consequences. Some government people are piling everything they can find onto their websites so that they can say they’ve made it available to the public. You may not be able to find it but it is there somewhere.

Most data and content that we create is next to useless. Nobody will ever be interested in looking at it again. Then there is another chunk that has fractional demand. However, we need to store most of this low-level stuff somewhere just in case of some unforeseen event.

There is a saying: What do you get when you cross a fox with a chicken? A fox. When you manage low-level content and high-quality content on the same website, the low-level content smothers and eats up the high quality content. We must thus manage them separately. We need a website for the low level stuff. But our primary website should be for the high-quality content that people actually need today.

In every environment, there is a small set of content that has a disproportionate demand and value. As our content/data universe explodes, it has never been more important to manage this precious content separately from low-demand data.

Measurement and Analysis of Large-Scale Network File System Workloads (PDF)

THE 2008 GLOBAL INTRANET STRATEGIES SURVEY: CALL FOR PARTICIPATION
Companies and organizations with intranets are invited to participate and will receive a free advance copy of the 2009 Intranet Trends Report which will be published in October 2008. (Closing date: August 31.) To participate in the 2008 Survey, send an email to
Jane McConnell

 

14 responses


  1. I’ve had an experience similar to what you describe - finding old, out of date stuff when I perform a search. But… what about the times a user actually needs old information? Sometimes I go to a web site because I need to know what a company was saying in 2002, or I’m searching for specifications for an old product. One of the benefits of this information being available online is that it is findable, whereas in the past it would require human intervention to dig it up.

    In those situations it’s quite frustrating to find that an organization has erased its history on its web site.

    Perhaps the problem isn’t that the content is there, but that search tools are not up to the task of identifying what is new is and what is old and making that clear to the user?


  2. Gerry,
    Great column as always, and it’s definitely ammunition I can use when trying to persuade users to keep their content fresh.

    I do have a specific problem, though — as the manager of a hospital intranet we face the challenge of having to provide employees access to complete sets of clinical, administrative and other manuals. In many cases, departments have ditched the paper versions of the manual in favor of an electronic version. How would suggest we handle that? We do not have an enterprise document management system, and in many cases the intranet becomes that by default.

    Thanks in advance for any ideas,
    Sue Morgan


  3. Another column to make us stop and think. Thanks.

    Is it any different though from large libraries that bought (or were given) many books that were never accessed? But they all showed up in their catalogues and “took up valuable space”.

    And I agree with John: what’s wrong with “dumping” a whole lot of information on a website, as long as it’s accessible by navigation or search?


  4. I concur with John. Careful experience design work is required to make low-value content available to users that need it.

    If a user’s task is to find specific or niche content then that press release from 2002, or old product spec becomes “high-value content” for that individual user. They will not perceive that this sort of content will be available from anywhere else but the primary domain of the authoring organisation.

    If low-value content is stored on a separate website, then a search results screen on the company website could link to the archive site suggesting users search there. Web based technology can be an inexpensive alternative to bricks and mortar storage services. This is the reason why many in-house web teams receive pressure from the business area to put old content up, or keep all out-of-date content in production on the top-level sections of the site.


  5. I know this is a very difficult challenge and I know there is a lot of pressure to put up stuff. And it is a valid argument to say: What if some day someone needs this piece of content–let’s call it XY?

    However, my argument is: Today, huge numbers of people want the content piece AB. On many, many websites it’s very hard to find AB. It’s often very hard to find the important stuff because of the thousands and thousands of pieces of content like XY.

    Clint, it’s not at all easy to design for large-scale content environments. Many organizations I know of have little or no information architecture design professionals, and practically zero search management professionals.

    I believe, more and more strongly every day, that we should put the low-value content on a separate website, to which we give 10 percent of our resources. Then we focus 90 percent of our resources on the high-value stuff.


  6. Sue, you have a difficult challenge. This content is critical, so it must be managed. What I would do is take a selection of doctors, nurses, administrators and ask them to try and complete really important tasks using these online manuals. Literally, video them and measure how long it takes, how many give up, etc. Then, you can show to management that the environment is working or not.


  7. Gerry, another challenge for me to consider….
    I manage a software support site (no press releases :). We have just started using an “archive” location to store content for older releases that may be needed by a small segment of our user base. This has proven helpful for our support organization.

    The problem is that I can’t figure out how to make it really useful for our site visitors. AND, I’m having a heck of a time getting feedback from them.

    We’re still trying, though. Thanks for keeping us thinking.


  8. If you are working with limited resources, you have to invest where you’ll get the highest ROI. If you know that 10% of content makes up 90% of information needs, use your limited resources to make that 10% easy to access. That will have the greatest benefit. If you have plenty of resources, then make the other 90% of content easy to find and use.

    Successful strategy usually requires stating as clearly what you won’t do as what you will do, within the parameters and limitations of your work.


  9. Ephraim, I think you’ve hit the nail on the head. This is the essence of the challenge. I remember reading the book Good to Great. The great managers focused as much time of what they were not going to do as what they were going to do. Instead of having “to do” lists, they had “stop doing” lists. There’s way too much to do today. We have to be selective if we want to do a good job.


  10. I agree with your latest column about low value content overwhelming the good stuff. However, you do not mention how to go about correcting this situation.

    We currently have an intranet that is chock full of stuff, much of which we know is out of date. However, because the intranet is designed to be user driven, not communications driven, there is no mechanism to update or remove old information - users are supposed to keep on top of their information, but are too busy to do so.

    What can be done to edit this information down on a regular basis without having someone in charge of it, like an editor?


  11. Dorothy, it’s a bit like telling your content authors that “a puppy is not just for Christmas”. If they want to publish content then they have to agree to review it on a regular basis, and remove it when it goes out of date. And if they don’t have time to do that, then the content is obviously not worth publishing in the first place.

    It’s time to get serious about content management.


  12. Thought-provoking post that I think goes to the heart of search marketing optimisation ‘per se’, as well as the ‘content is king’ concept, and better site usability, navigation, architecture etc.

    As an experienced online marketer for a couple of decades now, I feel like I’ve grown alongside Gerry McGovern, his ‘content’ and consultancy.

    Now working in a search engine marketing company called Fuse Optimisation (www.fuseo.co.uk), and a journalist by profession before I started in online, content really does remain king.

    But the expertise that the likes of Fuse and Gerry can bring to organisations wondering how to surface their content to the outside world, to the public etc, is still way under-utilised at present.

    Many people don’t fully understand how (or even why) to harness the power of search and well-architected, navigable websites to ensure the copy is found by those searching for it (although to give ground a little, reaching these people with education is also required).

    Trying to keep this brief, I would just close by saying public organisations like government and councils need to prove value for money when spending from the public purse.

    Dodgy, one-man websites - poorly optimised, badly architected - is pretty much pouring public money down the drain. As a taxpayer, I kind of demand they stop wasting my money. As a search marketing optimisation specialist, I say they should speak to experts like Gerry and Fuse.

    Do look at how Fuse surfaces relevant content. Try searching for ‘claim’ on Google. We’ve optimised a website to be up at the top of the rankings - http://www.youclaim.co.uk. That site is all about personal injury claims, it’s one of the best in the world, if not the best at what it does.

    If your site isn’t doing that, or heading that way, maybe you ought to think about how you are spending your money online. Contact experts like Gerry or Fuse, and don’t pour good money after bad. Think, think, think is my final cry - please… taxpayers money, especially, shouldn’t be wasted.


  13. Your article was very attention grabbing and was just what I was searching for. jeremy


  14. As a writer myself, I find that it can be difficult to differentiate low value content from high value content at the best of times. As has been mentioned previously, even old or content perceived to be of low value has value for somebody somewhere.

    It results in a delicate balancing act of making sure your high value content is easily accessible, while at the same time ensuring users can access low value content if they need too, through archives and the like. Of course some content is completely out of date, such as time specific offers, so its not just a case of chucking all of the old content into an archive and leaving it at that. This can devalue the content further.

    I have been writing content for a website called Http://www.claimtime.com for a while now and find that this is one of the harder challenged. Making sure that the content remains up to date and of high value at all times is a constant job, especially when it comes to the blog.

One trackback

Leave a Reply