Wikipedia’s role on the web has been worrying me lately. Not that Wikipedia in itself is in any way particularly bad, but its influence on the web as a whole has some problems.
In the olden days, finding information on the web was hard, and good link directories were worth their bytes in gold, and good content was worth even more. Nowadays we have good search engines, thanks to Google, and lots of good content (and lots of bad content too, but that’s not really a problem).

Wikipedia is one source of good content on the web. But it’s not the only one – it’s just the most obvious one in many cases. My problem with Wikipedia is mainly how people link to it. Here’s an example:
× I like to feed the pigeons. I sometimes feed the sparrows too.
This I’d like to claim is by far the most common way of referring to Wikipedia. What’s wrong with it?
- If your reader doesn’t know what a pigeon or sparrow is, it’s 99.9% certain that’s because the reader isn’t very good at English. In this case, if anything, every word ought to be linked to a dictionary, but that wouldn’t be very useful except for 0.1% of your readers. They should know how to look up words themselves if they’re not proficient in the language of the text they’re reading, don’t you think?
- If your reader suddenly becomes very interested in pigeons or sparrows by reading your text, I’m sure one of the first places your reader would look for informations on said birds is Wikipedia. There is no need to point people to Wikipedia, because everyone knows how to find Wikipedia articles anyway. In Firefox, just type “wikipedia pigeon” in the address bar, and you’ll get redirected to the page. In other browsers, go to google.com and type it in and press “I’m feeling lucky”. It’s real easy.
The reader might think that you’re linking to some really interesting tidbit about pigeons or sparrows, or maybe a funny Youtube video. But to find out, the reader has to mouse over the links and check the status bar where they point. Only to find out they just point to the Wikipedia entries, which the reader could easily have found without your assistance. This makes reading cumbersome.
- It degrades the quality of interlinking on the web. All you’re doing is helping Wikipedia get a higher PageRank – and their PageRank is already as high as it can get. You’re not helping the guy who has spent serious time documenting pigeons and sparrows and runs a really interesting web site on the subject that your readers may actually enjoy if you had only taken the time to find and link to it – like in the old days.
In the example above, it’d be easy to argue that “pigeons” and “sparrows” don’t need to be linked at all. But this is how thoughtlessly people use hyperlinks. So let’s change the example to something similar yet where hyperlinking a word might be more appropriate:
× But the pronunciation doesn’t change since the word is a dvandva.
I bet you don’t know what a dvandva is. It doesn’t matter here anyway since I’m just using it as an example but I’m sure you’ve already went and read the Wikipedia article.
The Wikipedia entry comes out first on Google for a search on “dvandva”. Why? Probably because of hyperlinks like the above example. But the article sucks. Here’s an example of good use of hyperlinking:
○ But the pronunciation doesn’t change since the word is a dvandva.
The target of this link is a paper written at a university regarding dvandvas in Japanese. Of course, if the context isn’t about Japanese then it might not be the best link target, but I’m sure there are more good articles about dvandvas. I thought that paper was da proverbial bomb. Really good reading. That’s why I link to it – to encourage my readers to read it, and to promote it in the search rankings. (Of course this is hypothetical since I’m actually writing about something else right now but if I were writing about dvandvas…)
It takes time to find good link targets – but please take the time! For your own, your readers’, and the authors’ of those link targets sake. And for the future of the Internet.

Note that I am not opposed to linking to Wikipedia completely. If the Wikipedia article on a subject really is the best piece of information on it on the whole web, and the subject demands a hyperlink in order to be understood by most people, then indeed it’s the corresponding Wikipedia article you should link to.
Also of course when discussing Wikipedia itself it is highly appropriate to link to Wikipedia sources. But even in this case, I see it go wrong, for instance like this:
× Wikipedia recently started adding the “” attribute to outgoing links.
That “nofollow” link to the Wikipedia article on the “nofollow” attribute violates the point outlined above in the same way “pigeons” did. Here’s a better way of linking it:
○ Wikipedia recently started adding the “nofollow” attribute to outgoing links.
which links to Wikipedia’s meta wiki describing the policy. That’s a good way of linking to Wikipedia. Here’s an even better way of doing it:
☆ Wikipedia recently started adding the “nofollow” attribute to outgoing links.
That links to the most interesting text on the subject that I could find in a couple of minutes. I’ll gladly share that good piece of writing on this subject with you – that’s why I link to it.

Lastly, I’d like to mention that I think said Wikipedia policy of adding the “nofollow” attribute to outgoing links on Wikipedia is stupid and bad. I think that if you read my above argumentation, you’ll see why I think that. People are linking to Wikipedia en masse for no good reason, bloating its PageRank and diminishing the chance of other, better, sources of information to get found. If at least being cited in a Wikipedia entry boosted the PageRank of the source, then the chance of someone finding it would improve just a little. Not to mention it would be fair. The paper on dvandva above and Ed Felten’s blog entry deserve that.
[No interwebs were hurt in the writing of this blog post; all links to Wikipedia have the "nofollow" attribute set.]

I disagree with your claim that adding the "nofollow" attribute to outgoing links on Wikipedia was "stupid and bad".
Wikipedia's own interest is not to be polluted by spam. Getting a higher pagerank was one motivation for spammers to put their links in Wikipedia. The "nofollow" attribute removed that motivation.
Of course, this is not fair to many people on there – those to whom Wikipedia links, and who are not rewarded for it in terms of pagerank.
But… do you have a suggestion for Wikipedia that would do as much to control spam as using "nofollow"?
Hi DM, thanks for your comment. I think the subject of Wikipedia's "nofollow" attribute might require its own blog post…
But yes, I can think of other ways, like only allowing registered users to add outgoing links, or an even simpler measure would be to make links nofollow until a registered (perhaps registered for x months) user has verified it.
Personally I think Wikipedia would gain from not allowing anonymous edits at all, especially since it would probably reduce the administrative work drastically, allowing "editors" to improve the content instead. That would also alleviate the link spam problem.
Anyway, not linking back (in terms of search engine ranking) to the content they rip off is unfair and bad for the net. But I think people linking to Wikipedia in the thoughtless ways I outlined in this blog post is an even bigger problem. I hope people start linking to good content instead of relying on Wikipedia to be the global directory of the Internet. That would also alleviate the problem with outbound links from Wikipedia having such abnormally high value. I definitely don't expect that to happen in the near future though, I'm just wishing it would.
i so totally agree. it's an awful shame there is no way we can force all the fools making worthless hyperlink to read this and think about what they are actually doing instead of just feeling 'geeky cool' doing something nonsensical but 'internetty' as they currently are. the worst, i'd have to say though, is the web forum software and various sites that have some sort of retarded automated linking software in place that adds links to words used by an author who had know knowledge those links were going to be automatically plugged in. half the time you will see semantically the thing linked has absolutely nothing to do with the meaning of the word the author was using.
I would agree with the PageRank argument, if PageRank was actually still in use at Google.
The PageRank algorithm was used in the first implementation of Google in late 90s when it wasn't yet run by a multibillion dollars company doing business with AdWords.
Now, the way Google indexes the web today has barely anything to do with the PageRank algorithm and is a company secret anyway, but we keep seeing SEO companies around the world claiming they can "improve your PageRank" for a few thousands $$$…
Anonymous is totally missing the point. Yes, page rank isnt nearly as important in 2009, as it was years ago. But inbound links and the relevance of the pages linking to your site are VITALLY important to achieving higher rankings on Google.
Wikipedia wants their cake and to eat it too here. They want ALL of their inbound links to give them page credibility, yet want to discount all of their outbound links so NONE of the outbound links give any credit to anyone else.
Here is an example. Wikipedia has 2 lines on subject X and then a link to website Y, which has 100 pages on this subject. A million people a day click on the wiki page, find the link and realize the second page is a better resource. But since Google allows for this blind No Follow, a lot of the real relevance on the subject continues to go to Wikipedia and then die in their dead end of programming.
Site Y is the real relevant location that should be #1 in the rankings, but since Wiki controls the outbound link pop – Site Y never gets where it needs to in the rankings. There is no way anyone can condone this with a straight face.
Google needs to eventually be able to allow for no follow tags to block spammers, while still giving the appropriate credit to the sites that are being linked to.
If you highlight a word on nytimes.com a small question mark pops up, clicking it transports you to a dictionary. That's a useful and relevant feture, seeing how journalists at the greatest newspaper in the world tend to use novel words here and there.