FindArticles was a great website that was functional until late 2012. It archived journal articles, newspapers and books of all sorts. At this point in time Wikipedia has over 20 000 links to it, most of them used as sources and references. The only downside is that the website is dead. Most of the time when a link dies, the page has already been archived on other websites such as the Wayback Machine, and this website was too, until September 2012. At that time the website's robots.txt was changed, and all archived copies on archive sites that follow web etiquette were deleted. The website doesn't return a 404 either, which makes it hard for tools to mark the links as dead.
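To make the robots.txt point concrete, here is a minimal sketch of how a well-behaved crawler decides whether it may fetch a page. It uses Python's standard `urllib.robotparser`. The two robots.txt bodies, the `ia_archiver` user agent and the example.com URL are assumptions for illustration; I don't have the file the website actually served.

```python
from urllib.robotparser import RobotFileParser


def crawler_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt lets user_agent fetch url."""
    parser = RobotFileParser()
    # parse() takes the file as a list of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical robots.txt contents, not copies of the real file.
BLOCK_ALL = "User-agent: *\nDisallow: /\n"  # every crawler is shut out
ALLOW_ALL = "User-agent: *\nDisallow:\n"    # an empty Disallow permits everything

if __name__ == "__main__":
    page = "http://example.com/p/articles/some-article"  # placeholder URL
    for label, body in (("before", ALLOW_ALL), ("after", BLOCK_ALL)):
        print(label, crawler_allowed(body, "ia_archiver", page))
```

An archive that follows web etiquette runs a check like this against the site's current robots.txt, which is why a later change can affect copies that were captured years earlier.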

Most of the links from Wikipedia to FindArticles are for journal articles, which most likely have DOIs or PMIDs that are not mentioned in the Wikipedia articles. This causes some problems. Either remove all the links without any thought of the consequences, or look at each article one by one, try to find another copy of the journal article online or its identification numbers, and then remove the links.
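The one-by-one route can at least be triaged. The sketch below scans wikitext for citation templates that link to FindArticles and sorts them into those that already carry a `doi` or `pmid` (the link can go) and those that still need a lookup. It assumes the links point at the findarticles.com domain and that citations use simple `{{cite ...}}` templates with no nested templates. The sample citations are made up.

```python
import re

# Assumption: FindArticles links live on the findarticles.com domain.
FINDARTICLES_URL = re.compile(r"https?://(?:www\.)?findarticles\.com/\S*", re.IGNORECASE)
# A |doi= or |pmid= parameter that has a non-empty value.
IDENTIFIER_PARAM = re.compile(r"\|\s*(doi|pmid)\s*=\s*[^\s|}]", re.IGNORECASE)
# Naive matcher: any {{cite ...}} template that contains no nested braces.
CITE_TEMPLATE = re.compile(r"\{\{\s*cite[^{}]*\}\}", re.IGNORECASE)


def triage_citations(wikitext: str):
    """Split FindArticles citations into (needs_lookup, safe_to_unlink)."""
    needs_lookup = []
    safe_to_unlink = []
    for match in CITE_TEMPLATE.finditer(wikitext):
        template = match.group(0)
        if not FINDARTICLES_URL.search(template):
            continue  # not a FindArticles citation, leave it alone
        if IDENTIFIER_PARAM.search(template):
            safe_to_unlink.append(template)  # reader can still find the article
        else:
            needs_lookup.append(template)    # find a DOI or PMID first
    return needs_lookup, safe_to_unlink


# Made-up sample wikitext; the URLs and the DOI are placeholders.
SAMPLE = (
    "{{cite journal |title=A |url=http://findarticles.com/p/articles/mi_1/is_2 |doi=10.1000/xyz}} "
    "{{cite journal |title=B |url=http://findarticles.com/p/articles/mi_3/is_4}} "
    "{{cite web |title=C |url=http://example.com/page}}"
)

if __name__ == "__main__":
    lookup, unlink = triage_citations(SAMPLE)
    print(len(lookup), "need a lookup;", len(unlink), "can lose the link")
```

This only sorts the work. Finding the missing identifiers for the first group is still the manual (or bot) step described above.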

Normally you could run Citation bot on an incomplete citation and have it find and fill in information that is not already in the reference. However, the bot is currently blocked due to Wikimedia's decision to change to HTTPS without real consideration of how it would break all the tools currently in place for day-to-day operations.


2 thoughts on “

  1. “At that time the websites robot.txt was changed, and all archived copies on websites which follows web etiquette has deleted all copies.” Ahem? Who deleted what?


    1. The Web Archive (and other archives) delete all copies after the fact if the robots.txt changes. may have some copies, but we have an RfC (and blacklist) not to use that.

