British Library sprinkles digital dust on dusty newsprint

Online archive opens up – at a cost


The British Library and its commercial partner brightsolid opened up a pay-per-view online archive of newspapers today, after a crack team scanned 4 million searchable pages that mainly date from out-of-copyright papers published in the 19th century.

It was confirmed in May 2010 that the British Library and brightsolid – which is a division of FriendsReunited and Beano owner DC Thomson – would eventually digitise 40 million pages of old newspapers.

The library holds 52,000 national and international titles covering 300 years. Up until now, around 30,000 researchers a year have to go to Colindale in north London to scan through microfilm or hard copies.

Under the deal, brightsolid agreed to scan a minimum of 4 million pages within two years, with the project taking 10 years in total to scan 40 million pages of tabloids and broadsheets in the UK and Ireland.

Eventually, in-copyright material from the early to mid-20th century is expected to be included – should negotiations with rights-holders prove successful.

Brightsolid will also flog the content on its own GenesReunited site.

"We’ve unearthed national treasures at the turn of every page,” said brightsolid CEO Chris van der Kuyl.

“All human life is here; from a man who decided one day to walk round the world in an iron mask, to bullet-stopping corsets and the art of wide-sleeved shoplifting – this is a digital Aladdin's Cave.

"The 65 million stories available online today, taken together, are the story of the UK and we're adding 120,000 stories a day, so you’ll be able to uncover what life was like in your street, your town and your area.”

The British Library said today that its users will have free access to the material, while anyone visiting its website will be required to pay for the content.

Users of the site will be able to search for free previews of snippets of the material. Anyone wanting to download full articles and images can do so using pay-per-view access for 48 hours or 30 days. There's also an annual subscription package.

A Boston Spa, West Yorkshire-based newspaper storage facility will eventually house hard copy versions of the entire collection, which is currently stored in Colindale. The new space was built after the Department for Culture, Media and Sport coughed up £33m for the project.

“The British Newspaper Archive is a rich and hugely exciting resource, packed with historical detail. It’s a great example of the public and private sectors collaborating to deliver something that neither party could have delivered by themselves," said Culture Minister Ed Vaizey. ®


Other stories you might like

  • You need to RTFM, but feel free to use your brain too
    But I was only following the procedures!

    Who, Me? Monday is here, and with it a warning that steadfast determination to ignore instructions might not be such a silly thing after all. Welcome to Who, Me?

    Today's story comes from a reader Regomized as "Sam" and takes us back to his first proper IT job following his departure from the education system.

    Sam found himself on the mainframe operations team for a telecommunications company. The work was, initially, pretty manual stuff. The telco wasn't silly, and had its new recruits start by performing offline duties, such as gathering tapes and job tickets for batch runs, handling payslips, "basically anything involving a bit of leg work," he told us.

    Continue reading
  • Tropical island paradise ponders tax-free 'Digital Nomad Visa'
    Live and work in Bali, pay tax at home

    The government of Indonesia has once again raised the idea of creating a "digital nomad visa" that would allow foreign workers to live and work in the tropical paradise of Bali, tax free, for five years.

    The idea was raised before the COVID-19 pandemic, but understandably shelved as borders closed and the prospect of any digital nomads showing up dropped to zero.

    But in recent interviews Sandiaga Uno, Indonesia's minister for Tourism and the Creative Economy, said the visa was back on the drawing board.

    Continue reading
  • Small in Japan: Hitachi creates its own (modest) cloud
    VMware-powered sovereign cloud not going to challenge hyperscalers, but probably won't be the last such venture

    Hitachi has taken a modest step towards becoming a public cloud provider, with the launch of a VMware-powered cloud in Japan that The Register understands may not be its only such venture.

    The Japanese giant has styled the service a "sovereign cloud" – a term that VMware introduced to distinguish some of its 4,000-plus partners that operate small clouds and can attest to their operations being subject to privacy laws and governance structures within the nation in which they operate.

    Public cloud heavyweights AWS, Azure, Google, Oracle, IBM, and Alibaba also offer VMware-powered clouds, at hyperscale. But some organizations worry that their US or Chinese roots make them vulnerable to laws that might allow Washington or Beijing to exercise extraterritorial oversight.

    Continue reading
  • Beijing probes security at academic journal database
    It's easy to see why – the question is, why now?

    China's internet regulator has launched an investigation into the security regime protecting academic journal database China National Knowledge Infrastructure (CNKI), citing national security concerns.

    In its announcement of the investigation, the China Cyberspace Administration (CAC) said:

    Continue reading
  • Cerebras sets record for 'largest AI model' on a single chip
    Plus: Yandex releases 100-billion-parameter language model for free, and more

    In brief US hardware startup Cerebras claims to have trained the largest AI model on a single device powered by the world's largest Wafer Scale Engine 2 chip the size of a plate.

    "Using the Cerebras Software Platform (CSoft), our customers can easily train state-of-the-art GPT language models (such as GPT-3 and GPT-J) with up to 20 billion parameters on a single CS-2 system," the company claimed this week. "Running on a single CS-2, these models take minutes to set up and users can quickly move between models with just a few keystrokes."

    The CS-2 packs a whopping 850,000 cores, and has 40GB of on-chip memory capable of reaching 20 PB/sec memory bandwidth. The specs on other types of AI accelerators and GPUs pale in comparison, meaning machine learning engineers have to train huge AI models with billions of parameters across more servers.

    Continue reading
  • Zendesk sold to private investors two weeks after saying it would stay public
    Private offer 34 percent above share price is just the thing to change minds

    Customer service as-a-service vendor Zendesk has announced it will allow itself to be acquired for $10.2 billion by a group of investors led by private equity firm Hellman & Friedman, investment company Permira, and a wholly-owned subsidiary of the Abu Dhabi Investment Authority.

    The decision is a little odd, in light of the company's recent strategic review, announced on June, which saw the board unanimously conclude "that continuing to execute on the Company's strategic plan as an independent, public company is in the best interest of the Company and its stockholders at this time."

    That process saw Zendesk chat to 16 potential strategic partners and ten financial sponsors, including a group of investors who had previously expressed conditional interest in acquiring the company. Zendesk even extended its discussions with some parties but eventually walked away after "no actionable proposals were submitted, with the final bidders citing adverse market conditions and financing difficulties at the end of the process."

    Continue reading

Biting the hand that feeds IT © 1998–2022