Oh no, you're thinking, yet another cookie pop-up. Well, sorry, it's the law. We measure how many people read us, and ensure you see relevant ads, by storing cookies on your device. If you're cool with that, hit “Accept all Cookies”. For more info and to customise your settings, hit “Customise Settings”.

Review and manage your consent

Here's an overview of our use of cookies, similar technologies and how to manage them. You can also change your choices at any time, by hitting the “Your Consent Options” link on the site's footer.

Manage Cookie Preferences
  • These cookies are strictly necessary so that you can navigate the site as normal and use all features. Without these cookies we cannot provide you with the service that you expect.

  • These cookies are used to make advertising messages more relevant to you. They perform functions like preventing the same ad from continuously reappearing, ensuring that ads are properly displayed for advertisers, and in some cases selecting advertisements that are based on your interests.

  • These cookies collect information in aggregate form to help us understand how our websites are being used. They allow us to count visits and traffic sources so that we can measure and improve the performance of our sites. If people say no to these cookies, we do not know how many people have visited and we cannot monitor performance.

See also our Cookie policy and Privacy policy.

Reluctant Wikipedia lifts lid on $2.5m internet search engine project

It simply ain't true – until Mr Wales publicly denies it


The Wikimedia Foundation has finally disclosed details of its controversial Knowledge Engine grant – and it confirms that Wikipedia is getting seriously into search, despite Jimmy Wales' categorical denial that WMF is “doing a Google”.

Wikimedia’s reluctance to detail the restricted grant, from the Knight Foundation, was a factor in the departure of community-elected WMF board member James Heilman in December.

After a Wikipedia signpost article, and coverage at El Reg this week, the WMF caved and posted the Knight Foundation's approval of the $250,000 grant. It's the first stage of a project costed at $2.5m.

The grant provides seed money for stage one of the Knowledge Engine, described as “a system for discovering reliable and trustworthy information on the Internet”. It’s all about search and federation.

The discovery stage includes an exploration of prototypes of future versions of Wikipedia.org which are “open channels” rather than an encyclopaedia, analysing the query-to-content path, and embedding the Wikipedia Knowledge Engine “via carriers and Original Equipment Manufacturers”.

The biggest risk, according to the risk assessment section, is that Google or Yahoo! “could suddenly devote resources to a similar project”. The Discovery phase will take 12 months, lasting until 31 August this year. Knight and the recipient had only provided sketchy details to date.

Jimmy Wales had categorically denied community concerns that WMF was developing a search engine via the Knowledge Engine, calling the suggestion “a lie”.

Andreas Kolbe, who pieced together the Knowledge Engine story for a Wikipedia Signpost article this week, says the details of the grant have come as a surprise.

“Its gung-ho 'We're building a search engine!' content is a bit of a bombshell for the volunteer community," said Kolbe. "They were led to believe it was just about getting a central search function to find stuff spread out across the various Wikimedia sites, with OpenStreetMap thrown in perhaps… Volunteers feel WMF management has purposely kept them out of the loop.”

At least two of the six "conceptual directions for discovery", aka redesigns of the Wikipedia front page, sure look like Google-style search engines to us. ®


Other stories you might like

  • Biotech firm: Graphcore IPUs faster for AI-based drug discovery than GPUs
    Someone's got to keep, say, Nvidia on its toes

    In the race to provide the best machine-learning accelerators, one of Nvidia's top challengers has claimed a victory in the biotech space, London firm LabGenius, which said Graphcore's intelligence processing units (IPUs) provide significantly faster performance for AI-based drug discovery than some unidentified traditional GPUs.

    Founded in 2012, LabGenius is a venture-backed company that develops antibody treatments for cancer and inflammatory diseases by leaning on machine-learning algorithms and laboratory automation to discover proteins that have the right qualities to treat medical conditions.  

    In a blog post to be published Thursday, and seen by The Register, Bristol, UK-based Graphcore is set to reveal LabGenius turned to its IPUs to train a BERT Transformer model on a large data set of existing proteins to predict masked amino acids. This, in turn, we're told, helped LabGenius suss out important protein features that can help it develop new therapies.

    Continue reading
  • So, what happened with GitHub, Heroku, and those raided private repos?
    Who knew what when and what did they do?

    Analysis GitHub says it has identified and alerted developers who have had their private repositories accessed and downloaded via stolen authentication tokens.

    In this multifaceted fiasco, Microsoft-owned GitHub insisted its security was not breached. Instead, we're told, "compromised OAuth user tokens from Heroku and Travis-CI-maintained OAuth applications were stolen and abused to download private repositories belonging to dozens of victim organizations that were using these apps."

    Salesforce-owned Heroku confirmed someone compromised an OAuth token – presumably an internal staffer's token – to get into Heroku's GitHub account and rifle through, and potentially update, users' GitHub repositories "using OAuth tokens issued to Heroku’s OAuth integration dashboard hosted on GitHub."

    Continue reading
  • Apple geniuses in Atlanta beat New York to the punch, file petition to unionize
    Cannot confirm reports of Tim Cook on the Midnight Train to Georgia

    Workers in Atlanta, Georgia, have become the first US Apple Store staff to file an official request to hold an union vote with America's National Labor Relations Board.

    "A number of us have been here for many years, and we don't think you stick at a place unless you love it," said Derrick Bowles, Apple Genius and Communications Workers of America (CWA) organizer.

    "Apple is a profoundly positive place to work, but we know that the company can better live up to their ideals and so we're excited to be joining together with our coworkers to bring Apple to the negotiating table and make this an even better place to work."

    Continue reading
  • Five Eyes nations fear wave of Russian attacks against critical infrastructure
    If this is surprising to operators, we are doomed

    The Five Eyes nations' cybersecurity agencies this week urged critical infrastructure to be ready for attacks by crews backed by or sympathetic to the Kremlin amid strong Western opposition to Russia's invasion of Ukraine.

    The joint alert, issued by cybersecurity authorities in the US, UK, Australia, Canada and New Zealand, provides technical details on more than a dozen Russian state-sponsored hacking groups and Russia-aligned cybercrime gangs. 

    The missive urges critical infrastructure organizations to take immediate actions to protect against cyberattacks from these foes. These steps include patching known exploited vulnerabilities, updating software, enforcing multi-factor authentication, securing and monitoring remote desktop protocol (RDP) and other "potentially risky" services, and providing end-user security awareness and training. If this action is truly surprising to critical infrastructure operators, we're screwed.

    Continue reading
  • Brave, DuckDuckGo to unplug Google's AMP where possible
    Webpage acceleration tech deemed harmful by rivals

    Brave, the browser maker, and DuckDuckGo, the web search service, have both taken aim at AMP, Google's controversial web publishing framework.

    Brave on Tuesday introduced a feature called De-AMP that lets those using the Brave browser avoid Google-hosted AMP pages and go straight to publisher content on standard web pages.

    De-AMP will rewrite fetched web pages that link to AMP pages so their links point to the publisher versions of those pages. Brave's browser, which uses a modified version of Google Chrome's Chromium, will watch for when AMP pages are being loaded and stop them before they can render. Doing so prevents Google’s AMP scripts and images from being fetched and executed while also reducing the ad giant's ability to see web traffic and understand where it's going.

    Continue reading

Biting the hand that feeds IT © 1998–2022