12 simple rules: How Ted Codd transformed the humble database

Near misses and lucky escapes for a multi-billion-dollar baby


Ted Codd Photo courtesy: IBM

Stonebraker and Ellison go after IBM's dropped ball

Fortunately for us, others weren’t so flat-footed and relational was snatched from the jaws of defeat by those outside of IBM.

The two people who saw relational's potential were a Berkeley comp sci assistant professor and later serial entrepreneur called Michael Stonebraker – who built Ingres – and an obscure salesman named Larry Ellison, who first had a crack at RDBs at Software Development Laboratories (which later became Relational Software Inc and then Oracle). Both saw the opportunity, and they jumped in with their own relational database products.

The glory years were the 1980s and 1990s, when data was simple and structured, and the market was inflated by the explosion in the personal computer. On the back of PCs there were servers and storage, and with the creation of data came need to cleanse, analyse and understand that data – it was the time of business intelligence and of the data warehouse.

SQL was ratified as a vendor-neutral standard with the American National Standards Institute (ANSI) in 1986 and six years later saw publication of what’s considered the breakthrough SQL spec, which managed to fill a number of gaps.

Relational blossomed as it sucked on the data of the time: sales numbers, customer stats – text-based information. Relational imposed a rigid structure, but within it was the freedom and fluidity that everybody needed.

As the 2000s approached, relational faced its next challenge: unstructured and non-text-based data – audio, video, graphics – plus the desire of corporate folk to quickly retrieve, serve and understand that data on a huge scale.

SQL was judged too slow thanks to its centralised structure, and a new movement gave birth to NoSQL – databases that use a key value to describe an object and which are therefore known as key-value stores.

Key-value stores are capable of recognising non-text data – music, video, graphics and so on. These stores include Cassandra, Cloudant, CouchDB, SimpleDB and Google AppEngine DataStore.

The use of NoSQL has become widespread thanks to the rise of users like Google, Amazon clouds, Facebook, LinkedIn and Twitter, which serve and store huge amounts of unstructured data using distributed servers.

On the eve of the anniversary of Codd's birth, NoSQL advocates will gather not far from where Codd worked in San Jose, California, for NoSQL Now!.

But for all its advantages there are problems.

There’s no common NoSQL standard: you build on a database-by-database basis. The databases have either been developed by big companies or by database and programming whizzes and are still growing up in terms of their ease of use and management for ordinary end users.

Also, having eschewed SQL, many of first-generation NoSQL data stores voluntarily surrendered the ACID properties that are the hallmarks of relational and made it so suited to business.

But later NoSQL startups have re-embraced the faith.

The relational database providers, meanwhile, have responded. They've sped up their databases employing ideas such as memory processing using caching instead of disks for greater speed while adding support for non-relational architectures like Hadoop – the non-Google spin on BigTable.

Unnatural acts

On the 90th anniversary of Codd’s birth, his baby is big. Yet, as is characteristic with its birth, relational faces uncertainty.

Such is the health of relational that Oracle, IBM and Microsoft's are all growing – with even enterprise-resource-planning software-maker SAP now claiming its own relational business based on in-memory system Hana. And with so much relationally stored data now in existence, Codd’s technology is safe for at least until the 180th anniversary of this birth.

But change, as always, is inevitable.

As Michael Stonebraker, whose latest venture is a Big Data analytics punt with the SciDB NoSQL database, told me in 2010: “In the 1980s, the 'answer' was if all you wanted to do was business data processing, then it was relational databases.” Trying to stretch relational today, though, is an "unnatural act". ®

* Yes, we know Wikipedia says Codd was born on 23 August, 1923, but his Turning Award entry, here, says it was 19 August. The Reg checked with his former employer IBM whose archives department got in touch to say their files had 19 August. We'll go with that over Jimbo Wales... and consider it a warning Wikipedia's veracity.


Other stories you might like

  • Florida's content-moderation law kept on ice, likely unconstitutional, court says
    So cool you're into free speech because that includes taking down misinformation

    While the US Supreme Court considers an emergency petition to reinstate a preliminary injunction against Texas' social media law HB 20, the US Eleventh Circuit Court of Appeals on Monday partially upheld a similar injunction against Florida's social media law, SB 7072.

    Both Florida and Texas last year passed laws that impose content moderation restrictions, editorial disclosure obligations, and user-data access requirements on large online social networks. The Republican governors of both states justified the laws by claiming that social media sites have been trying to censor conservative voices, an allegation that has not been supported by evidence.

    Multiple studies addressing this issue say right-wing folk aren't being censored. They have found that social media sites try to take down or block misinformation, which researchers say is more common from right-leaning sources.

    Continue reading
  • US-APAC trade deal leaves out Taiwan, military defense not ruled out
    All fun and games until the chip factories are in the crosshairs

    US President Joe Biden has heralded an Indo-Pacific trade deal signed by several nations that do not include Taiwan. At the same time, Biden warned China that America would help defend Taiwan from attack; it is home to a critical slice of the global chip industry, after all. 

    The agreement, known as the Indo-Pacific Economic Framework (IPEF), is still in its infancy, with today's announcement enabling the United States and the other 12 participating countries to begin negotiating "rules of the road that ensure [US businesses] can compete in the Indo-Pacific," the White House said. 

    Along with America, other IPEF signatories are Australia, Brunei, India, Indonesia, Japan, South Korea, Malaysia, New Zealand, the Philippines, Singapore, Thailand and Vietnam. Combined, the White House said, the 13 countries participating in the IPEF make up 40 percent of the global economy. 

    Continue reading
  • 381,000-plus Kubernetes API servers 'exposed to internet'
    Firewall isn't a made-up word from the Hackers movie, people

    A large number of servers running the Kubernetes API have been left exposed to the internet, which is not great: they're potentially vulnerable to abuse.

    Nonprofit security organization The Shadowserver Foundation recently scanned 454,729 systems hosting the popular open-source platform for managing and orchestrating containers, finding that more than 381,645 – or about 84 percent – are accessible via the internet to varying degrees thus providing a cracked door into a corporate network.

    "While this does not mean that these instances are fully open or vulnerable to an attack, it is likely that this level of access was not intended and these instances are an unnecessarily exposed attack surface," Shadowserver's team stressed in a write-up. "They also allow for information leakage on version and build."

    Continue reading
  • A peek into Gigabyte's GPU Arm for AI, HPC shops
    High-performance platform choices are going beyond the ubiquitous x86 standard

    Arm-based servers continue to gain momentum with Gigabyte Technology introducing a system based on Ampere's Altra processors paired with Nvidia A100 GPUs, aimed at demanding workloads such as AI training and high-performance compute (HPC) applications.

    The G492-PD0 runs either an Ampere Altra or Altra Max processor, the latter delivering 128 64-bit cores that are compatible with the Armv8.2 architecture.

    It supports 16 DDR4 DIMM slots, which would be enough space for up to 4TB of memory if all slots were filled with 256GB memory modules. The chassis also has space for no fewer than eight Nvidia A100 GPUs, which would make for a costly but very powerful system for those workloads that benefit from GPU acceleration.

    Continue reading
  • GitLab version 15 goes big on visibility and observability
    GitOps fans can take a spin on the free tier for pull-based deployment

    One-stop DevOps shop GitLab has announced version 15 of its platform, hot on the heels of pull-based GitOps turning up on the platform's free tier.

    Version 15.0 marks the arrival of GitLab's next major iteration and attention this time around has turned to visibility and observability – hardly surprising considering the acquisition of OpsTrace as 2021 drew to a close, as well as workflow automation, security and compliance.

    GitLab puts out monthly releases –  hitting 15.1 on June 22 –  and we spoke to the company's senior director of Product, Kenny Johnston, at the recent Kubecon EU event, about what will be added to version 15 as time goes by. During a chat with the company's senior director of Product, Kenny Johnston, at the recent Kubecon EU event, The Register was told that this was more where dollars were being invested into the product.

    Continue reading
  • To multicloud, or not: Former PayPal head of engineering weighs in
    Not everyone needs it, but those who do need to consider 3 things, says Asim Razzaq

    The push is on to get every enterprise thinking they're missing out on the next big thing if they don't adopt a multicloud strategy.

    That shove in the multicloud direction appears to be working. More than 75 percent of businesses are now using multiple cloud providers, according to Gartner. That includes some big companies, like Boeing, which recently chose to spread its bets across AWS, Google Cloud and Azure as it continues to eliminate old legacy systems. 

    There are plenty of reasons to choose to go with multiple cloud providers, but Asim Razzaq, CEO and founder at cloud cost management company Yotascale, told The Register that choosing whether or not to invest in a multicloud architecture all comes down to three things: How many different compute needs a business has, budget, and the need for redundancy. 

    Continue reading

Biting the hand that feeds IT © 1998–2022