Private cloud completes Cloudera's data platform vision, but it has turned a corner into even fiercer competition

Where the big dogs roam


Cloudera has introduced support for the private cloud - built on the Red Hat OpenShift platform and supported by Kubernetes container orchestration - to elbow its way into a whole new world of competition.

The erstwhile Hadoop-shifter reckons the move will let customers run data services from a single integrated platform via public, private and hybrid clouds. It will be able to funnel data into reporting, analytics and machine learning that draws from enterprise applications, data warehouse, desktop systems and external data sources.

The data platform would also offer the same security, privacy and governance across cloud instances, Cloudera claimed.

Arun Murthy, chief product officer, told us that extending the environment across all cloud instances would "accelerate time value and ensures critical workloads meet their SLAs".

If the system works as advertised, it means users can manage their data in a private cloud but spin up workloads in the public cloud where necessary and appropriate, from within the same platform.

Adam Ronthal, Gartner research VP for data and analytics, said Cloudera's private cloud support might appeal to organisations yet to fully embrace the public cloud model due to security, governance, or data sovereignty concerns.

Other providers are also playing to this market, including Oracle Cloud at Customer, AWS Outposts and Azure Stack, he said.

"Cloudera starts from a position of on-premises strength while the cloud services providers are obviously strong in their own environments. Which one of these is more appealing is going to depend on where an organisation's centre of data gravity is today – cloud or on-prem – and where they expect that to be in the next two to three years," he said.

The decision would also be based on which vendors the user organisation is prepared to commit to in the long term. To start with, Cloudera's hybrid cloud data platform is most likely to appeal to eisting punters and those that need a hybrid cloud strategy, addded Ronthal.

"It provides perhaps the easiest lift-and-shift from an on-premises approach into the cloud and it provides cloud portability. It is generally best for end-users that have an existing strategic commitment to an independent software vendor's (ISV) offering. On the other hand, it may require a little bit more effort to extend that into the surrounding cloud service provider's ecosystem, and it represents an explicit decision to select a point of vendor lock-in. You're locking into the ISV, rather than the cloud service provider, but no matter what you do, you're going to have some degree of lock-in," Ronthal said.

Bigger pond

But Cloudera could struggle as merely a smaller component of a broader analytics landscape, where it's going to be competing with cloud-native approaches. Cloud vendors are already expanding their catalogue: there's Azure Synapse analytics, as well as cloud-native Hadoop products like Amazon EMR, GCP Cloud Dataproc and Microsoft HDInsight, each of which has their own tight integration points with their surrounding cloud ecosystem.

Cloudera's fortunes have been improving since it published disastrous Q1 results in 2019, after Hadoop – the big data file system it was built around – went out of favour and it struggled with the merger of former rival Hortonworks. The first quarter for fiscal 2021 ended 30 April saw revenue of $210m, an increase of 12 per cent as compared with the first quarter of fiscal 2020.

The improved performance reflected better management of the combined company, and growing acceptance of its Cloudera Data Platform. Whether it continues on this path will depend on how it fares against a whole jungle of ISVs competing in the same space, some fairly new to the market, such as Dataiku or Talend, and some with a deep history, like Informatica.

All these vendors are trying to get products in a sweet spot of meeting the users' multi-cloud data management needs beyond the services built into the cloud vendors' environments.

"What these third-party approaches are trying to do is push on the line of 'good enough'," said Ronthal. "When the users buy into that, it is because they feel can get additional value from an independent offering."

But the competition in the data platform market is fierce, with a long list of vendors vying for users who want more than "good enough". Meanwhile, the cloud infrastructure providers have deep pockets and continue to invest in their data platform products. Cloudera has turned a corner, but it may yet find obstacles in the route to fatter sales. ®


Other stories you might like

  • A peek into Gigabyte's GPU Arm for AI, HPC shops
    High-performance platform choices are going beyond the ubiquitous x86 standard

    Arm-based servers continue to gain momentum with Gigabyte Technology introducing a system based on Ampere's Altra processors paired with Nvidia A100 GPUs, aimed at demanding workloads such as AI training and high-performance compute (HPC) applications.

    The G492-PD0 runs either an Ampere Altra or Altra Max processor, the latter delivering 128 64-bit cores that are compatible with the Armv8.2 architecture.

    It supports 16 DDR4 DIMM slots, which would be enough space for up to 4TB of memory if all slots were filled with 256GB memory modules. The chassis also has space for no fewer than eight Nvidia A100 GPUs, which would make for a costly but very powerful system for those workloads that benefit from GPU acceleration.

    Continue reading
  • GitLab version 15 goes big on visibility and observability
    GitOps fans can take a spin on the free tier for pull-based deployment

    One-stop DevOps shop GitLab has announced version 15 of its platform, hot on the heels of pull-based GitOps turning up on the platform's free tier.

    Version 15.0 marks the arrival of GitLab's next major iteration and attention this time around has turned to visibility and observability – hardly surprising considering the acquisition of OpsTrace as 2021 drew to a close, as well as workflow automation, security and compliance.

    GitLab puts out monthly releases –  hitting 15.1 on June 22 –  and we spoke to the company's senior director of Product, Kenny Johnston, at the recent Kubecon EU event, about what will be added to version 15 as time goes by. During a chat with the company's senior director of Product, Kenny Johnston, at the recent Kubecon EU event, The Register was told that this was more where dollars were being invested into the product.

    Continue reading
  • To multicloud, or not: Former PayPal head of engineering weighs in
    Not everyone needs it, but those who do need to consider 3 things, says Asim Razzaq

    The push is on to get every enterprise thinking they're missing out on the next big thing if they don't adopt a multicloud strategy.

    That shove in the multicloud direction appears to be working. More than 75 percent of businesses are now using multiple cloud providers, according to Gartner. That includes some big companies, like Boeing, which recently chose to spread its bets across AWS, Google Cloud and Azure as it continues to eliminate old legacy systems. 

    There are plenty of reasons to choose to go with multiple cloud providers, but Asim Razzaq, CEO and founder at cloud cost management company Yotascale, told The Register that choosing whether or not to invest in a multicloud architecture all comes down to three things: How many different compute needs a business has, budget, and the need for redundancy. 

    Continue reading

Biting the hand that feeds IT © 1998–2022