​The hybrid cloud wants your data

Work in development, analytics, data science? – this is for you

Got Tips?

Promo A coding competition is coming soon, brought to you by The Register in partnership with IBM. For now all we can tell you is that IBM's Cloudant platform will play a part in the proceedings.

So let's get the ball rolling with this call to developers, data science professionals and IT analytic architects to sign up now for a free IBM Cloudant fully managed cloud services trial.

You can find out more about IBM Cloudant here. But in the meantime here is a handy executive summary for you.

So what is Cloudant?

In 2016, it's difficult to have a conversation about data without discussing analytics. The pressure is on to massage data and ask it questions rather than simply using it for transactions. That’s all well and good, but many companies don’t have the tools or the computing muscle.

Compromise comes in the form of hybrid cloud infrastructure services. IBM is one firm working to solve the problem, creating environments that give customers the tools to work on their own data sets in the cloud while keeping them in sync with local versions.

The company has grown its Cloud Data Services portfolio through a mixture of organic development and acquisition. It relies heavily on open source systems such as Apache Spark, making it easier to integrate and transfer data sets and applications between 25 different tools. The online platform also provides a selection of post-relational NoSQL databases, designed to support the development of web-scale apps processing many different data types.

One of the linchpins of IBM's Cloud Data Services offering is Cloudant, described as a 'distributed database as a service' offering, in February 2014. The company, founded in 2009, based its NoSQL database on Apache CouchDB. It brings to bear a range of features designed to support customers working on hybrid cloud-based analytics and applications:

Data synchronization

Cloudant gave developers the opportunity to replicate data from on-premise systems to IBM's BlueMix-based cloud data environment, and then to keep it synchronized. Developers distribute read/write-capable copies of their data across multiple computing instances, and can rely on the technology to synchronize them over intermittent connections.

Changes to the data are synchronized behind the scenes so that all sites are kept up to date, opening up several use cases for developers and operations staff. Updates to a single instance of Cloudant's Apache CouchDB engine could be replicated to a database cluster, for example, or changes to a cluster could be replicated to a remote datacenter running an identical one.

Developers can design applications to manipulate data offline and then resync it automatically using IBM’s service, leading to what the firm has labelled 'offline-first' design. This makes it easier to manipulate data in the cloud using the tools available in IBM’s Cloud Data marketplace, without creating discrepancies with locally-stored data sets.

Native JSON storage

Cloudant is a NoSQL database that goes beyond the traditional relational SQL-based store that you'll find in legacy relational database management system tools. Instead of maintaining data in tables that have to be rigidly joined together, it stores documents in a variety of formats, including plain text and JavaScript Object Notation (JSON).

When it comes to data storage, JSON is the data storage language of the web. It is a self-describing format, storing data elements in human-readable key pairs (eg ‘Name: Robin Birtstone’, ‘Publication: The Register’). This is stored in a tree-like structure that allows elements to be nested.

The big benefit of JSON databases are their flexibility. A traditional RDBMS forces developers to specify the data storage schema up front, making it difficult to change. Adding fields to relational tables is a big deal. The links between the database schema and the application are brittle and fragile. It’s a high-maintenance scenario.

Conversely, JSON users can easily add data fields on the fly, making it easier to update their data structures to support new features in their applications with no heavy lifting.

Advanced search capabilities

Cloudant also gave IBM another key capability: advanced search capabilities. Traditional databases often require specialized add-on modules for the storage and searching of particular types of data, such as geospatial information. Cloudant stores geospatial in GeoJSON, a dialect of JSON designed to handle geospatial properties such as co-ordinates, shapes, and geographical features. The platform includes indexing algorithms optimized for this data type.

Developers and data analysts can use search terms designed specifically for geospatial data types that you’d normally find in niche GIS software, including descriptions of shapes and geometric relationships. The results are viewable on interactive maps provided by Mapbox, which is a mapping service aimed at developers.

In additional to geospatial data, Cloudant also supports full-text searches using its Lucerne indexing technology. Developers have traditionally stored full text separately to transactional data, often creating duplicates and imposing a synchronization overhead. Instead, Lucerne lets them handle the full text natively, describing how they want to index text at a deep level, defining things like word break rules and enabling them to customize for specific languages. Full-text search types include phrase queries, fuzzy matching and proximity queries, and developers can access search capabilities via a RESTful API for simplicity.

Deployment flexibility

Hybrid cloud solutions need flexible deployment, so it is important for developers and operations staff to have a range of deployment options. Cloudant was engineered to be installed in a variety of different models.

These models range from a fully managed service where engineers maintain the database for a customer offsite, through to on-premise deployments in the customer’s own data center, via the Cloudant Local option introduced in October 2014. This would be suitable for more sensitive data. There are also options to deploy native libraries for mobile users on either iOS or Android.

A platform for future data-driven development

The ability to deploy across different environments in different formats, all while keeping data synchronized, also creates a foundation for DevOps. The ability to change data schemas quickly and support new application features is a useful component in agile development, but it needs a stable architecture that takes care of basic synchronization tasks behind the scenes. Having these two things in place allows DevOps teams to focus on tasks such as provisioning, and the smooth introduction of frequent application updates.

We've come rather a long way from the cumbersome, on-premise business intelligence systems we used to grapple with. Now, customers need to work out how they can use the platform to create new kinds of app. Talking to your line of business managers is a good starting point, to see what their problems are – and where the potential opportunities lie.

You can find out more about IBM Cloudant here, and sign up now for a free IBM Cloudant fully managed cloud services trial here.

SUBSCRIBE TO OUR WEEKLY TECH NEWSLETTER


Keep Reading

Microsoft sides with Epic over Apple developer ban, supports motion for temporary restraining order

'Apple’s discontinuation of Epic’s ability to develop and support Unreal Engine for iOS or macOS will harm game creators and gamers,' says Microsoft

Leaked benchmarks from developer kit for Apple's home-baked silicon appear to give Microsoft a run for its money

Before you get too excited 1) They're benchmarks 2) New consumer Arm-based Macs might use something else

Apple funnels Worldwide Developer Conference 2020 through iOS app, website amid coronavirus lockdowns

Hey, finally an Apple event The Reg can attend, sorta, right?

Node.js community finally prodded to patch Chromium XHR bug after developer refuses to let flaw stand

If at first you don't succeed, try, try... try, try, try... try again

Android 11 Developer Preview 3 allows your mobe to become a router via USB Ethernet – if you can get a decent signal

Plus: Multitasking and other functions rejigged, but no promises they'll stick

ReactOS hits a milestone – actually hiring a full-time developer. And we've got our talons on the latest build to see what needs fixing

Open-source Windows lookalike aims to fix its 'long neglected' storage stack

Some lucky web developer just scored $20k to scour Facebook out of Neil Young’s website

Plans to ‘disengage from the corrupt social platform’ that ‘is screwing with our election’

Firefox 78: Protections dashboard, new developer features... and the end of the line for older macOS versions

New Regular Expression evaluator borrowed from Chromium V8 – a sign of more collaboration to come?

Tech Resources

Webcast Slide Deck | Three reasons you need a hybrid multicloud

Businesses need their IT teams to operate applications and data in a hybrid environment spanning on-premises private and public clouds. But this poses many challenges, such as managing complex networking, re-architecting applications for the cloud, and managing multiple infrastructure silos. There is a pressing need for a single platform that addresses these challenges - a hybrid multicloud built for the digital innovation era. Just this Regcast to find out: Why hybrid multicloud is the ideal path to accelerate cloud migration.

The Enterprise Buyer’s Guide for FIDO Credentials

Choosing secure credentials for your organization is a balancing act. This guide will help you navigate the complexity of the credential selection process.

Digital Document Processes in 2020: A Spotlight On APAC

In the wake of the global pandemic, online interactions augment and, in many instances, they replace in-person interactions. To continue to service their customers and enable employee collaboration, companies in the Asia Pacific (APAC) region are forced to reassess their digital capabilities, taking digital document processes to the forefront.

Webcast Slide Deck | Secure Paperless Communications

We’ve been talking about paperless communications since the dawn of digital, but we haven’t got the job done yet. That can lead to complications with remote workforces and a stalled digital transformation. If this is you, don’t beat yourself up: going digital with secure communications is about more than flipping a switch. From ensuring compliance to justifying the ROI to a procurement team, there’s a lot to think about.

Biting the hand that feeds IT © 1998–2020