Big Data Market Segment LS
Big Data Market Segment RS
Monday, 11 August 2014 12:56

Enterprise data hub gives best of both worlds: Cloudera


Is there an enterprise data hub in your IT future?

Cloudera's concept of the enterprise data hub addresses the potential of big data by providing a home for any type of data along with access mechanisms to support different workloads, Cloudera co-founder and CTO Amr Awadallah told iTWire.

Unlike traditional databases, Hadoop can store unstructured data such as images, videos and PDF files alongside structured data, he explained.

Where SQL can only express a subset of tasks that people may want to perform on these large data sets, Cloudera supports Apache Spark and other frameworks to allow a wider range of workloads. This allows organisations to leverage all of their data and ask better questions, Mr Awadallah said.

The Hadoop philosophy is that data should be consolidated into one platform, and applications moved close to the data. Mr Awadallah drew an analogy with the way the camera in a smartphone works: you take photographs, which are then stored in one place where they can be accessed by any app that can make use of them.

Previous ideas such as the enterprise data warehouse depended on the idea of a database that could only store structured data, but the modern environment isn't all structured. For example, organisations may want to combine data such as transactions, clickstreams and voice recordings to estimate customer sentiment, determine the probability of losing a particular customer, and decide whether to make a retention offer.

An important aspect of the enterprise data hub is support for both 'schema on write' and 'schema on read' in order to handle routine and exploratory workloads.

Schema on write (as with traditional databases) provides good performance as it is possible to lay out the data efficiently, as well as good governance.

Schema on read allows users to store any data as the system looks more like a file system than a database. It effectively performs ETL (extract, transform, load) on the fly at read time, generating the appropriate schema as part of the process. This means an additional column of data can be provided for analysis very quickly.

"You want both," he said, likening the two situations to two very different types of commercial kitchen. The kitchen at a McDonald's store is optimised to prepare the same limited range of items every day, whereas that at a high-end restaurant has a range of ingredients and equipment allowing the preparation of dozens of different dishes.

Cloudera was the first commercial Hadoop vendor, Mr Awadallah said, adding that it "is the world leader" ahead of Hortonworks, MapR, Pivotal and IBM.

The company has more experience than anyone else, he said, and the founders of the major Hadoop projects work for Cloudera.

"We use our own technology" to monitor the operation of customers' systems, so Cloudera can quickly correlate the relevant data if someone reports a problem.

Cloudera combines open source components with its proprietary technology for backup and recovery, security and auditing. Furthermore, the company certifies each of its releases interoperates with a large ecosystem of applications such as SAS and Splunk, he said.

"None of the other vendors have this breadth and depth," Mr Awadallah said.

Subscribe to ITWIRE UPDATE Newsletter here


The much awaited iTWire Shop is now open to our readers.

Visit the iTWire Shop, a leading destination for stylish accessories, gear & gadgets, lifestyle products and everyday portable office essentials, drones, zoom lenses for smartphones, software and online training.

PLUS Big Brands include: Apple, Lenovo, LG, Samsung, Sennheiser and many more.

Products available for any country.

We hope you enjoy and find value in the much anticipated iTWire Shop.



iTWire TV offers a unique value to the Tech Sector by providing a range of video interviews, news, views and reviews, and also provides the opportunity for vendors to promote your company and your marketing messages.

We work with you to develop the message and conduct the interview or product review in a safe and collaborative way. Unlike other Tech YouTube channels, we create a story around your message and post that on the homepage of ITWire, linking to your message.

In addition, your interview post message can be displayed in up to 7 different post displays on our the site to drive traffic and readers to your video content and downloads. This can be a significant Lead Generation opportunity for your business.

We also provide 3 videos in one recording/sitting if you require so that you have a series of videos to promote to your customers. Your sales team can add your emails to sales collateral and to the footer of their sales and marketing emails.

See the latest in Tech News, Views, Interviews, Reviews, Product Promos and Events. Plus funny videos from our readers and customers.


Stephen Withers

Stephen Withers is one of Australia¹s most experienced IT journalists, having begun his career in the days of 8-bit 'microcomputers'. He covers the gamut from gadgets to enterprise systems. In previous lives he has been an academic, a systems programmer, an IT support manager, and an online services manager. Stephen holds an honours degree in Management Sciences and a PhD in Industrial and Business Studies.

Share News tips for the iTWire Journalists? Your tip will be anonymous




Guest Opinion

Guest Interviews

Guest Reviews

Guest Research

Guest Research & Case Studies

Channel News