Monday, 11 August 2014 12:56

Enterprise data hub gives best of both worlds: Cloudera


Is there an enterprise data hub in your IT future?

Cloudera's concept of the enterprise data hub addresses the potential of big data by providing a home for any type of data along with access mechanisms to support different workloads, Cloudera co-founder and CTO Amr Awadallah told iTWire.

Unlike traditional databases, Hadoop can store unstructured data such as images, videos and PDF files alongside structured data, he explained.

Where SQL can only express a subset of tasks that people may want to perform on these large data sets, Cloudera supports Apache Spark and other frameworks to allow a wider range of workloads. This allows organisations to leverage all of their data and ask better questions, Mr Awadallah said.

The Hadoop philosophy is that data should be consolidated into one platform, and applications moved close to the data. Mr Awadallah drew an analogy with the way the camera in a smartphone works: you take photographs, which are then stored in one place where they can be accessed by any app that can make use of them.

Previous ideas such as the enterprise data warehouse depended on the idea of a database that could only store structured data, but the modern environment isn't all structured. For example, organisations may want to combine data such as transactions, clickstreams and voice recordings to estimate customer sentiment, determine the probability of losing a particular customer, and decide whether to make a retention offer.

An important aspect of the enterprise data hub is support for both 'schema on write' and 'schema on read' in order to handle routine and exploratory workloads.

Schema on write (as with traditional databases) provides good performance as it is possible to lay out the data efficiently, as well as good governance.

Schema on read allows users to store any data as the system looks more like a file system than a database. It effectively performs ETL (extract, transform, load) on the fly at read time, generating the appropriate schema as part of the process. This means an additional column of data can be provided for analysis very quickly.
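The difference between the two approaches can be sketched in a few lines of Python. This is only an illustration, not Cloudera's or Hadoop's implementation: the event records, the field names and the helper functions `write_with_schema` and `read_with_schema` are all hypothetical. It shows that with schema on read, exposing an extra column means changing the read-time schema rather than reloading the data.

```python
import json

# Hypothetical raw events, stored exactly as they arrived.
RAW_EVENTS = [
    '{"customer": "c1", "amount": "19.99", "channel": "web"}',
    '{"customer": "c2", "amount": "5.00", "channel": "store", "coupon": "SPRING"}',
]

# Hypothetical schema fixed before any data is stored: (field name, type) pairs.
WRITE_SCHEMA = (("customer", str), ("amount", float))


def write_with_schema(raw_lines, schema):
    """Schema on write: parse and convert each record once, at load time.

    Only the fields named in the schema are kept; anything else is dropped.
    """
    table = []
    for line in raw_lines:
        record = json.loads(line)
        table.append(tuple(cast(record[name]) for name, cast in schema))
    return table


def read_with_schema(raw_lines, schema, default=None):
    """Schema on read: apply a schema at query time to data kept in raw form."""
    for line in raw_lines:
        record = json.loads(line)
        yield tuple(
            cast(record[name]) if name in record else default
            for name, cast in schema
        )


# Routine workload: data is laid out once according to the fixed schema.
stored_table = write_with_schema(RAW_EVENTS, WRITE_SCHEMA)

# Exploratory workload: a new question needs the "coupon" column, so only
# the schema used at read time changes; the stored raw data is untouched.
read_schema = WRITE_SCHEMA + (("coupon", str),)
exploratory_rows = list(read_with_schema(RAW_EVENTS, read_schema))
```

In this sketch the schema-on-write table has already discarded the coupon field, so adding it would mean changing the schema and reloading, while the schema-on-read path picks it up on the next query at the cost of parsing the raw data every time.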

"You want both," he said, likening the two situations to two very different types of commercial kitchen. The kitchen at a McDonald's store is optimised to prepare the same limited range of items every day, whereas that at a high-end restaurant has a range of ingredients and equipment allowing the preparation of dozens of different dishes.

Cloudera was the first commercial Hadoop vendor, Mr Awadallah said, adding that it "is the world leader" ahead of Hortonworks, MapR, Pivotal and IBM.

The company has more experience than anyone else, he said, and the founders of the major Hadoop projects work for Cloudera.

"We use our own technology" to monitor the operation of customers' systems, so Cloudera can quickly correlate the relevant data if someone reports a problem.

Cloudera combines open source components with its proprietary technology for backup and recovery, security and auditing. Furthermore, the company certifies that each of its releases interoperates with a large ecosystem of applications such as SAS and Splunk, he said.

"None of the other vendors have this breadth and depth," Mr Awadallah said.



Stephen Withers


Stephen Withers is one of Australia's most experienced IT journalists, having begun his career in the days of 8-bit 'microcomputers'. He covers the gamut from gadgets to enterprise systems. In previous lives he has been an academic, a systems programmer, an IT support manager, and an online services manager. Stephen holds an honours degree in Management Sciences and a PhD in Industrial and Business Studies.
