Data Storage Today

CIO Today Network Sites:   Top Tech News  |   CIO Today   |   Mobile Tech Today   |   Data Storage Today
News & Information for Data Storage Professionals
Commvault Simpana® 10
Protect, manage, access, and
realize the untapped value of data.

www.commvault.com
Monday, May 20th 
Stingray Traffic Manager on Amazon Web Services
Home
Data Centers
Storage Solutions
Storage Networks
Data Storage Issues
Data Security
Enterprise I.T.
DST Press Releases
 
Free Newsletters
Top CIO News
 
Mobile Tech Today
 

Advertisement

Storage Solutions

Cloudera Announces Real-Time Query Engine for Hadoop

Cloudera Announces Real-Time Query Engine for Hadoop
October 24, 2012 1:56PM

Bookmark and Share
Cloudera's Apache-licensed, open-source query engine, Cloudera Impala, is specifically designed for real-time query of data stored in a Hadoop Distributed File System, or HDFS, and in HBase, a non-relational distributed database, and the company said it is the result of two years of in-house development. The queries for Impala can be expressed as SQL.

Brocade delivers a comprehensive cloud-optimized networking portfolio of products and open-architecture solutions to simplify and accelerate the deployment of cloud computing and provide maximum deployment flexibility with plug-in scalability. Click here to learn more.

There's a new tool for Big Data analysis. On Wednesday, Cloudera announced a real-time query engine for Apache Hadoop, resulting from two years of in-house development efforts.

The engine is an enhancement to Cloudera's Big Data platform, known as Cloudera Enterprise. In describing the query-engine's uniqueness, Cloudera claims this is the first time both real-time and batch operations are available for unstructured and structured data Relevant Products/Services in one massively scalable system.

Cloudera offers a commonly used version of Hadoop, an open-source data framework designed for handling Big Data.

In its announcement, Cloudera said that the new query engine will enable organizations to "process data at petabyte scale Relevant Products/Services and, on the same system, interact with that data in real time to deliver 'speed-of-thought' insights." In short, the company said, the new tool will allow organizations to "ask bigger questions" of their data.

SQL Queries

The Apache-licensed, open-source query engine, Cloudera Impala, is specifically designed for real-time query of data stored in a Hadoop Distributed File System (HDFS) and in HBase, a non-relational distributed database. Interactive queries for Impala can be expressed as SQL.

The company said that Impala operates 10 times as fast as the existing Hive/MapReduce, and can be even faster, depending on the workload. It pointed to cost savings for analyzing Big Data with real-time queries, by using this open-source technology with commodity hardware.

Cloudera said that, in a recent survey it conducted of more than 100 customers, over 70 percent were looking at how to extract value from Big Data. Operational IT efficiency and competitive advantage were cited by the customers as reasons for adopting Hadoop, but the vast majority also indicated they needed faster methods of querying than the batch operations that had been available.

'Most Exciting' Since Hadoop

In its announcement, the company pointed to one of its clients, travel Web site Expedia, which said that it uses the Cloudera Enterprise platform to manage more than 4 petabytes of data. With Impala added, Expedia said the enhanced Enterprise Real-Time Query platform allows the creation of one single platform for Big Data, instead of having to maintain several systems for archiving, extracting, transforming, loading, and analytics.

Cloudera CEO Mike Olson said in a statement that, "until now, enterprises had to limit the work they did with Hadoop because batch-mode processing using MapReduce was just too slow for some business problems." Impala, he explained, will enable organizations to store all their data in Hadoop and "use the same hardware to do both powerful analytics and run real-time queries using industry-standard tools and the SQL language."

In fact, Cloudera co-founder and Chief Scientist Jeff Hammerbacher characterized Impala as "the most exciting open-source project since Hadoop," adding that it was "the most important framework beyond MapReduce for analyzing data stored in HDFS and Hbase."

Tell Us What You Think
Comment:

Name:

Advertisement



 Storage Solutions
1. Google Revamps Data Storage Plan
2. EMC ViPR Changes the Storage World
3. Workers, Firms 'Cache' in on Big Data
4. BitTorrent Offers P2P File-Syncing App
5. MySQL Founders Reunite for MariaDB


advertisement


 Most Popular Articles
1. Half of Companies To Mandate BYOD by 2017, Gartner Says
2. Best of Interop Award Winners Announced
3. Novell Filr Offers IT-Friendly Dropbox Alternative
4. BitTorrent Offers Alpha of P2P File-Syncing App
5. 75% of Breaches Financially Motivated, 20% Are Espionage

Have an informed opinion on this story?
Send a Letter to the Editor.
We want to know what you think.
Send us your Feedback.

 Related Topics  Latest News & Special Reports

  China Hackers Resume U.S. Attacks
  Cyberattacks Could Help Syrian Raids
  Financial Times Latest Hacking Target
  HP and SAP Team on HANA Database
  Cloud Computing Gains Another Rival

 Technology Marketplace

BYOD & MDM
Build a business case for a BYOD program.
 
CRM Systems
Free Download: Understanding the Voice of the Customer
 
Cloud & Virtualization
Brocade technologies help enable the full benefits of virtualization.
Riverbed Stingray Traffic Manager on Amazon Web Services
 
Contact Centers
Unlock the potential in your people with Microsoft Dynamics
 
Customer Data
Free Download: Understanding the Voice of the Customer
 
Customer Service
Unlock the potential in your people with Microsoft Dynamics
 
Data Security
Simpana® 10 software: an exponential leap forward
 
Data Storage
Brocade makes it easier to deploy, manage, and scale networks.
 
Enterprise Software
Simpana® 10 software: an exponential leap forward
 
Mobile Apps
Build great mobile apps that drive engagement.
 
Network Security
Brocade makes it easier to deploy, manage, and scale networks.
 
Enterprise Software Spotlight

Should Enterprises Skip Over Windows 8?
Because of the interface changes and compatibility issues, most businesses will not adopt Windows 8 as their standard, but must be prepared to meet employee BYOD demand for it, Forrester Research says.

HP and SAP Team To Advance HANA Database Technology
The two tech leaders are working on a system that SAP says could fundamentally change the database market. HANA is SAP's technology that keeps data in-memory, for super fast processing.

Revlon Saving Millions with Microsoft Dynamics
The cosmetics giant is reporting millions of dollars in savings thanks to consolidating its enterprise resource planning by using Microsoft Dynamics ERP. Revlon CIO David Giambruno recently shared his story.

Advertisement
Navigation
Data Storage Today
Home/Top News | Data Centers | Storage Solutions | Storage Networks | Data Storage Issues | Data Security | Enterprise I.T.
DST Press Releases
Also visit these Enterprise Technology Sites
Top Tech News | CIO Today | Mobile Tech Today | Data Storage Today

Services:
FreeNewsFeed | Free Newsletters | XML/RSS Feed

About CIO Today Network | How To Contact Us | Article Reprints | Services for PR Pros (In partnership with NewsFactor) | Top Tech Wire | How To Advertise

Privacy Policy | Terms of Service
© Copyright 2000-2013 Data Storage Today. All rights reserved. Article rating technology by Blogowogo. Member of Accuserve Ad Network.