Data Storage Today

CIO Today Network Sites:   Top Tech News  |   CIO Today   |   Mobile Tech Today   |   Data Storage Today
News & Information for Data Storage Professionals
Eliminate costly downtime!
Find out how with Free White Paper
& enter to win a Samsung Galaxy Note

www.apc.com
Saturday, April 19th 
24/7/365 Network Uptime!
This ad will display for the next 20 seconds. Please click for more information, or scroll down to pass the ad, or Close Ad.
Trending Topics:   Security Heartbleed Big Data Cloud Computing Windows XP Data Centers OS X Mavericks
Home
Data Centers
Storage Solutions
Storage Networks
Data Storage Issues
Data Security
DST Press Releases
 
Free Newsletters
Top CIO News
 
Mobile Tech Today
 

Storage Solutions

Cloudera Announces Real-Time Query Engine for Hadoop

Cloudera Announces Real-Time Query Engine for Hadoop
October 24, 2012 1:56PM

Bookmark and Share
Cloudera's Apache-licensed, open-source query engine, Cloudera Impala, is specifically designed for real-time query of data stored in a Hadoop Distributed File System, or HDFS, and in HBase, a non-relational distributed database, and the company said it is the result of two years of in-house development. The queries for Impala can be expressed as SQL.

Your Next Generation Data Center Is Here! Vblock™ Systems: the world's most advanced converged infrastructure are built on the Cisco Unified Computing System with Intel® Xeon® processors. Vblock™ Systems deliver extraordinary time to market, ROI and TCO, and flexibility to meet your continually changing demands with 5X faster deployment, 96% less downtime, and 1/2 the cost. Click here to learn more.

There's a new tool for Big Data analysis. On Wednesday, Cloudera announced a real-time query engine for Apache Hadoop, resulting from two years of in-house development efforts.

The engine is an enhancement to Cloudera's Big Data platform, known as Cloudera Enterprise. In describing the query-engine's uniqueness, Cloudera claims this is the first time both real-time and batch operations are available for unstructured and structured data in one massively scalable system.

Cloudera offers a commonly used version of Hadoop, an open-source data framework designed for handling Big Data.

In its announcement, Cloudera said that the new query engine will enable organizations to "process data at petabyte scale and, on the same system, interact with that data in real time to deliver 'speed-of-thought' insights." In short, the company said, the new tool will allow organizations to "ask bigger questions" of their data.

SQL Queries

The Apache-licensed, open-source query engine, Cloudera Impala, is specifically designed for real-time query of data stored in a Hadoop Distributed File System (HDFS) and in HBase, a non-relational distributed database. Interactive queries for Impala can be expressed as SQL.

The company said that Impala operates 10 times as fast as the existing Hive/MapReduce, and can be even faster, depending on the workload. It pointed to cost savings for analyzing Big Data with real-time queries, by using this open-source technology with commodity hardware.

Cloudera said that, in a recent survey it conducted of more than 100 customers, over 70 percent were looking at how to extract value from Big Data. Operational IT efficiency and competitive advantage were cited by the customers as reasons for adopting Hadoop, but the vast majority also indicated they needed faster methods of querying than the batch operations that had been available.

'Most Exciting' Since Hadoop

In its announcement, the company pointed to one of its clients, travel Web site Expedia, which said that it uses the Cloudera Enterprise platform to manage more than 4 petabytes of data. With Impala added, Expedia said the enhanced Enterprise Real-Time Query platform allows the creation of one single platform for Big Data, instead of having to maintain several systems for archiving, extracting, transforming, loading, and analytics.

Cloudera CEO Mike Olson said in a statement that, "until now, enterprises had to limit the work they did with Hadoop because batch-mode processing using MapReduce was just too slow for some business problems." Impala, he explained, will enable organizations to store all their data in Hadoop and "use the same hardware to do both powerful analytics and run real-time queries using industry-standard tools and the SQL language."

In fact, Cloudera co-founder and Chief Scientist Jeff Hammerbacher characterized Impala as "the most exciting open-source project since Hadoop," adding that it was "the most important framework beyond MapReduce for analyzing data stored in HDFS and Hbase."

Tell Us What You Think
Comment:

Name:



 Storage Solutions
1. Fast Seagate 6 TB Drive for Data Center
2. Hortonworks, LucidWorks Join Forces
3. Google Drops Cloud Storage Prices
4. SAP Upgrades Business Warehouse App
5. Storage firm Box Files $250M IPO




 Most Popular Articles
1. Intel Bets on Cloudera for Big Data Analytics
2. SAP HANA Data Warehouse App Gets Faster Analytics
3. California DMV Investigates Possible Security Breach
4. Fast Seagate 6 TB Drive Offered for Enterprise Data Centers
5. Resetting All Passwords Now May Be Worst Heartbleed Fix

Have an informed opinion on this story?
Send a Letter to the Editor.
We want to know what you think.
Send us your Feedback.

 Related Topics  Latest News & Special Reports

  Heartbleed Exploit Could Cost Millions
  Michaels: Nearly 3M Cards Breached
  Malware Targets Facebook Users
  IBM Adds Disaster Recovery to SoftLayer
  How To Beat the Heartbleed Bug

 Technology Marketplace

Business Intelligence
Get real-time, cloud-based information services with Neustar.
 
Cloud Computing
BMC's I.T. solutions unleash the power of your business
Next Generation Data Center Is Here! Vblock™ Systems from VCE
 
Contact Centers
HP delivers the future of the contact center with HP Qfiniti 10.
 
Data Storage
Next Generation Data Center Is Here! Vblock™ Systems from VCE
Barium Ferrite (BaFe) is the future of tape.
2.5" Enterprise-class SATA & SAS SSDs for server & storage applications
 
Enterprise Hardware
Barium Ferrite (BaFe) is the future of tape.
2.5" Enterprise-class SATA & SAS SSDs for server & storage applications
 
Enterprise I.T.
BMC's I.T. solutions unleash the power of your business
 
Hardware
Protect your network with APC Smart-UPS battery backup
 
Network Security
Protect your network with APC Smart-UPS battery backup
 

Network Security Spotlight
Heartbleed Could Cost Millions, Could Have Been Prevented
Early estimates of Heartbleed’s cost to enterprises are running in the millions. The reason: revoking all the SSL certificates the bug exposed will come at a very hefty price. Some say it all could have been avoided.
 
Michaels Says Nearly 3M Credit, Debit Cards Breached
Arts and crafts retail giant Michaels Stores has confirmed that a data breach at its POS terminals from May 2013 to Jan. 2014 may have exposed nearly 3 million customer credit and debit cards.
 
Google's Street View Software Unravels CAPTCHAs
The latest software Google uses for its Street View cars to read street numbers in images for Google Maps works so well that it also solves CAPTCHAs, those puzzles designed to defeat bots.
 

Enterprise Hardware Spotlight
Vaio Fit 11A Battery Danger Forces Recall by Sony
Using a Sony Vaio Fit 11A laptop? It's time to send it back to Sony. In fact, Sony is encouraging people to stop using the laptop after several reports of its Panasonic battery overheating.
 
Continued Drop in Global PC Shipments Slows
Worldwide shipments of PCs fell during the first three months of the year, but the global slump in PC demand may be easing, with a considerable slowdown from last year's drops.
 
Google Glass Finds a Home in Medical Education, Practice
The innovative headpiece may find its niche in markets where hands-free access to data can be a big advantage. Glass experiments for doctors are already under way, with some promising results.
 

Mobile Technology Spotlight
Review: Siri-Like Cortana Fills Windows Phone Gap
With the new Cortana virtual assistant, Windows catches up with Apple's iOS and Google's Android in a major way, taking some of the best parts of Apple's and Google's virtual assistants, with new tools too.
 
With Galaxy S5, Samsung Proves Less Can Be More
Samsung has produced the most formidable rival yet to the iPhone 5s: the Galaxy S5. The device is the fifth edition of the company's successful line of Galaxy S smartphones, and shows less can be more.
 
Facebook Rolls Out Potentially Intrusive Location-Sharing
Looking for friends? Facebook users in the U.S. will soon be able to see which of their friends are nearby, using a smartphone's GPS. Could be a cool feature in some cases, or way too much information.
 

Navigation
Data Storage Today
Home/Top News | Data Centers | Storage Solutions | Storage Networks | Data Storage Issues | Data Security | DST Press Releases
Also visit these Enterprise Technology Sites
Top Tech News | CIO Today | Mobile Tech Today | Data Storage Today

Services:
FreeNewsFeed | Free Newsletters | XML/RSS Feed

About CIO Today Network | How To Contact Us | Article Reprints | Services for PR Pros (In partnership with NewsFactor) | Top Tech Wire | How To Advertise

Privacy Policy | Terms of Service
© Copyright 2000-2014 Data Storage Today. All rights reserved. Article rating technology by Blogowogo. Member of Accuserve Ad Network.