Organizations are increasingly adopting the Hadoop MapReduce framework to transform strategic decision-making and uncover unique business opportunities hidden in vast amounts of data. However, developing and maintaining Hadoop deployments, while achieving performance service level agreements (SLAs) required by the business, poses challenges that can increase risk and hinder adoption.
- Scaling requires adding more nodes due to a high CPU and memory footprint.
- Tuning involves hundreds of configurable parameters, making it difficult to achieve optimum performance.
- Developing and maintaining MapReduce jobs requires highly technical skills.
- Adding new capabilities, such as reverse sorting, requires manual coding.
Accelerating Hadoop with DMExpress
Syncsort has introduced a special DMExpress Hadoop Edition of its record-setting data integration acceleration software to help organizations accelerate Hadoop deployments, minimize TCO and achieve faster time-to-value. Built on 40+ years of data integration and data performance expertise, this new solution makes it easier than ever to develop MapReduce jobs by harnessing all the benefits of the distributed computing platform combined with the unmatched performance and efficiency of DMExpress.
- Faster Performance at Scale means reducing hardware by up to 50% so you can defer hardware purchases while still exceeding performance SLAs.
- Automatic tuning engines translate into fewer IT staff hours needed to maintain existing Hadoop deployments.
- Intuitive graphical interface reduces barriers for wider adoption across the organization, increasing overall productivity and accelerating development of strategic initiatives.
DMExpress Hadoop Edition is initially available as part of a limited beta program, with expected general availability later in 2011. If you are interested in our beta program, please contact us to learn more.
“The Hadoop benchmark testing we have completed with Syncsort has exceeded our expectations. We see tremendous potential for using lightweight, easy to deploy tools like DMExpress for accelerating MapReduce processing and making it more efficient.”
-- Michael Brown, Chief Technology Officer, comScore Inc. |
comScore Sees 2x Performance with Hadoop Acceleration
comScore, a leader in measuring the digital world, has built and defined a market by leveraging ‘Big Data’ to help its customers succeed. The company monitors, collects and analyzes more than 20 billion records a day, amounting to terabytes of information, to provide unique insights about users online and offline behavior.
comScore engaged Syncsort to accelerate Hadoop processing and, in benchmark testing, achieved 2x faster performance with DMExpress without additional hardware and with minimal coding and tuning.*
Syncsort announces plans to contribute an external sort “plug-in” to the Hadoop open source community..
> Learn More
Data Integration Acceleration solutions eliminate performance bottlenecks within existing DI environments in a cost-effective, non-disruptive and scalable manner.
> Learn More
DMExpress is the quickest data integration solution to deploy, the easiest to use, the fastest to process growing data sets, and the least taxing on hardware and network resources.
> Learn More
* The benchmark testing was completed on a 6 node cluster on Cloudera’s Distribution for Hadoop Version 3 (CDH3) and involved terabytes of data.