Categories: EnterpriseSoftware

Dell, Cloudera, & Syncsort Partner To Simplify Hadoop For New Users

Today Dell announced that it was partnering with Cloudera and Syncsort to create a new solution for Hadoop. The new solution is aimed at streamlining the extract, transform, and load (ETL) process. ETL is the process of planning, design, construction and deployment process of transforming data into a ready state for analysis, then loading it for business reporting or for querying. The new Dell| Cloudera | Syncsort Data Warehouse Optimization – ETL Offload Reference Architecture will help customers achieve new insights.


Today Dell announced that it was partnering with Cloudera and Syncsort to create a new solution for Hadoop. The new solution is aimed at streamlining the extract, transform, and load (ETL) process. ETL is the process of planning, design, construction and deployment process of transforming data into a ready state for analysis, then loading it for business reporting or for querying. The new Dell| Cloudera | Syncsort Data Warehouse Optimization – ETL Offload Reference Architecture will help customers achieve new insights.

More and more organizations want to adopt Big Data and analytics technologies but tools such as relational database management systems and enterprise data warehouses are too expensive and lacked the desired performance and scalability. Hadoop comes to mind as a potential solution, with the main drawback of the lack of expertise on Hadoop in the industry. In order to effectively use Hadoop companies need to either train their IT resources or pay consulting fees, both of which can be costly. The newly announced reference architecture is designed to achieve faster insights without setting aside the costs and time to learn Hadoop.

The Dell| Cloudera | Syncsort Data Warehouse Optimization – ETL Offload Reference Architecture uses Syncsort’s DMX-h technology to develop and deploy Hadoop ETL jobs. This new reference architecture has been tested and validated:
Dell had an entry-level technician and an expert-level senior engineer run the same workload on four Dell PowerEdgeTM R730xd servers and two Dell PowerEdge R730 servers, powered by Intel Xeon processor E5-2600 v3 product family, in a Hadoop cluster. The results were clear: an entry-level technician created ETL jobs with the Dell | Cloudera | Syncsort solution 60% faster than an expert level senior engineer running the same scenario with do-it-yourself, open-source ETL solutions. Additionally, the entry-level technician was able to streamline ETL design by 53%, giving businesses the equivalent of four days back.

Key features include:

  • Greater cost savings: Customers can save up to 75 percent on ETL administrative costs when compared to a do-it-yourself approach with open-source solutions.5 Additionally, by offloading the data transformation to Hadoop, customers can reduce transformation costs and reclaim data warehouse capability
  • Faster time-to-value: New Hadoop users can more quickly develop and deploy Hadoop ETL jobs, reducing setup time from four weeks with a do-it-yourself open-source approach to one week with the Dell | Cloudera | Syncsort solution6
  • Quicker access to data insights: The Dell | Cloudera | Syncsort solution reduces the time spent on design by 53 percent, and enables 60 percent faster ETL jobs overall when compared to a do- it-yourself open-source solution7
  • Customizable solutions: The reference architecture is easy and intuitive to integrate and deploy, allowing customers to spend more time on strategic forward-looking projects, and less on IT setup and management tasks

Dell main site

Cloudera main site

Syncsort main site

Discuss This Story

Adam Armstrong

Adam is the chief news editor for StorageReview.com, managing our internal and freelance content teams.

Recent Posts

Dell Advances Data Protection Portfolio Amid Rising Cyber Threats

Dell Technologies is advancing its data protection portfolio to enhance cyber resiliency across appliances, software, and as-a-service offerings amid rising…

2 days ago

HPE Cray Storage Systems C500 Lowers Storage Costs For Entry-level Snd Midrange HPC/AI Clusters

Since its launch in 2019, the Cray ClusterStor E1000 Storage System has emerged as a pivotal technology in the field…

2 days ago

Quantum Introduces Quantum GO Subscription Service For Data Management

Quantum Corporation has introduced Quantum GO, a subscription service designed to meet the escalating data demands and cost considerations enterprises…

3 days ago

JetCool Unveils Cold Plates for the NVIDIA H100 GPU

JetCool has launched an innovative liquid cooling module tailored for NVIDIA's H100 SXM and PCIe GPUs, claiming a significant advancement…

5 days ago

iXsystems Expands TrueNAS Enterprise with H-Series Platforms

iXsystems has launched the TrueNAS Enterprise H-Series platforms, designed to give organizations ultimate performance. The H10 model is now available,…

1 week ago

Microsoft Azure Edge Infrastructure At Hannover Messe 2024

Hannover Messe 2024 represents a significant event in the global industrial sector, serving as the world's largest industrial trade fair.…

1 week ago