Enterprise Data Warehouse Optimization with Hadoop on IBM Power Systems Servers

Enterprise Data Warehouse Optimization with Hadoop on IBM Power Systems Servers

Title: Enterprise Data Warehouse Optimization with Hadoop on IBM Power Systems Servers
Author: Scott Vetter, Helen Lu & Maciej Olejniczak
Release: 2018-01-31
Kind: ebook
Genre: Computers, Books, Computers & Internet
Size: 1253908
Data warehouses were developed for many good reasons, such as providing quick query and reporting for business operations, and business performance. However, over the years, due to the explosion of applications and data volume, many existing data warehouses have become difficult to manage. Extract, Transform, and Load (ETL) processes are taking longer, missing their allocated batch windows. In addition, data types that are required for business analysis have expanded from structured data to unstructured data.

The Apache open source Hadoop platform provides a great alternative for solving these problems.

IBM® has committed to open source since the early years of open Linux. IBM and Hortonworks together are committed to Apache open source software more than any other company.

IBM Power Systems™ servers are built with open technologies and are designed for mission-critical data applications. Power Systems servers use technology from the OpenPOWER Foundation, an open technology infrastructure that uses the IBM POWER® architecture to help meet the evolving needs of big data applications. The combination of Power Systems with Hortonworks Data Platform (HDP) provides users with a highly efficient platform that provides leadership performance for big data workloads such as Hadoop and Spark.

This IBM Redpaper™ publication provides details about Enterprise Data Warehouse (EDW) optimization with Hadoop on Power Systems. Many people know Power Systems from the IBM AIX® platform, but might not be familiar with IBM PowerLinux™, so part of this paper provides a Power Systems overview. A quick introduction to Hadoop is provided for those not familiar with the topic. Details of HDP on Power Reference architecture are included that will help both software architects and infrastructure architects understand the design.

In the optimization chapter, we describe various topics: traditional EDW offload, sizing guidelines, performance tuning, IBM Elastic Storage™ Server (ESS) for data-intensive workload, IBM Big SQL as the common structured query language (SQL) engine for Hadoop platform, and tools that are available on Power Systems that are related to EDW optimization. We also dedicate some pages to the analytics components (IBM Data Science Experience (IBM DSX) and IBM Spectrum™ Conductor for Spark workload) for the Hadoop infrastructure.

More Books from Scott Vetter, Helen Lu & Maciej Olejniczak

Scott Vetter, Mel Cordero, Lucio Correia, Hai Lin, Vamshikrishna Thatikonda & Rodrigo Xavier
Scott Vetter, Alexandre Bicas Caldeira, Bartłomiej Grabowski, Volker Haug, Marc-Eric Kahle, Andrew Laidlaw, Cesar Diniz Maciel, Monica Sanchez & Seulgi Yoppy Sung
Scott Vetter, Marina Rodriguez Batalha, Raghavendra K Prasannakumar & Humberto Tadashi Tsubamoto
Scott Vetter, Ivaylo B. Bozhinov, Anto A John, Rafael Freitas de Lima, Ahmed.(Mash) Mashhour, James Van Oosten, Fernando Vermelho & Allison White
Scott Vetter, Alexandre Bicas Caldeira, Cho Younghoon, James Cruickshank, Bartłomiej Grabowski, Volker Haug, Andrew Laidlaw & Seulgi Yoppy Sung
Scott Vetter, Alexandre Bicas Caldeira, Bartłomiej Grabowski, Volker Haug, Marc-Eric Kahle, Cesar Diniz Maciel & Monica Sanchez
Scott Vetter, Shivaji D Bhosale, Alexandre Bicas Caldeira, Bartłomiej Grabowski, Chuck Graham, Alexander D Hames, Volker Haug, Marc-Eric Kahle, Cesar Diniz Maciel, Manjunath N Mangalur & Monica Sanchez
Scott Vetter, Alexandre Bicas Caldeira, Volker Haug, Marc-Eric Kahle, Cesar Diniz Maciel & Monica Sanchez
Scott Vetter, Alexandre Bicas Caldeira, Cho Younghoon, James Cruickshank & Bartłomiej Grabowski
Scott Vetter, Sylvain Delabarre, Sorin Hanganu & Thomas Libor PhD
Scott Vetter, Alexandre Caldeira, Marc-Eric Kahle, Gerard Saverimuthu & K. C. Vearner
Scott Vetter, Volker Haug, Andrew Laidlaw & Seulgi Yoppy Sung
Scott Vetter, James Cruickshank, Volker Haug, Yongsheng Li (Victor) & Armin Röll
Scott Vetter, Murilo Opsfelder Araújo, Breno Leitao, Stephen Lutz & José Ricardo Ziviani
Scott Vetter, Giuliano Anselmi, Bruno Blanchard, Cho Younghoon, Christopher Hales & Marcos Quezada
Scott Vetter, Swarna Narendra Babu & Harihara Balakrishnan
Scott Vetter, Javier Bazan Lazcano & Martin Parrella
Scott Vetter, Navdeep Dhaliwal, Ahmed Mashhour, Armin Röll & Liviu Rosca
Scott Vetter, Alexandre Bicas Caldeira & Volker Haug
Scott Vetter, Young Hoon Cho, Gareth Coates, Bartłomiej Grabowski & Volker Haug
Scott Vetter, Ahmed Azraq, Soheel Chughtai, Ahmed (Mash) Mashhour, Duy V Nguyen & Reginaldo Marcelo dos Santos
Scott Vetter, Giuliano Anselmi, Manish Arora, Ivaylo Bozhinov, Dinil Das, Turgut Genc, Bartłomiej Grabowski, Madison Lee & Armin Röll
Scott Vetter, Alexandre Caldeira, Marc-Eric Kahle, Gerard Saverimuthu & K. C. Vearner
Scott Vetter, Javier Bazan Lazcano & Stephen Lutz
Scott Vetter, James Cruickshank, Volker Haug, Yongsheng Li (Victor) & Armin Röll
Scott Vetter, David Barron, Alexandre Bicas Caldeira & Volker Haug
Scott Vetter & Alexandre Bicas Caldeira
Scott Vetter, Giuliano Anselmi, Marc Gregorutti, Stephen Lutz, Michael Malicdem, Guido Somers & Tsvetomir Spasov
Scott Vetter, Volker Haug, Ritesh Nohria & Gustavo Santos
Scott Vetter, Alexandre Bicas Caldeira & Volker Haug
Scott Vetter, Sachin P. Deshmukh, Thierry Huche, Stephen Lutz, Ahmed Mashhour, Christopher Emefiene Osiegbu & Borislav Ivanov Stoymirski
Scott Vetter, Lokesh Bhatt, Turgut Genc, Sabine Jordan & Wasif Mohammad
Scott Vetter, Tobias Elpelt, Rico Franke & Yanil Zeledón Miranda
Scott Vetter, Volker Haug, Ritesh Nohria & Gustavo Santos
Scott Vetter, Jean-Luc Bonhommet & Ingo Dimmer
Scott Vetter, Cho Younghoon & Stephen Lutz
Scott Vetter, David Barron, Alexandre Bicas Caldeira & Volker Haug
Scott Vetter, Glen Corneau, Andrew Laidlaw & Marcos Quezada
Scott Vetter, Young Hoon Cho, Gareth Coates, Bartłomiej Grabowski & Volker Haug
Scott Vetter, Ivaylo Bozhinov, Boran Lee & Gustavo Santos
Scott Vetter, Bartłomiej Grabowski, Mauro Minomizaki & Tamas David Domjan
Scott Vetter, Tonny Bastiaans & Andrew Laidlaw
Scott Vetter, Young Hoon Cho, Gareth Coates, Bartłomiej Grabowski & Volker Haug