'Hadoop Illuminated' is an open source book about Apache Hadoop™. It aims to make Hadoop knowledge accessible to a wider audience, not just to the highly technical.

The book is a 'living book': we will keep updating it to cover the fast-evolving Hadoop ecosystem.

Check out these chapters: Hadoop Use Cases, Big Data Ecosystem, Publicly Available Big Data Sets

The book is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.

(The same license as MIT OpenCourseWare.)


  • This website is NOT associated with the Apache Software Foundation.
  • Apache Hadoop is open source software from the Apache Software Foundation.
  • Apache, Apache Hadoop, and Hadoop are trademarks of The Apache Software Foundation. Used with permission. No endorsement by The Apache Software Foundation is implied by the use of these marks.
  • For brevity, we will refer to Apache Hadoop as Hadoop.

Read the Book

The book is freely available online in HTML and PDF.

The book's source (in DocBook format) is available from the GitHub repository: https://github.com/elephantscale/hadoop-book

About The Authors

Mark Kerzner and Sujee Maniyam wrote this book, along with a few contributors.

The authors can be reached at authors@hadoopilluminated.com

Mark and Sujee are the founders of Elephant Scale, a company focused on providing high-quality services and solutions for the Hadoop and Big Data ecosystem.