Table of Contents
Analytics for Hadoop can be done by the following:
How ever doing analytics like this can feel a little pedantic and time consuming. Business INtelligence tools (BI tools for short) can address this problem.
BI tools have been around since before Hadoop. Some of them are generic, some are very specific towards a certain domain (e.g. Telecom, Health Care ..etc). BI tools provide rich, user friendly environment to slice and dice data. Most of them have nice GUI environments as well.
Since Hadoop is gaining popularity as a data silo, a lot of BI tools have added support to Hadoop. In this chapter we will look into some BI tools that work with Hadoop.
We are trying to present capabilities of BI tools in an easy to compare feature matrix format. This is a 'living' document. We will keep it updated as new versions and new features surface.
This matrix is under construction
How to read the matrix?
Y - feature is supported
N - feature is NOT supported
? or empty - unknown
Read the legend for feature descriptions.
Can validate data confirms to certain limits, can do cleansing and de-duping.
Can share the results with others within or outside organization easily. (Think like sharing a document on DropBox or Google Drive)
You can slice and dice data on locally on a computer or tablet. This uses the CPU power of the device and doesn't need a round-trip to a 'server' to process results. This can speed up ad-hoc data exploration
The platform allows customers to buy third party analytics app. Think like APple App Store