Tag: Hadoop

Hive

Hive is a data warehouse infrastructure built on top of Hadoop that provides tools to enable easy data summarization, adhoc ...

Pig

Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, ...

Apache Hadoop

Apache Hadoop

The Apache Hadoop software library is the leading framework for distributed processing of large data sets across clusters of computers ...

HBase

HBase is the Hadoop database. Think of it as a distributed, scalable, big data store.

CDH

CDH (Cloudera's Distribution, including Apache Hadoop) is Cloudera's 100% open source Hadoop distribution, and the world's leading Apache Hadoop solution: ...

MapR

MapR delivers on the promise of Hadoop, making Big Data management and analysis a reality for more business users. The ...