Tag: MapReduce

Hive

Hive is a data warehouse infrastructure built on top of Hadoop that provides tools to enable easy data summarization, adhoc ...

Pig

Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, ...

Apache Hadoop

Apache Hadoop

The Apache Hadoop software library is the leading framework for distributed processing of large data sets across clusters of computers ...

Riak

Riak

Riak combines a decentralized key/value store, a flexible map/reduce engine, and a friendly HTTP/JSON query interface to provide a database ...