Skip to content

brownsys/brownsys-tez

This branch is 932 commits behind apache/tez:master.

Folders and files

NameName
Last commit message
Last commit date
Dec 11, 2015
Feb 22, 2016
Jan 13, 2016
Jan 13, 2016
Mar 18, 2016
Mar 14, 2016
Mar 29, 2016
Mar 14, 2016
Jan 13, 2016
Feb 23, 2016
Feb 26, 2016
Mar 14, 2016
Mar 18, 2016
Mar 4, 2016
Feb 24, 2016
Mar 14, 2016
Mar 21, 2016
Mar 25, 2016
Nov 17, 2014
Mar 25, 2016
Mar 29, 2016
Sep 3, 2014
May 11, 2015
Nov 11, 2013
Feb 18, 2015
Mar 4, 2015
Jan 19, 2016
Mar 25, 2016

Repository files navigation

Apache Tez

Apache Tez is a generic data-processing pipeline engine envisioned as a low-level engine for higher abstractions such as Apache Hadoop Map-Reduce, Apache Pig, Apache Hive etc.

At its heart, tez is very simple and has just two components:

  • The data-processing pipeline engine where-in one can plug-in input, processing and output implementations to perform arbitrary data-processing. Every 'task' in tez has the following:
  • Input to consume key/value pairs from.
  • Processor to process them.
  • Output to collect the processed key/value pairs.
  • A master for the data-processing application, where-by one can put together arbitrary data-processing 'tasks' described above into a task-DAG to process data as desired. The generic master is implemented as a Apache Hadoop YARN ApplicationMaster.

Packages

No packages published

Languages

  • Java 86.9%
  • JavaScript 10.0%
  • HTML 1.6%
  • CSS 0.6%
  • Shell 0.3%
  • Protocol Buffer 0.3%
  • Other 0.3%