XbaoWu/tez

This branch is 96 commits behind apache/tez:master.

Apache Tez

Apache Tez is a generic data-processing pipeline engine, envisioned as a low-level engine for higher-level abstractions such as Apache Hadoop MapReduce, Apache Pig, and Apache Hive.

At its heart, Tez is very simple and has just two components:

  • The data-processing pipeline engine, into which one can plug input, processing, and output implementations to perform arbitrary data processing. Every 'task' in Tez has the following:
      • Input to consume key/value pairs from.
      • Processor to process them.
      • Output to collect the processed key/value pairs.
  • A master for the data-processing application, with which one can assemble the arbitrary data-processing 'tasks' described above into a task-DAG to process data as desired. The generic master is implemented as an Apache Hadoop YARN ApplicationMaster.
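
The two components above can be illustrated with a small, self-contained Java sketch. It is a toy model rather than the Tez API: the names `Input`, `Output`, `Processor`, `Buffer`, and `wordCount` are hypothetical and chosen only to mirror the vocabulary of the list. Two tasks are chained through an in-memory buffer that stands in for a DAG edge, and the `wordCount` method plays the role of the master by wiring the tasks together and running them in order.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Toy model only: every type here is hypothetical and is NOT the real Tez API.
class TaskSketch {

    /** Source of key/value pairs for a task. */
    interface Input<K, V> {
        Iterable<Map.Entry<K, V>> read();
    }

    /** Sink that collects the key/value pairs a task produces. */
    interface Output<K, V> {
        void write(K key, V value);
    }

    /** Task logic: consumes pairs from an Input and emits pairs to an Output. */
    interface Processor<KI, VI, KO, VO> {
        void run(Input<KI, VI> in, Output<KO, VO> out);
    }

    /** In-memory buffer: the Output of one task and the Input of the next (a DAG edge). */
    static final class Buffer<K, V> implements Input<K, V>, Output<K, V> {
        private final List<Map.Entry<K, V>> pairs = new ArrayList<>();

        @Override
        public void write(K key, V value) {
            pairs.add(new AbstractMap.SimpleImmutableEntry<>(key, value));
        }

        @Override
        public Iterable<Map.Entry<K, V>> read() {
            return pairs;
        }
    }

    /** Acts as the "master": builds a two-task linear DAG and runs it in order. */
    static Map<String, Integer> wordCount(List<String> lines) {
        Buffer<Integer, String> source = new Buffer<>();
        for (int i = 0; i < lines.size(); i++) {
            source.write(i, lines.get(i)); // key = line number, value = line text
        }
        Buffer<String, Integer> edge = new Buffer<>(); // connects task 1 to task 2
        Buffer<String, Integer> sink = new Buffer<>();

        // Task 1: split each line into words and emit (word, 1).
        Processor<Integer, String, String, Integer> tokenizer = (in, out) -> {
            for (Map.Entry<Integer, String> line : in.read()) {
                for (String word : line.getValue().trim().split("\\s+")) {
                    if (!word.isEmpty()) {
                        out.write(word, 1);
                    }
                }
            }
        };

        // Task 2: sum the counts per word and emit (word, total).
        Processor<String, Integer, String, Integer> summer = (in, out) -> {
            Map<String, Integer> totals = new TreeMap<>();
            for (Map.Entry<String, Integer> pair : in.read()) {
                totals.merge(pair.getKey(), pair.getValue(), Integer::sum);
            }
            totals.forEach(out::write);
        };

        tokenizer.run(source, edge);
        summer.run(edge, sink);

        Map<String, Integer> result = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : sink.read()) {
            result.put(pair.getKey(), pair.getValue());
        }
        return result;
    }

    public static void main(String[] args) {
        System.out.println(wordCount(Arrays.asList("to be or not to be")));
    }
}
```

In this sketch the "DAG" is only a straight line of two tasks executed sequentially in one process. A real engine would additionally schedule tasks across a cluster and move data between them, which this example deliberately leaves out.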
