SparkCAD (Spark Caching Anomalies Detector): Logical plan visualization and caching anomalies detection tool.

dbis-ilm/SparkCAD

SparkCAD

SparkCAD (Spark Caching Anomalies Detector): Logical plan visualization and caching anomalies detection tool for Apache Spark applications.

  • Current version: 1.0
  • Contents:
    1. Overview
    2. Requirements
    3. Organization

Overview

SparkCAD is an interactive decision-support tool that visualizes the logical plan of Spark applications and detects caching anomalies. It parses Spark's execution logs, the same ones that Spark's History Server parses, and requires no additional metadata.
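
To illustrate what parsing these logs can look like, here is a minimal sketch. It is not SparkCAD's own implementation. It assumes the log is in Spark's JSON-lines event log format, where each line is one event and `SparkListenerStageSubmitted` events carry a `Stage Info` object with an `RDD Info` list. The function name `build_rdd_lineage` and the sample events are hypothetical and exist only for illustration.

```python
import json
from collections import defaultdict


def build_rdd_lineage(lines):
    """Collect RDD names and parent links from event-log lines.

    Hypothetical helper: assumes one JSON event per line, as in
    Spark's event log format.
    """
    names = {}
    parents = defaultdict(set)
    for line in lines:
        line = line.strip()
        if not line:
            continue
        event = json.loads(line)
        # Only stage-submission events are inspected in this sketch.
        if event.get("Event") != "SparkListenerStageSubmitted":
            continue
        for rdd in event.get("Stage Info", {}).get("RDD Info", []):
            rdd_id = rdd["RDD ID"]
            names[rdd_id] = rdd.get("Name", "")
            parents[rdd_id].update(rdd.get("Parent IDs", []))
    return names, dict(parents)


# Made-up sample events, not taken from the "logs" folder.
sample_log = [
    json.dumps({"Event": "SparkListenerApplicationStart", "App Name": "demo"}),
    json.dumps({
        "Event": "SparkListenerStageSubmitted",
        "Stage Info": {
            "Stage ID": 0,
            "RDD Info": [
                {"RDD ID": 1, "Name": "MapPartitionsRDD", "Parent IDs": [0]},
                {"RDD ID": 0, "Name": "ParallelCollectionRDD", "Parent IDs": []},
            ],
        },
    }),
]

names, parents = build_rdd_lineage(sample_log)
```

The resulting `parents` mapping gives the edges of a lineage graph, which is the kind of structure a logical plan visualization could be drawn from.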

Requirements

Organization

A sample of 130 execution logs can be found in the "logs" folder. The produced logical plans are stored by default in "Spark-DAGs". The default configuration is stored in "config.ini". The user can edit this file, or override the default configuration values while using the notebook.
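
As a rough sketch of overriding defaults from a notebook, the snippet below loads INI-style settings with Python's standard `configparser` and then applies in-memory overrides. The section name `paths`, the keys `logs_dir` and `dags_dir`, and the helper `load_config` are assumptions made for illustration; the actual keys in "config.ini" may differ.

```python
import configparser

# Hypothetical defaults; the real config.ini may use other sections and keys.
DEFAULT_INI = """
[paths]
logs_dir = logs
dags_dir = Spark-DAGs
"""


def load_config(ini_text, overrides=None):
    """Parse INI text and apply {(section, key): value} overrides."""
    parser = configparser.ConfigParser()
    parser.read_string(ini_text)
    for (section, key), value in (overrides or {}).items():
        if not parser.has_section(section):
            parser.add_section(section)
        parser.set(section, key, str(value))
    return parser


# Override one value for this session without editing the file.
config = load_config(DEFAULT_INI, {("paths", "dags_dir"): "my-dags"})
```

To read from disk instead, `parser.read("config.ini")` could replace the `read_string` call. The overrides stay in memory, so the file on disk keeps its defaults.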
