Spark Application Garbage Collector
ContextCleaner
is a Spark service that is responsible for application-wide cleanup of shuffles , RDDs , broadcasts , accumulators and checkpointed RDDs that is aimed at reducing the memory requirements of long-running data-heavy Spark applications.
ContextCleaner
runs on the driver. It is created and immediately started whenSparkContext
starts (andspark.cleaner.referenceTracking
Spark property is enabled, which it is by default). It is stopped whenSparkContext
is stopped
.
Last updated