- Requires no additional installation on the node.
- Support for yarn-cluster mode ensures that all Spark processing is handled on the Hadoop cluster.
- Exact bin counts appear for profile results, except for Top-N counts.
NOTE: Spark History Server is not supported. Use it only for short-term debugging tasks, as it requires considerable resources.
Before you begin, please verify the following: