A Spark bootstrap project written in Scala, with Gradle as the build tool.
- JavaVersion=11
- sparkVersion=3.4.1
- scalaVersion=2.12
- deltaVersion=2.4.0
java -version
openjdk version "11.0.20" 2023-07-18
OpenJDK Runtime Environment Homebrew (build 11.0.20+0)
OpenJDK 64-Bit Server VM Homebrew (build 11.0.20+0, mixed mode)
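The project is configured for Java 11, so it can help to check the version reported by `java -version` before building. The following is a minimal sketch, assuming a bash-compatible shell; `java_major` and `require_java_11` are hypothetical helper names and are not part of this project.

```shell
#!/usr/bin/env bash
# Hypothetical helpers (not part of this repository).

# Print the major Java version for a version string such as
# "11.0.20", "11.0.20+0" or the legacy form "1.8.0_382".
java_major() {
  local v="$1"
  case "$v" in
    1.*) v="${v#1.}" ;;            # legacy scheme: 1.8.0_382 -> 8.0_382
  esac
  printf '%s\n' "${v%%[._+-]*}"    # keep only the leading number
}

# Succeed only when the given version string has major version 11.
require_java_11() {
  [ "$(java_major "$1")" = "11" ]
}

# Example use (java -version writes to stderr, hence 2>&1):
#   v="$(java -version 2>&1 | awk -F'"' '/version/ {print $2; exit}')"
#   require_java_11 "$v" || echo "Java 11 expected, found $v" >&2
```

The parsing handles only the version string itself, so it can be tried without a JDK installed by passing a literal such as `11.0.20`.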
./gradlew clean build
./gradlew check
./gradlew reportTestScoverage
The Gradle sparkSubmit task is configured to run the class dev.template.spark.RddCollect:
./gradlew sparkSubmit
${SPARK_HOME}/bin/spark-submit \
--verbose \
--class dev.template.spark.Main \
--packages io.delta:delta-core_2.12:2.4.0 \
--master "local[2]" \
--driver-memory 1g \
--executor-memory 1g \
--executor-cores 2 \
build/libs/spark-scala-gradle-bootstrap-2.12.0-all.jar \
src/main/resources/people-example.csv
${SPARK_HOME}/bin/spark-submit \
--verbose \
--class dev.template.spark.Main \
--packages io.delta:delta-core_2.12:2.4.0 \
--master spark://localhost:7077 \
--driver-memory 1g \
--executor-memory 1g \
--executor-cores 2 \
build/libs/spark-scala-gradle-bootstrap-2.12.0-all.jar \
src/main/resources/people-example.csv
${SPARK_HOME}/bin/spark-submit \
--class dev.template.spark.RddCollect \
--master spark://localhost:7077 \
build/libs/spark-scala-gradle-bootstrap-2.12.0-all.jar
${SPARK_HOME}/bin/spark-submit \
--class dev.template.spark.CovidDataPartitioner \
--packages io.delta:delta-core_2.12:2.4.0 \
--master "local[2]" \
--driver-memory 1g \
--executor-memory 1g \
--executor-cores 2 \
build/libs/spark-scala-gradle-bootstrap-2.12.0-all.jar \
src/main/resources/us-counties-recent.csv \
/tmp/partitioned-covid-data
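The spark-submit invocations for `Main` and `CovidDataPartitioner` above share the same flags and differ only in master URL, main class, and application arguments. One way to avoid repeating them is a small wrapper function. This is a sketch under assumptions: `run_spark` and the `DRY_RUN` variable are hypothetical names introduced here, and the flag values are copied from the commands above.

```shell
#!/usr/bin/env bash
# Hypothetical wrapper (not part of this repository).
# Usage: run_spark <master> <main-class> [application arguments...]
# With DRY_RUN set to a non-empty value, the command is printed instead of run.
run_spark() {
  local master="$1" main_class="$2"
  shift 2
  # Rebuild the positional parameters as the full spark-submit command line.
  set -- "${SPARK_HOME:?SPARK_HOME is not set}/bin/spark-submit" \
    --class "$main_class" \
    --packages io.delta:delta-core_2.12:2.4.0 \
    --master "$master" \
    --driver-memory 1g \
    --executor-memory 1g \
    --executor-cores 2 \
    build/libs/spark-scala-gradle-bootstrap-2.12.0-all.jar \
    "$@"
  if [ -n "${DRY_RUN:-}" ]; then
    printf '%s\n' "$*"   # show the command without executing it
  else
    "$@"                 # execute spark-submit with the assembled arguments
  fi
}
```

For example, `DRY_RUN=1 run_spark "local[2]" dev.template.spark.Main src/main/resources/people-example.csv` prints the local-mode command, and passing `spark://localhost:7077` as the first argument targets the standalone master instead. The `RddCollect` command above does not use the Delta package or the memory flags, so it is left out of this wrapper's scope.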
https://github.com/scoverage/gradle-scoverage
https://github.com/scoverage/gradle-scoverage/blob/master/build.gradle#L59C1-L59C52
- Spark Docs - Root Page
- Spark Programming Guide
- Spark Latest API docs
- Scala API Docs
- https://barrelsofdata.com/spark-boilerplate-using-scala
https://github.com/mahen-github/spark-scala-gradle-bootstrap/issues
https://www.databricks.com/wp-content/uploads/2021/06/Ebook_8-Steps-V2.pdf
https://github.com/spark-examples/spark-scala-examples