| Commit message (Collapse) | Author | Age | Files | Lines |
| |
|
|
|
| |
Clarify when the serializer is used, based on a recent user@ mailing list discussion.
|
|
|
| |
The documentation here is inconsistent with the coded default and other documentation.
|
|
|
|
| |
Also fix a couple of HTML/Markdown issues in other files.
|
|
|
|
|
|
| |
* RDD, *RDDFunctions -> org.apache.spark.rdd
* Utils, ClosureCleaner, SizeEstimator -> org.apache.spark.util
* JavaSerializer, KryoSerializer -> org.apache.spark.serializer
|
| |
|
|
|
|
|
|
| |
Conflicts:
core/src/main/scala/spark/SparkContext.scala
|
|
|
| |
Make the example more compilable
|
|\
| |
| |
| |
| |
| |
| | |
Conflicts:
core/src/main/scala/spark/RDD.scala
core/src/main/scala/spark/scheduler/cluster/StandaloneSchedulerBackend.scala
core/src/test/scala/spark/ShuffleSuite.scala
|
| | |
|
|/
|
|
|
|
| |
Only brand-new RDDs (e.g. those created by parallelize and makeRDD) now use
the default parallelism; everything else uses its largest parent's partitioner
or partition size.
|
| |
|
|
|
|
|
|
|
|
| |
- Edited quick start and tuning guide to simplify them a little
- Simplified top menu bar
- Made private a SparkContext constructor parameter that had been left
public
- Various small fixes
|
|
|
|
|
|
|
|
|
| |
throughout the docs: SPARK_VERSION, SCALA_VERSION, and MESOS_VERSION.
To use one, write e.g. {{site.SPARK_VERSION}}.
Also removes uses of {{HOME_PATH}}, which were being resolved to ""
by the templating system anyway.
|
|
|
|
|
|
|
|
| |
1. Slight change in organization
2. Added prerequisites
3. Made a new section about determining memory footprint
of an RDD
4. Other small changes
|
| |
|
|
|