spark - Mirror of Apache Spark

	Commit message	Author	Age	Files	Lines
*	[SPARK-16505][YARN] Optionally propagate error during shuffle service startup.	Marcelo Vanzin	2016-07-14	1	-12/+1
	This prevents the NM from starting when something is wrong, which would lead to later errors which are confusing and harder to debug. Added a unit test to verify startup fails if something is wrong. Author: Marcelo Vanzin <vanzin@cloudera.com> Closes #14162 from vanzin/SPARK-16505.
*	Fix dynamic allocation docs to address cached data.	Michael Gummelt	2016-04-26	1	-2/+3
	## What changes were proposed in this pull request? Documentation changes ## How was this patch tested? No tests Author: Michael Gummelt <mgummelt@mesosphere.io> Closes #12664 from mgummelt/fix-dynamic-docs.
*	[SPARK-13529][BUILD] Move network/* modules into common/network-*	Reynold Xin	2016-02-28	1	-1/+1
	## What changes were proposed in this pull request? As the title says, this moves the three modules currently in network/ into common/network-*. This removes one top level, non-user-facing folder. ## How was this patch tested? Compilation and existing tests. We should run both SBT and Maven. Author: Reynold Xin <rxin@databricks.com> Closes #11409 from rxin/SPARK-13529.
*	[SPARK-13521][BUILD] Remove reference to Tachyon in cluster & release scripts	Reynold Xin	2016-02-26	1	-2/+1
	## What changes were proposed in this pull request? We provide a very limited set of cluster management scripts in Spark for Tachyon, although Tachyon itself provides a much better version of them. Given that Spark users can now simply use Tachyon as a normal file system that does not require extensive configuration, we can remove these management capabilities to simplify Spark's bash scripts. Note that this also reduces coupling between a 3rd party external system and Spark's release scripts, and would eliminate the possibility of failures such as Tachyon being renamed or the tarballs being relocated. ## How was this patch tested? N/A Author: Reynold Xin <rxin@databricks.com> Closes #11400 from rxin/release-script.
*	[SPARK-12534][DOC] update documentation to list command line equivalent to properties	felixcheung	2016-01-21	1	-1/+4
	Several Spark properties equivalent to Spark submit command line options are missing. Author: felixcheung <felixcheung_m@hotmail.com> Closes #10491 from felixcheung/sparksubmitdoc.
*	[DOCUMENTATION] doc fix of job scheduling	Jeff Zhang	2016-01-08	1	-1/+1
	spark.shuffle.service.enabled is spark application related configuration, it is not necessary to set it in yarn-site.xml Author: Jeff Zhang <zjffdu@apache.org> Closes #10657 from zjffdu/doc-fix.
*	[SPARK-11809] Switch the default Mesos mode to coarse-grained mode	Reynold Xin	2015-11-18	1	-1/+1
	Based on my conversations with people, I believe the consensus is that the coarse-grained mode is more stable and easier to reason about. It is best to use that as the default rather than the more flaky fine-grained mode. Author: Reynold Xin <rxin@databricks.com> Closes #9795 from rxin/SPARK-11809.
*	[SPARK-11667] Update dynamic allocation docs to reflect supported cluster managers	Andrew Or	2015-11-12	1	-28/+27
	Author: Andrew Or <andrew@databricks.com> Closes #9637 from andrewor14/update-da-docs.
*	[SPARK-4286] Add an external shuffle service that can be run as a daemon.	Iulian Dragos	2015-04-28	1	-1/+1
	This allows Mesos deployments to use the shuffle service (and implicitly dynamic allocation). It does so by adding a new "main" class and two corresponding scripts in `sbin`: - `sbin/start-shuffle-service.sh` - `sbin/stop-shuffle-service.sh` Specific options can be passed in `SPARK_SHUFFLE_OPTS`. This is picking up work from #3861 /cc tnachen Author: Iulian Dragos <jaguarul@gmail.com> Closes #4990 from dragos/feature/external-shuffle-service and squashes the following commits: 6c2b148 [Iulian Dragos] Import order and wrong name fixup. 07804ad [Iulian Dragos] Moved ExternalShuffleService to the `deploy` package + other minor tweaks. 4dc1f91 [Iulian Dragos] Reviewer’s comments: 8145429 [Iulian Dragos] Add an external shuffle service that can be run as a daemon.
*	[SPARK-6402][DOC] - Remove some refererences to shark in docs and ec2	Pierre Borckmans	2015-03-19	1	-4/+2
	EC2 script and job scheduling documentation still referred to Shark. I removed these references. I also removed a remaining `SHARK_VERSION` variable from `ec2-variables.sh`. Author: Pierre Borckmans <pierre.borckmans@realimpactanalytics.com> Closes #5083 from pierre-borckmans/remove_refererences_to_shark_in_docs and squashes the following commits: 4e90ffc [Pierre Borckmans] Removed deprecated SHARK_VERSION caea407 [Pierre Borckmans] Remove shark reference from ec2 script doc 196c744 [Pierre Borckmans] Removed references to Shark
*	SPARK-4585. Spark dynamic executor allocation should use minExecutors as...	Sandy Ryza	2015-02-02	1	-5/+4
	... initial number Author: Sandy Ryza <sandy@cloudera.com> Closes #4051 from sryza/sandy-spark-4585 and squashes the following commits: d1dd039 [Sandy Ryza] Add spark.dynamicAllocation.initialNumExecutors and make min and max not required b7c59dc [Sandy Ryza] SPARK-4585. Spark dynamic executor allocation should use minExecutors as initial number
*	[SPARK-4915][YARN] Fix classname to be specified for external shuffle service.	Tsuyoshi Ozawa	2014-12-22	1	-1/+1
	Author: Tsuyoshi Ozawa <ozawa.tsuyoshi@lab.ntt.co.jp> Closes #3757 from oza/SPARK-4915 and squashes the following commits: 3b0d6d6 [Tsuyoshi Ozawa] Fix classname to be specified for external shuffle service.
*	[SPARK-4140] Document dynamic allocation	Andrew Or	2014-12-19	1	-0/+108
	Once the external shuffle service is also documented, the dynamic allocation section will link to it. Let me know if the whole dynamic allocation should be moved to its separate page; I personally think the organization might be cleaner that way. This patch builds on top of oza's work in #3689. aarondav pwendell Author: Andrew Or <andrew@databricks.com> Author: Tsuyoshi Ozawa <ozawa.tsuyoshi@gmail.com> Closes #3731 from andrewor14/document-dynamic-allocation and squashes the following commits: 1281447 [Andrew Or] Address a few comments b9843f2 [Andrew Or] Document the configs as well 246fb44 [Andrew Or] Merge branch 'SPARK-4839' of github.com:oza/spark into document-dynamic-allocation 8c64004 [Andrew Or] Add documentation for dynamic allocation (without configs) 6827b56 [Tsuyoshi Ozawa] Fixing a documentation of spark.dynamicAllocation.enabled. 53cff58 [Tsuyoshi Ozawa] Adding a documentation about dynamic resource allocation.
*	SPARK-1183. Don't use "worker" to mean executor	Sandy Ryza	2014-03-13	1	-2/+2
	Author: Sandy Ryza <sandy@cloudera.com> Closes #120 from sryza/sandy-spark-1183 and squashes the following commits: 5066a4a [Sandy Ryza] Remove "worker" in a couple comments 0bd1e46 [Sandy Ryza] Remove --am-class from usage bfc8fe0 [Sandy Ryza] Remove am-class from doc and fix yarn-alpha 607539f [Sandy Ryza] Address review comments 74d087a [Sandy Ryza] SPARK-1183. Don't use "worker" to mean executor
*	Add way to limit default # of cores used by applications on standalone mode	Matei Zaharia	2014-01-07	1	-3/+2
	Also documents the spark.deploy.spreadOut option.
*	Updated docs for SparkConf and handled review comments	Matei Zaharia	2013-12-30	1	-9/+12
*	Various broken links in documentation	Patrick Wendell	2013-12-07	1	-1/+1
*	Review comments	Matei Zaharia	2013-09-08	1	-1/+1
*	More fair scheduler docs and property names.	Matei Zaharia	2013-09-08	1	-7/+94
	Also changed uses of "job" terminology to "application" when they referred to an entire Spark program, to avoid confusion.
*	Work in progress:	Matei Zaharia	2013-09-08	1	-0/+81
	- Add job scheduling docs - Rename some fair scheduler properties - Organize intro page better - Link to Apache wiki for "contributing to Spark"