spark - Mirror of Apache Spark

	Commit message	Author	Age	Files	Lines
*	Merge pull request #374 from mateiz/completeness	Reynold Xin	2014-01-09	6	-6/+90
\|\	Add some missing Java API methods
These are primarily for setting job groups, canceling jobs, and setting names on RDDs. Seemed like useful stuff to expose in Java.
\| *	Add some missing Java API methods	Matei Zaharia	2014-01-09	6	-6/+90
\| \|
* \|	Merge pull request #294 from RongGu/master	Reynold Xin	2014-01-09	1	-1/+6
\|\ \	Bug fixes for updating the RDD block's memory and disk usage information
From the code context, we can see that the memSize and diskSize here are both always equal to the size of the block. In fact, they are never zero. Thus, the logic here is wrong for recording the block usage in BlockStatus, especially for blocks that are dropped from memory to make space for new input RDD blocks. I have verified that this makes the storage metrics shown on the Storage web page wrong and misleading. With this patch, the metrics are correct. Finally, Merry Christmas, guys :)
\| * \	Merge remote branch 'upstream/master'	walker	2014-01-09	128	-1060/+2943
\| \|\ \
\| * \| \|	add inline comments	walker	2014-01-07	1	-1/+1
\| \| \| \|
\| * \| \|	add inline comments	walker	2014-01-07	1	-0/+4
\| \| \| \|
\| * \| \|	Merge remote branch 'upstream/master'	walker	2014-01-07	343	-5155/+6663
\| \|\ \ \
\| * \| \| \|	Bug fixes for updating the RDD block's memory and disk usage information	walker	2013-12-25	1	-1/+2
\| \| \| \| \|
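The reasoning in the description of pull request #294 above can be illustrated with a minimal sketch. This is not Spark code: `StorageLevel`, `BlockStatus`, `naive_status`, and `level_aware_status` are hypothetical stand-ins, and the assumption that the fix keys the recorded sizes off the block's storage level is a reading of the description, not a quotation of the patch.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StorageLevel:
    """Hypothetical stand-in: where a block currently lives."""
    use_memory: bool
    use_disk: bool


@dataclass(frozen=True)
class BlockStatus:
    """Hypothetical stand-in for the per-block usage record."""
    mem_size: int
    disk_size: int


def naive_status(level, mem_size, disk_size):
    # Trusts the reported sizes as-is. If both always equal the block size,
    # a block dropped to disk is still counted as using memory.
    return BlockStatus(mem_size, disk_size)


def level_aware_status(level, mem_size, disk_size):
    # Counts a size only for the tier the storage level says is in use.
    return BlockStatus(
        mem_size if level.use_memory else 0,
        disk_size if level.use_disk else 0,
    )


block_size = 1024
dropped_to_disk = StorageLevel(use_memory=False, use_disk=True)

# Both sizes are reported as the full block size, as the description notes.
naive = naive_status(dropped_to_disk, block_size, block_size)
aware = level_aware_status(dropped_to_disk, block_size, block_size)

print(naive)
print(aware)
```

In this sketch the naive record still charges the block to memory after it has been dropped to disk, which is the kind of misleading storage metric the description reports; the level-aware record charges it to disk only.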
* \| \| \| \|	Merge pull request #293 from pwendell/standalone-driver	Patrick Wendell	2014-01-09	34	-153/+1568
\|\ \ \ \ \	SPARK-998: Support Launching Driver Inside of Standalone Mode
[NOTE: I need to bring the tests up to date with new changes, so for now they will fail]

This patch provides support for launching driver programs inside of a standalone cluster manager. It also supports monitoring and re-launching of driver programs, which is useful for long-running, recoverable applications such as Spark Streaming jobs. For those jobs, this patch allows a deployment mode which is resilient to the failure of any worker node, failure of a master node (provided a multi-master setup), and even failures of the application itself, provided they are recoverable on a restart. Driver information, such as the status and logs from a driver, is displayed in the UI.

There are a few small TODOs here, but the code is generally feature-complete. They are:
- Bring tests up to date and add test coverage
- Restarting on failure should be optional and maybe off by default.
- See if we can re-use akka connections to facilitate clients behind a firewall

A sensible place to start for review would be the `DriverClient` class, which gives users the ability to launch their driver program. I've also added an example program (`DriverSubmissionTest`) that allows you to test this locally and play around with killing workers, etc. Most of the code is devoted to persisting driver state in the cluster manager, exposing it in the UI, and dealing correctly with various types of failures.

Instructions to test locally:
- `sbt/sbt assembly/assembly examples/assembly`
- start a local version of the standalone cluster manager
```
./spark-class org.apache.spark.deploy.client.DriverClient \
  -j -Dspark.test.property=something \
  -e SPARK_TEST_KEY=SOMEVALUE \
  launch spark://10.99.1.14:7077 \
  ../path-to-examples-assembly-jar \
  org.apache.spark.examples.DriverSubmissionTest 1000 some extra options --some-option-here -X 13
```
- Go in the UI and make sure it started correctly, look at the output etc
- Kill workers, the driver program, masters, etc.
\| * \| \| \|	Some usability improvements	Patrick Wendell	2014-01-09	3	-25/+62
\| \| \| \| \|
\| * \| \| \|	Adding polling to driver submission client.	Patrick Wendell	2014-01-08	6	-68/+132
\| \| \| \| \|
\| * \| \| \|	Adding mockito to maven build	Patrick Wendell	2014-01-08	2	-0/+11
\| \| \| \| \|
\| * \| \| \|	Merge remote-tracking branch 'apache-github/master' into standalone-driver	Patrick Wendell	2014-01-08	128	-1052/+2936
\| \|\ \ \ \	Conflicts:
	core/src/test/scala/org/apache/spark/deploy/JsonProtocolSuite.scala
	pom.xml
\| * \| \| \|	Show more helpful information in UI	Patrick Wendell	2014-01-08	4	-9/+18
\| \| \| \| \|
\| * \| \| \|	Fixes	Patrick Wendell	2014-01-08	3	-4/+5
\| \| \| \| \|
\| * \| \| \|	Rename to Client	Patrick Wendell	2014-01-07	2	-8/+5
\| \| \| \| \|
\| * \| \| \|	Adding --verbose option to DriverClient	Patrick Wendell	2014-01-07	2	-4/+22
\| \| \| \| \|
\| * \| \| \|	Adding unit tests and some refactoring to promote testability.	Patrick Wendell	2014-01-07	10	-35/+264
\| \| \| \| \|
\| * \| \| \|	Some doc fixes	Patrick Wendell	2014-01-06	1	-3/+2
\| \| \| \| \|
\| * \| \| \|	Fixes after merge	Patrick Wendell	2014-01-06	3	-6/+8
\| \| \| \| \|
\| * \| \| \|	Merge remote-tracking branch 'apache-github/master' into standalone-driver	Patrick Wendell	2014-01-06	316	-3469/+4904
\| \|\ \ \ \	Conflicts:
	core/src/main/scala/org/apache/spark/deploy/client/AppClient.scala
	core/src/main/scala/org/apache/spark/deploy/client/TestClient.scala
	core/src/main/scala/org/apache/spark/deploy/master/Master.scala
	core/src/main/scala/org/apache/spark/deploy/worker/Worker.scala
	core/src/main/scala/org/apache/spark/scheduler/cluster/SparkDeploySchedulerBackend.scala
\| * \| \| \| \|	Changes based on review feedback.	Patrick Wendell	2014-01-06	7	-22/+34
\| \| \| \| \| \|
\| * \| \| \| \|	Respect supervise option at Master	Patrick Wendell	2013-12-29	1	-3/+15
\| \| \| \| \| \|
\| * \| \| \| \|	Slight change to retry logic	Patrick Wendell	2013-12-29	1	-2/+3
\| \| \| \| \| \|
\| * \| \| \| \|	TODO clean-up	Patrick Wendell	2013-12-29	5	-6/+6
\| \| \| \| \| \|
\| * \| \| \| \|	Adding driver ID to submission response	Patrick Wendell	2013-12-29	2	-2/+2
\| \| \| \| \| \|
\| * \| \| \| \|	Documentation and adding supervise option	Patrick Wendell	2013-12-29	5	-18/+51
\| \| \| \| \| \|
\| * \| \| \| \|	Changes to allow fate sharing of drivers/executors and workers.	Patrick Wendell	2013-12-29	17	-133/+239
\| \| \| \| \| \|
\| * \| \| \| \|	Some notes and TODO about dependencies	Patrick Wendell	2013-12-27	1	-1/+7
\| \| \| \| \| \|
\| * \| \| \| \|	Intermediate clean-up of tests to appease jenkins	Patrick Wendell	2013-12-26	1	-10/+25
\| \| \| \| \| \|
\| * \| \| \| \|	Minor fixes	Patrick Wendell	2013-12-26	3	-20/+25
\| \| \| \| \| \|
\| * \| \| \| \|	Addressing smaller changes from Aaron's review	Patrick Wendell	2013-12-26	7	-27/+31
\| \| \| \| \| \|
\| * \| \| \| \|	Merge pull request #1 from aarondav/driver	Patrick Wendell	2013-12-26	1	-62/+31
\| \|\ \ \ \ \	Refactor DriverClient to be more Actor-based
\| \| * \| \| \| \|	Refactor DriverClient to be more Actor-based	Aaron Davidson	2013-12-25	1	-62/+31
\| \| \| \| \| \| \|
\| * \| \| \| \| \|	Removing accidental file	Patrick Wendell	2013-12-26	2	-10/+1
\| \| \| \| \| \| \|
\| * \| \| \| \| \|	Updated approach to driver restarting	Patrick Wendell	2013-12-26	2	-23/+30
\| \|/ / / / /
\| * \| \| \| \|	Removing un-used variable	Patrick Wendell	2013-12-25	1	-2/+0
\| \| \| \| \| \|
\| * \| \| \| \|	Small fix from rebase	Patrick Wendell	2013-12-25	1	-1/+1
\| \| \| \| \| \|
\| * \| \| \| \|	Minor bug fix	Patrick Wendell	2013-12-25	2	-1/+6
\| \| \| \| \| \|
\| * \| \| \| \|	Minor style clean-up	Patrick Wendell	2013-12-25	6	-18/+17
\| \| \| \| \| \|
\| * \| \| \| \|	Import clean-up (yay Aaron)	Patrick Wendell	2013-12-25	10	-38/+33
\| \| \| \| \| \|
\| * \| \| \| \|	Adding scheduling and reporting based on cores	Patrick Wendell	2013-12-25	6	-8/+14
\| \| \| \| \| \|
\| * \| \| \| \|	Adding better option parsing	Patrick Wendell	2013-12-25	10	-42/+187
\| \| \| \| \| \|
\| * \| \| \| \|	Initial cut at driver submission.	Patrick Wendell	2013-12-25	16	-53/+781
\| \| \| \| \| \|
\| * \| \| \| \|	Renaming Client => AppClient	Patrick Wendell	2013-12-25	4	-11/+12
\| \| \| \| \| \|
* \| \| \| \| \|	Merge pull request #372 from pwendell/log4j-fix-1	Patrick Wendell	2014-01-09	2	-0/+2
\|\ \ \ \ \ \	Send logs to stderr by default (instead of stdout).
\| * \| \| \| \| \|	Send logs to stderr by default (instead of stdout).	Patrick Wendell	2014-01-09	2	-0/+2
\| \| \| \| \| \| \|
* \| \| \| \| \| \|	Merge pull request #362 from mateiz/conf-getters	Matei Zaharia	2014-01-09	34	-79/+78
\|\ \ \ \ \ \ \	Use typed getters for configuration settings
This improves some of the code style after SPARK-544.
\| * \| \| \| \| \| \|	Use typed getters for configuration settings	Matei Zaharia	2014-01-09	34	-79/+78
\| \| \| \| \| \| \| \|
* \| \| \| \| \| \| \|	Merge pull request #361 from rxin/clean	Reynold Xin	2014-01-09	4	-55/+68
\|\ \ \ \ \ \ \ \	Minor style cleanup.
Mostly indenting and line-width changes. Focused on a few important files, since they are the files that new contributors usually read first.