spark - Mirror of Apache Spark

	Commit message (Collapse)	Author	Age	Files	Lines
*	Changes based on review feedback.	Patrick Wendell	2014-01-06	7	-22/+34
\|
*	Respect supervise option at Master	Patrick Wendell	2013-12-29	1	-3/+15
\|
*	Slight change to retry logic	Patrick Wendell	2013-12-29	1	-2/+3
\|
*	TODO clean-up	Patrick Wendell	2013-12-29	5	-6/+6
\|
*	Adding driver ID to submission response	Patrick Wendell	2013-12-29	2	-2/+2
\|
*	Documentation and adding supervise option	Patrick Wendell	2013-12-29	4	-13/+18
\|
*	Changes to allow fate sharing of drivers/executors and workers.	Patrick Wendell	2013-12-29	16	-127/+229
\|
*	Some notes and TODO about dependencies	Patrick Wendell	2013-12-27	1	-1/+7
\|
*	Minor fixes	Patrick Wendell	2013-12-26	3	-20/+25
\|
*	Addressing smaller changes from Aaron's review	Patrick Wendell	2013-12-26	7	-27/+31
\|
*	Merge pull request #1 from aarondav/driver	Patrick Wendell	2013-12-26	1	-62/+31
\|\ \| \| \| \|	Refactor DriverClient to be more Actor-based
\| *	Refactor DriverClient to be more Actor-based	Aaron Davidson	2013-12-25	1	-62/+31
\| \|
* \|	Removing accidental file	Patrick Wendell	2013-12-26	2	-10/+1
\| \|
* \|	Updated approach to driver restarting	Patrick Wendell	2013-12-26	2	-23/+30
\|/
*	Removing un-used variable	Patrick Wendell	2013-12-25	1	-2/+0
\|
*	Small fix from rebase	Patrick Wendell	2013-12-25	1	-1/+1
\|
*	Minor bug fix	Patrick Wendell	2013-12-25	2	-1/+6
\|
*	Minor style clean-up	Patrick Wendell	2013-12-25	5	-18/+16
\|
*	Import clean-up (yay Aaron)	Patrick Wendell	2013-12-25	10	-38/+33
\|
*	Adding scheduling and reporting based on cores	Patrick Wendell	2013-12-25	6	-8/+14
\|
*	Adding better option parsing	Patrick Wendell	2013-12-25	9	-42/+142
\|
*	Initial cut at driver submission.	Patrick Wendell	2013-12-25	16	-53/+781
\|
*	Renaming Client => AppClient	Patrick Wendell	2013-12-25	4	-11/+12
\|
*	Merge pull request #127 from kayousterhout/consolidate_schedulers	Patrick Wendell	2013-12-24	17	-1202/+874
\|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Deduplicate Local and Cluster schedulers. The code in LocalScheduler/LocalTaskSetManager was nearly identical to the code in ClusterScheduler/ClusterTaskSetManager. The redundancy made making updating the schedulers unnecessarily painful and error- prone. This commit combines the two into a single TaskScheduler/ TaskSetManager. Unfortunately the diff makes this change look much more invasive than it is -- TaskScheduler.scala is only superficially changed (names updated, overrides removed) from the old ClusterScheduler.scala, and the same with TaskSetManager.scala. Thanks @rxin for suggesting this change!
\| *	Responded to Reynold's style comments	Kay Ousterhout	2013-12-24	3	-6/+7
\| \|
\| *	Correctly merged in maxTaskFailures fix	Kay Ousterhout	2013-12-22	1	-1/+1
\| \|
\| *	Renamed ClusterScheduler to TaskSchedulerImpl	Kay Ousterhout	2013-12-20	10	-27/+27
\| \|
\| *	Merge remote branch 'upstream/master' into consolidate_schedulers	Kay Ousterhout	2013-12-20	98	-604/+742
\| \|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: core/src/main/scala/org/apache/spark/scheduler/cluster/ClusterTaskSetManager.scala core/src/main/scala/org/apache/spark/scheduler/cluster/CoarseGrainedSchedulerBackend.scala core/src/test/scala/org/apache/spark/scheduler/TaskSetManagerSuite.scala
\| * \	Merge master into 127	Aaron Davidson	2013-12-08	42	-586/+1196
\| \|\ \
\| * \| \|	Fixed error message in ClusterScheduler to be consistent with the old ↵	Kay Ousterhout	2013-11-15	1	-2/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	LocalScheduler
\| * \| \|	Merge remote-tracking branch 'upstream/master' into consolidate_schedulers	Kay Ousterhout	2013-11-15	6	-79/+45
\| \|\ \ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: core/src/main/scala/org/apache/spark/scheduler/cluster/ClusterTaskSetManager.scala
\| * \| \| \|	Don't retry tasks if result wasn't serializable	Kay Ousterhout	2013-11-14	1	-1/+11
\| \| \| \| \|
\| * \| \| \|	Fix bug where scheduler could hang after task failure.	Kay Ousterhout	2013-11-14	1	-10/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When a task fails, we need to call reviveOffers() so that the task can be rescheduled on a different machine. In the current code, the state in ClusterTaskSetManager indicating which tasks are pending may be updated after revive offers is called (there's a race condition here), so when revive offers is called, the task set manager does not yet realize that there are failed tasks that need to be relaunched.
\| * \| \| \|	Changed local backend to use Akka actor	Kay Ousterhout	2013-11-14	1	-23/+57
\| \| \| \| \|
\| * \| \| \|	Fixed naming issues and added back ability to specify max task failures.	Kay Ousterhout	2013-11-13	7	-20/+80
\| \| \| \| \|
\| * \| \| \|	Merge remote-tracking branch 'upstream/master' into consolidate_schedulers	Kay Ousterhout	2013-11-13	29	-504/+1455
\| \|\ \ \ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: core/src/main/scala/org/apache/spark/scheduler/ClusterScheduler.scala
\| * \| \| \| \|	Extracted TaskScheduler interface.	Kay Ousterhout	2013-11-13	10	-52/+57
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Also changed the default maximum number of task failures to be 0 when running in local mode.
\| * \| \| \| \|	Cleaned up imports and fixed test bug	Kay Ousterhout	2013-10-31	2	-3/+1
\| \| \| \| \| \|
\| * \| \| \| \|	Deduplicate Local and Cluster schedulers.	Kay Ousterhout	2013-10-30	17	-1675/+1247
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The code in LocalScheduler/LocalTaskSetManager was nearly identical to the code in ClusterScheduler/ClusterTaskSetManager. The redundancy made making updating the schedulers unnecessarily painful and error- prone. This commit combines the two into a single TaskScheduler/ TaskSetManager.
* \| \| \| \| \|	Merge pull request #279 from aarondav/shuffle-cleanup0	Patrick Wendell	2013-12-24	3	-7/+35
\|\ \ \ \ \ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Clean up shuffle files once their metadata is gone Previously, we would only clean the in-memory metadata for consolidated shuffle files. Additionally, fixes a bug where the Metadata Cleaner was ignoring type-specific TTLs.
\| * \| \| \| \| \|	Clean up shuffle files once their metadata is gone	Aaron Davidson	2013-12-19	3	-7/+35
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously, we would only clean the in-memory metadata for consolidated shuffle files. Additionally, fixes a bug where the Metadata Cleaner was ignoring type- specific TTLs.
* \| \| \| \| \| \|	Merge pull request #277 from tdas/scheduler-update	Matei Zaharia	2013-12-24	2	-6/+2
\|\ \ \ \ \ \ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Refactored the streaming scheduler and added StreamingListener interface - Refactored the streaming scheduler for cleaner code. Specifically, the JobManager was renamed to JobScheduler, as it does the actual scheduling of Spark jobs to the SparkContext. The earlier Scheduler was renamed to JobGenerator, as it actually generates the jobs from the DStreams. The JobScheduler starts the JobGenerator. Also, moved all the scheduler related code from spark.streaming to spark.streaming.scheduler package. - Implemented the StreamingListener interface, similar to SparkListener. The streaming version of StatusReportListener prints the batch processing time statistics (for now). Added StreamingListernerSuite to test it. - Refactored streaming TestSuiteBase for deduping code in the other streaming testsuites.
\| * \| \| \| \| \| \|	Minor changes.	Tathagata Das	2013-12-18	1	-1/+0
\| \| \| \| \| \| \| \|
\| * \| \| \| \| \| \|	Merge branch 'apache-master' into scheduler-update	Tathagata Das	2013-12-18	92	-525/+649
\| \|\ \ \ \ \ \ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: streaming/src/main/scala/org/apache/spark/streaming/StreamingContext.scala streaming/src/main/scala/org/apache/spark/streaming/dstream/ForEachDStream.scala
\| * \| \| \| \| \| \| \|	Added StatsReportListener to generate processing time statistics across ↵	Tathagata Das	2013-12-18	1	-4/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	multiple batches.
\| * \| \| \| \| \| \| \|	Refactored streaming scheduler and added listener interface.	Tathagata Das	2013-12-12	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	- Refactored Scheduler + JobManager to JobGenerator + JobScheduler and added JobSet for cleaner code. Moved scheduler related code to streaming.scheduler package. - Added StreamingListener trait (similar to SparkListener) to enable gathering to streaming stats like processing times and delays. StreamingContext.addListener() to added listeners. - Deduped some code in streaming tests by modifying TestSuiteBase, and added StreamingListenerSuite.
* \| \| \| \| \| \| \| \|	Merge pull request #244 from leftnoteasy/master	Reynold Xin	2013-12-23	6	-5/+184
\|\ \ \ \ \ \ \ \ \ \| \|_\|_\|_\|_\|_\|_\|_\|/ \|/\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Added SPARK-968 implementation for review Added SPARK-968 implementation for review
\| * \| \| \| \| \| \| \|	SPARK-968, added executor address showing in aggregated metrics by executors ↵	wangda.tan	2013-12-23	1	-0/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	table
\| * \| \| \| \| \| \| \|	added changes according to comments from rxin	wangda.tan	2013-12-22	7	-44/+24
\| \| \| \| \| \| \| \| \|
\| * \| \| \| \| \| \| \|	spark-968, changes for avoid a NPE	wangda.tan	2013-12-17	2	-25/+29
\| \| \| \| \| \| \| \| \|