spark - Mirror of Apache Spark

	Commit message (Collapse)	Author	Age	Files	Lines
*	Removed extra empty lines.	Tathagata Das	2013-12-31	1	-1/+0
\|
*	Removed unnecessary comments.	Tathagata Das	2013-12-31	3	-55/+8
\|
*	Refactored kafka, flume, zeromq, mqtt as separate external projects, with ↵	Tathagata Das	2013-12-30	12	-978/+81
\| \| \| \|	their own self-contained scala API, java API, scala unit tests and java unit tests. Updated examples to use the external projects.
*	Refactored streaming project to separate out the twitter functionality.	Tathagata Das	2013-12-26	3	-105/+8
\|
*	Minor change for PR 277.	Tathagata Das	2013-12-23	1	-1/+1
\|
*	Minor formatting fixes.	Tathagata Das	2013-12-23	1	-5/+4
\|
*	Added comments to BatchInfo and JobSet, based on Patrick's comment on PR 277.	Tathagata Das	2013-12-23	2	-3/+26
\|
*	Minor updates based on comments on PR 277.	Tathagata Das	2013-12-20	2	-1/+6
\|
*	Minor changes.	Tathagata Das	2013-12-18	9	-32/+36
\|
*	Merge branch 'apache-master' into scheduler-update	Tathagata Das	2013-12-18	41	-397/+445
\|\	Conflicts: streaming/src/main/scala/org/apache/spark/streaming/StreamingContext.scala streaming/src/main/scala/org/apache/spark/streaming/dstream/ForEachDStream.scala
\| *	Style fixes and addressed review comments at #221	Prashant Sharma	2013-12-10	1	-2/+2
\| \|
\| *	Merge branch 'master' of github.com:apache/incubator-spark into scala-2.10-temp	Prashant Sharma	2013-11-21	5	-11/+12
\| \|\	Conflicts: core/src/main/scala/org/apache/spark/util/collection/PrimitiveVector.scala streaming/src/main/scala/org/apache/spark/streaming/api/java/JavaStreamingContext.scala
\| * \	Merge branch 'scala210-master' of github.com:colorant/incubator-spark into ↵	Prashant Sharma	2013-11-21	22	-257/+1454
\| \|\ \	scala-2.10 Conflicts: core/src/main/scala/org/apache/spark/deploy/client/Client.scala core/src/main/scala/org/apache/spark/deploy/worker/Worker.scala core/src/main/scala/org/apache/spark/executor/CoarseGrainedExecutorBackend.scala core/src/test/scala/org/apache/spark/MapOutputTrackerSuite.scala
\| \| * \|	Various merge corrections	Aaron Davidson	2013-11-14	5	-63/+2
\| \| \| \|	I've diff'd this patch against my own -- since they were both created independently, this means that two sets of eyes have gone over all the merge conflicts that were created, so I'm feeling significantly more confident in the resulting PR. @rxin has looked at the changes to the repl and is resoundingly confident that they are correct.
\| \| * \|	Merge branch 'master' into scala-2.10	Raymond Liu	2013-11-14	2	-7/+80
\| \| \|\ \
\| \| * \ \	Merge branch 'master' into scala-2.10	Raymond Liu	2013-11-13	21	-190/+1375
\| \| \|\ \ \
\| * \| \| \| \|	Remove deprecated actorFor and use actorSelection everywhere.	Prashant Sharma	2013-11-12	1	-1/+1
\| \|/ / / /
\| * \| \| \|	fixed some warnings	Martin Weindel	2013-10-05	15	-61/+76
\| \| \| \| \|
\| * \| \| \|	Akka 2.2 migration	Prashant Sharma	2013-09-22	6	-12/+15
\| \| \| \| \|
\| * \| \| \|	Few more fixes to tests broken during merge	Prashant Sharma	2013-09-10	1	-3/+3
\| \| \| \| \|
\| * \| \| \|	Merged with master	Prashant Sharma	2013-09-06	76	-699/+1706
\| \|\ \ \ \
\| * \| \| \| \|	code formatting, The warning related to scope exit and enter is not worth ↵	Prashant Sharma	2013-07-16	1	-14/+14
\| \| \| \| \| \|	fixing as it only affects debugging scopes and nothing else.
\| * \| \| \| \|	Fixed warning erasure -> runtimeClass	Prashant Sharma	2013-07-16	1	-1/+1
\| \| \| \| \| \|
\| * \| \| \| \|	Fixed warning Throwables	Prashant Sharma	2013-07-16	1	-1/+1
\| \| \| \| \| \|
\| * \| \| \| \|	Fixed warning ClassManifest -> ClassTag	Prashant Sharma	2013-07-16	1	-1/+1
\| \| \| \| \| \|
\| * \| \| \| \|	Merge branch 'master' into master-merge	Prashant Sharma	2013-07-12	5	-18/+25
\| \|\ \ \ \ \	Conflicts: README.md core/pom.xml core/src/main/scala/spark/deploy/JsonProtocol.scala core/src/main/scala/spark/deploy/LocalSparkCluster.scala core/src/main/scala/spark/deploy/master/Master.scala core/src/main/scala/spark/deploy/master/MasterWebUI.scala core/src/main/scala/spark/deploy/worker/Worker.scala core/src/main/scala/spark/deploy/worker/WorkerWebUI.scala core/src/main/scala/spark/storage/BlockManagerUI.scala core/src/main/scala/spark/util/AkkaUtils.scala pom.xml project/SparkBuild.scala streaming/src/main/scala/spark/streaming/receivers/ActorReceiver.scala
\| * \ \ \ \ \	Merge branch 'master' into master-merge	Prashant Sharma	2013-07-03	15	-148/+249
\| \|\ \ \ \ \ \	Conflicts: core/pom.xml core/src/main/scala/spark/MapOutputTracker.scala core/src/main/scala/spark/RDD.scala core/src/main/scala/spark/RDDCheckpointData.scala core/src/main/scala/spark/SparkContext.scala core/src/main/scala/spark/Utils.scala core/src/main/scala/spark/api/python/PythonRDD.scala core/src/main/scala/spark/deploy/client/Client.scala core/src/main/scala/spark/deploy/master/MasterWebUI.scala core/src/main/scala/spark/deploy/worker/Worker.scala core/src/main/scala/spark/deploy/worker/WorkerWebUI.scala core/src/main/scala/spark/rdd/BlockRDD.scala core/src/main/scala/spark/rdd/ZippedRDD.scala core/src/main/scala/spark/scheduler/cluster/StandaloneSchedulerBackend.scala core/src/main/scala/spark/storage/BlockManager.scala core/src/main/scala/spark/storage/BlockManagerMaster.scala core/src/main/scala/spark/storage/BlockManagerMasterActor.scala core/src/main/scala/spark/storage/BlockManagerUI.scala core/src/main/scala/spark/util/AkkaUtils.scala core/src/test/scala/spark/SizeEstimatorSuite.scala pom.xml project/SparkBuild.scala repl/src/main/scala/spark/repl/SparkILoop.scala repl/src/test/scala/spark/repl/ReplSuite.scala streaming/src/main/scala/spark/streaming/StreamingContext.scala streaming/src/main/scala/spark/streaming/api/java/JavaStreamingContext.scala streaming/src/main/scala/spark/streaming/dstream/KafkaInputDStream.scala streaming/src/main/scala/spark/streaming/util/MasterFailureTest.scala
\| * \| \| \| \| \| \|	Fixed other warnings	Prashant Sharma	2013-04-29	1	-3/+1
\| \| \| \| \| \| \| \|
\| * \| \| \| \| \| \|	Fixed warning: erasure -> runtimeClass	Prashant Sharma	2013-04-29	1	-4/+4
\| \| \| \| \| \| \| \|
\| * \| \| \| \| \| \|	Fixed Warning: ClassManifest -> ClassTag	Prashant Sharma	2013-04-29	39	-239/+285
\| \| \| \| \| \| \| \|
\| * \| \| \| \| \| \|	Fixed breaking tests in streaming checkpoint suite. Changed RichInt to Int ↵	Prashant Sharma	2013-04-25	1	-16/+19
\| \| \| \| \| \| \| \|	as it is final and not serializable
\| * \| \| \| \| \| \|	scala 2.10 and master merge	Prashant Sharma	2013-04-24	4	-15/+16
\| \| \| \| \| \| \| \|
* \| \| \| \| \| \| \|	Added StatsReportListener to generate processing time statistics across ↵	Tathagata Das	2013-12-18	2	-2/+45
\| \| \| \| \| \| \| \|	multiple batches.
* \| \| \| \| \| \| \|	Refactored streaming scheduler and added listener interface.	Tathagata Das	2013-12-12	22	-195/+496
\| \|_\|_\|_\|_\|_\|/	- Refactored Scheduler + JobManager to JobGenerator + JobScheduler and added JobSet for cleaner code. Moved scheduler related code to streaming.scheduler package.
\|/\| \| \| \| \| \|	- Added StreamingListener trait (similar to SparkListener) to enable gathering of streaming stats like processing times and delays. StreamingContext.addListener() adds listeners.
\| \| \| \| \| \| \|	- Deduped some code in streaming tests by modifying TestSuiteBase, and added StreamingListenerSuite.
* \| \| \| \| \| \|	Another set of changes to remove unnecessary semicolon (;) from Scala code.	Henry Saputra	2013-11-19	1	-1/+3
\| \| \| \| \| \| \|	Passed the sbt/sbt compile and test
* \| \| \| \| \| \|	Remove the semicolons at the end of Scala code to make it more pure Scala code.	Henry Saputra	2013-11-19	5	-10/+9
\| \|_\|_\|_\|_\|/ \|/\| \| \| \| \|	Also remove unused imports as I found them along the way. Remove return statements when returning value in the Scala code. Passing compile and tests.
* \| \| \| \| \|	Made block generator thread safe to fix Kafka bug.	Tathagata Das	2013-11-12	2	-7/+80
\| \|_\|_\|_\|/ \|/\| \| \| \|
* \| \| \| \|	Merge branch 'apache-master' into transform	Tathagata Das	2013-10-25	8	-15/+176
\|\ \ \ \ \
\| * \| \| \| \|	Style fixes	Patrick Wendell	2013-10-24	1	-9/+9
\| \| \| \| \| \|
\| * \| \| \| \|	Spacing fix	Patrick Wendell	2013-10-24	1	-4/+4
\| \| \| \| \| \|
\| * \| \| \| \|	Small spacing fix	Patrick Wendell	2013-10-24	1	-2/+2
\| \| \| \| \| \|
\| * \| \| \| \|	Adding Java versions and associated tests	Patrick Wendell	2013-10-24	4	-0/+68
\| \| \| \| \| \|
\| * \| \| \| \|	Some clean-up of tests	Patrick Wendell	2013-10-24	3	-7/+10
\| \| \| \| \| \|
\| * \| \| \| \|	Removing Java for now	Patrick Wendell	2013-10-24	1	-7/+0
\| \| \| \| \| \|
\| * \| \| \| \|	Adding tests	Patrick Wendell	2013-10-24	2	-5/+88
\| \| \| \| \| \|
\| * \| \| \| \|	Add a `repartition` operator.	Patrick Wendell	2013-10-24	2	-0/+14
\| \| \| \| \| \|	This patch adds an operator called repartition with more straightforward semantics than the current `coalesce` operator. There are a few use cases where this operator is useful:
\| \| \| \| \| \|	1. If a user wants to increase the number of partitions in the RDD. This is more common now with streaming. E.g. a user is ingesting data on one node but they want to add more partitions to ensure parallelism of subsequent operations across threads or the cluster. Right now they have to call rdd.coalesce(numSplits, shuffle=true) - that's super confusing.
\| \| \| \| \| \|	2. If a user has input data where the number of partitions is not known. E.g. > sc.textFile("some file").coalesce(50).... This is both vague semantically (am I growing or shrinking this RDD) but also, may not work correctly if the base RDD has fewer than 50 partitions.
\| \| \| \| \| \|	The new operator forces shuffles every time, so it will always produce exactly the number of new partitions. It also throws an exception rather than silently not-working if a bad input is passed.
\| \| \| \| \| \|	I am currently adding streaming tests (requires refactoring some of the test suite to allow testing at partition granularity), so this is not ready for merge yet. But feedback is welcome.
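The difference described in the commit message above can be illustrated with a small toy model. This is a plain-Python sketch, not Spark code: the functions `coalesce` and `repartition` below are hypothetical stand-ins that operate on lists of lists, and the round-robin placement is an assumption made for illustration only. It shows why a no-shuffle coalesce cannot grow the partition count, while a shuffle-based repartition always yields exactly the requested number and rejects bad input.

```python
from typing import List, Sequence


def coalesce(partitions: Sequence[List[int]], n: int) -> List[List[int]]:
    """Toy no-shuffle coalesce: whole partitions are merged, never split."""
    if n <= 0:
        raise ValueError("number of partitions must be positive")
    # Without a shuffle the partition count can shrink but cannot grow.
    target = min(n, len(partitions))
    merged: List[List[int]] = [[] for _ in range(target)]
    for i, part in enumerate(partitions):
        merged[i % target].extend(part)  # each input partition moves as a unit
    return merged


def repartition(partitions: Sequence[List[int]], n: int) -> List[List[int]]:
    """Toy shuffle-based repartition: records are redistributed one by one."""
    if n <= 0:
        raise ValueError("number of partitions must be positive")
    out: List[List[int]] = [[] for _ in range(n)]
    records = [r for part in partitions for r in part]
    for i, record in enumerate(records):
        out[i % n].append(record)  # assumed round-robin placement
    return out


# One input partition, as in the "ingesting data on one node" case.
data = [[1, 2, 3, 4, 5, 6]]
print(len(coalesce(data, 4)))     # 1: cannot grow without a shuffle
print(len(repartition(data, 4)))  # 4: exactly the requested count
```

In this model, asking `coalesce` for more partitions than exist silently returns fewer, which mirrors the "may not work correctly" concern in the message, whereas `repartition` either produces the requested count or raises.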
* \| \| \| \| \|	Fixed accidental bug.	Tathagata Das	2013-10-24	1	-1/+1
\| \| \| \| \| \|
* \| \| \| \| \|	Merge branch 'apache-master' into transform	Tathagata Das	2013-10-24	2	-0/+124
\|\\| \| \| \| \|
\| * \| \| \| \|	Merge pull request #64 from prabeesh/master	Matei Zaharia	2013-10-23	2	-0/+124
\| \|\ \ \ \ \	MQTT Adapter for Spark Streaming
\| \| \| \| \| \| \|	MQTT is a machine-to-machine (M2M)/Internet of Things connectivity protocol. It was designed as an extremely lightweight publish/subscribe messaging transport. You may read more about it here: http://mqtt.org/
\| \| \| \| \| \| \|	Message Queue Telemetry Transport (MQTT) is an open message protocol for M2M communications. It enables the transfer of telemetry-style data in the form of messages from devices like sensors and actuators, to mobile phones, embedded systems on vehicles, or laptops and full scale computers. The protocol was invented by Andy Stanford-Clark of IBM, and Arlen Nipper of Cirrus Link Solutions.
\| \| \| \| \| \| \|	This protocol enables a publish/subscribe messaging model in an extremely lightweight way. It is useful for connections with remote locations where line of code and network bandwidth is a constraint. MQTT is one of the widely used protocols for 'Internet of Things'. This protocol is getting much attraction as anything and everything is getting connected to internet and they all produce data. Researchers and companies predict some 25 billion devices will be connected to the internet by 2015. Plugin/Support for MQTT is available in popular MQs like RabbitMQ, ActiveMQ etc.
\| \| \| \| \| \| \|	Support for MQTT in Spark will help people with Internet of Things (IoT) projects to use Spark Streaming for their real time data processing needs (from sensors and other embedded devices etc).
\| \| * \| \| \| \|	Update MQTTInputDStream.scala	Prabeesh K	2013-10-18	1	-4/+11
\| \| \| \| \| \| \|