spark - Mirror of Apache Spark

	Commit message (Collapse)	Author	Age	Files	Lines
*	graph -> graphx	Ankur Dave	2014-01-09	1	-6/+6
\|
*	Merge remote-tracking branch 'spark-upstream/master' into HEAD	Ankur Dave	2014-01-08	1	-88/+173
\|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: README.md core/src/main/scala/org/apache/spark/util/collection/OpenHashMap.scala core/src/main/scala/org/apache/spark/util/collection/OpenHashSet.scala core/src/main/scala/org/apache/spark/util/collection/PrimitiveKeyOpenHashMap.scala pom.xml project/SparkBuild.scala repl/src/main/scala/org/apache/spark/repl/SparkILoop.scala
\| *	Merge pull request #313 from tdas/project-refactor	Patrick Wendell	2014-01-07	1	-26/+67
\| \|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Refactored the streaming project to separate external libraries like Twitter, Kafka, Flume, etc. At a high level, these are the following changes. 1. All the external code was put in `SPARK_HOME/external/` as separate SBT projects and Maven modules. Their artifact names are `spark-streaming-twitter`, `spark-streaming-kafka`, etc. Both SparkBuild.scala and pom.xml files have been updated. References to external libraries and repositories have been removed from the settings of root and streaming projects/modules. 2. To avail the external functionality (say, creating a Twitter stream), the developer has to `import org.apache.spark.streaming.twitter._` . For Scala API, the developer has to call `TwitterUtils.createStream(streamingContext, ...)`. For the Java API, the developer has to call `TwitterUtils.createStream(javaStreamingContext, ...)`. 3. Each external project has its own scala and java unit tests. Note the unit tests of each external library use classes of the streaming unit tests (`TestSuiteBase`, `LocalJavaStreamingContext`, etc.). To enable this code sharing among test classes, `dependsOn(streaming % "compile->compile,test->test")` was used in the SparkBuild.scala . In the streaming/pom.xml, an additional `maven-jar-plugin` was necessary to capture this dependency (see comment inside the pom.xml for more information). 4. Jars of the external projects have been added to examples project but not to the assembly project. 5. In some files, imports have been rearrange to conform to the Spark coding guidelines.
\| \| *	Merge remote-tracking branch 'apache/master' into project-refactor	Tathagata Das	2014-01-06	1	-14/+40
\| \| \|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: examples/src/main/java/org/apache/spark/streaming/examples/JavaFlumeEventCount.java streaming/src/main/scala/org/apache/spark/streaming/StreamingContext.scala streaming/src/main/scala/org/apache/spark/streaming/api/java/JavaStreamingContext.scala streaming/src/test/java/org/apache/spark/streaming/JavaAPISuite.java streaming/src/test/scala/org/apache/spark/streaming/InputStreamsSuite.scala streaming/src/test/scala/org/apache/spark/streaming/TestSuiteBase.scala
\| \| * \|	Added pom.xml for external projects and removed unnecessary dependencies and ↵	Tathagata Das	2013-12-31	1	-14/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	repositoris from other poms and sbt.
\| \| * \|	Refactored kafka, flume, zeromq, mqtt as separate external projects, with ↵	Tathagata Das	2013-12-30	1	-25/+64
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	their own self-contained scala API, java API, scala unit tests and java unit tests. Updated examples to use the external projects.
\| \| * \|	Refactored streaming project to separate out the twitter functionality.	Tathagata Das	2013-12-26	1	-2/+11
\| \| \| \|
\| * \| \|	Merge pull request #340 from ScrapCodes/sbt-fixes	Patrick Wendell	2014-01-06	1	-5/+3
\| \|\ \ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Made java options to be applied during tests so that they become self explanatory.
\| \| * \| \|	Made java options to be applied during tests so that they become self ↵	Prashant Sharma	2014-01-06	1	-5/+3
\| \| \| \|/ \| \| \|/\| \| \| \| \| \| \| \| \|	explanatory.
\| * / \|	SPARK-1005 Ning upgrade	Prashant Sharma	2014-01-06	1	-1/+1
\| \|/ /
\| * \|	Merge remote-tracking branch 'apache-github/master' into remove-binaries	Patrick Wendell	2014-01-03	1	-7/+25
\| \|\ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: core/src/test/scala/org/apache/spark/DriverSuite.scala docs/python-programming-guide.md
\| \| * \|	Using name yarn-alpha/yarn instead of yarn-2.0/yarn-2.2	Raymond Liu	2014-01-03	1	-8/+8
\| \| \| \|
\| \| * \|	Add yarn/common/src/test dir in building script	Raymond Liu	2014-01-03	1	-0/+7
\| \| \| \|
\| \| * \|	Use unmanaged source dir to include common yarn code	Raymond Liu	2014-01-03	1	-11/+15
\| \| \| \|
\| \| * \|	Reorganize yarn related codes into sub projects to remove duplicate files.	Raymond Liu	2014-01-03	1	-8/+15
\| \| \| \|
\| * \| \|	Changes on top of Prashant's patch.	Patrick Wendell	2014-01-03	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Closes #316
\| * \| \|	fixed review comments	Prashant Sharma	2014-01-03	1	-5/+9
\| \| \| \|
\| * \| \|	Merge branch 'master' into spark-1002-remove-jars	Prashant Sharma	2014-01-03	1	-0/+1
\| \|\\| \|
\| \| * \|	Merge remote-tracking branch 'apache/master' into conf2	Matei Zaharia	2014-01-01	1	-1/+2
\| \| \|\ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: project/SparkBuild.scala
\| \| * \ \	Merge remote-tracking branch 'apache/master' into conf2	Matei Zaharia	2013-12-31	1	-1/+1
\| \| \|\ \ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: core/src/main/scala/org/apache/spark/rdd/CheckpointRDD.scala streaming/src/main/scala/org/apache/spark/streaming/Checkpoint.scala streaming/src/main/scala/org/apache/spark/streaming/scheduler/JobGenerator.scala
\| \| * \ \ \	Merge remote-tracking branch 'origin/master' into conf2	Matei Zaharia	2013-12-29	1	-1/+4
\| \| \|\ \ \ \ \| \| \| \| \|_\|/ \| \| \| \|/\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: core/src/main/scala/org/apache/spark/SparkContext.scala core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala core/src/main/scala/org/apache/spark/scheduler/TaskSchedulerImpl.scala core/src/main/scala/org/apache/spark/scheduler/cluster/ClusterTaskSetManager.scala core/src/main/scala/org/apache/spark/scheduler/local/LocalScheduler.scala core/src/main/scala/org/apache/spark/util/MetadataCleaner.scala core/src/test/scala/org/apache/spark/scheduler/TaskResultGetterSuite.scala core/src/test/scala/org/apache/spark/scheduler/TaskSetManagerSuite.scala new-yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala streaming/src/main/scala/org/apache/spark/streaming/Checkpoint.scala streaming/src/main/scala/org/apache/spark/streaming/api/java/JavaStreamingContext.scala streaming/src/main/scala/org/apache/spark/streaming/scheduler/JobGenerator.scala streaming/src/test/scala/org/apache/spark/streaming/BasicOperationsSuite.scala streaming/src/test/scala/org/apache/spark/streaming/CheckpointSuite.scala streaming/src/test/scala/org/apache/spark/streaming/InputStreamsSuite.scala streaming/src/test/scala/org/apache/spark/streaming/TestSuiteBase.scala streaming/src/test/scala/org/apache/spark/streaming/WindowOperationsSuite.scala
\| \| * \| \| \|	spark-544, introducing SparkConf and related configuration overhaul.	Prashant Sharma	2013-12-25	1	-1/+2
\| \| \| \| \| \|
\| * \| \| \| \|	Deleted py4j jar and added to assembly dependency	Prashant Sharma	2014-01-02	1	-0/+1
\| \| \|_\|_\|/ \| \|/\| \| \|
\| * \| \| \|	Merge pull request #73 from falaki/ApproximateDistinctCount	Reynold Xin	2013-12-31	1	-1/+2
\| \|\ \ \ \ \| \| \|_\|_\|/ \| \|/\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Approximate distinct count Added countApproxDistinct() to RDD and countApproxDistinctByKey() to PairRDDFunctions to approximately count distinct number of elements and distinct number of values per key, respectively. Both functions use HyperLogLog from stream-lib for counting. Both functions take a parameter that controls the trade-off between accuracy and memory consumption. Also added Scala docs and test suites for both methods.
\| \| * \| \|	Added stream 2.5.1 jar depenency	Hossein Falaki	2013-12-30	1	-1/+2
\| \| \| \|/ \| \| \|/\|
\| * / \|	upgrade Netty from 4.0.0.Beta2 to 4.0.13.Final	Binh Nguyen	2013-12-24	1	-1/+1
\| \|/ /
\| * /	Show full stack trace and time taken in unit tests.	Reynold Xin	2013-12-23	1	-1/+4
\| \|/
\| *	[SPARK-959] Explicitly depend on org.eclipse.jetty.orbit jar	Aaron Davidson	2013-12-18	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Without this, in some cases, Ivy attempts to download the wrong file and fails, stopping the whole build. See bug for more details. (This is probably also the beginning of the slow death of our recently prettified dependencies. Form follow function.)
\| *	Attempt with extra repositories	Patrick Wendell	2013-12-16	1	-22/+10
\| \|
\| *	Review comments on the PR for scala 2.10 migration.	Prashant Sharma	2013-12-13	1	-3/+3
\| \|
\| *	Disabled yarn 2.2 and added a message in the sbt build	Prashant Sharma	2013-12-12	1	-7/+17
\| \|
\| *	Merge branch 'akka-bug-fix' of github.com:ScrapCodes/incubator-spark into ↵	Prashant Sharma	2013-12-11	1	-1/+1
\| \|\ \| \| \| \| \| \| \| \| \|	akka-bug-fix
\| \| *	added eclipse repository for spark streaming.	Prashant Sharma	2013-12-11	1	-1/+1
\| \| \|
\| * \|	Merge branch 'master' into akka-bug-fix	Prashant Sharma	2013-12-11	1	-7/+23
\| \|\ \ \| \| \|/ \| \|/\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: core/pom.xml core/src/main/scala/org/apache/spark/scheduler/DAGScheduler.scala pom.xml project/SparkBuild.scala streaming/pom.xml yarn/src/main/scala/org/apache/spark/deploy/yarn/YarnAllocationHandler.scala
\| \| *	Use published "org.spark-project.akka-*" in sbt build for Hadoop-2.2 ↵	Harvey Feng	2013-12-03	1	-13/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	dependencies. This also includes: -Change `isNewYarn` to `isNewHadoop`, since the protobuf-2.5 dependency is from Hadoop-2.2 itself. -Regexp bugix Credits to @alig for this patch.
\| \| *	Merge remote-tracking branch 'origin/master' into yarn-2.2	Harvey Feng	2013-11-26	1	-0/+1
\| \| \|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: yarn/src/main/scala/org/apache/spark/deploy/yarn/Client.scala
\| \| * \|	Add optional Hadoop 2.2 settings in sbt build.	Harvey Feng	2013-11-26	1	-9/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If the Hadoop used is version 2.2 or derived from it, then Spark will be compiled against protobuf-2.5 and a protobuf-2.5 version of Akka 2.0.5.
\| * \| \|	Merge branch 'master' into scala-2.10-wip	Prashant Sharma	2013-11-25	1	-1/+2
\| \|\ \ \ \| \| \| \|/ \| \| \|/\| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: core/src/main/scala/org/apache/spark/rdd/RDD.scala project/SparkBuild.scala
\| * \| \|	Use Kafka 2.10 (again)	Aaron Davidson	2013-11-14	1	-2/+3
\| \| \| \|
\| * \| \|	Various merge corrections	Aaron Davidson	2013-11-14	1	-9/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I've diff'd this patch against my own -- since they were both created independently, this means that two sets of eyes have gone over all the merge conflicts that were created, so I'm feeling significantly more confident in the resulting PR. @rxin has looked at the changes to the repl and is resoundingly confident that they are correct.
\| * \| \|	Some fixes for previous master merge commits	Raymond Liu	2013-11-15	1	-0/+1
\| \| \| \|
\| * \| \|	Merge branch 'master' into scala-2.10	Raymond Liu	2013-11-14	1	-2/+3
\| \|\ \ \ \| \| \| \|/ \| \| \|/\|
\| * \| \|	Merge branch 'master' into scala-2.10	Raymond Liu	2013-11-13	1	-5/+30
\| \|\ \ \
\| * \| \| \|	Updating to latest akka 2.2.3, which fixes our only failing Driver Suite	Prashant Sharma	2013-10-24	1	-4/+4
\| \| \| \| \|
\| * \| \| \|	Merge branch 'scala-2.10' of github.com:ScrapCodes/spark into scala-2.10	Prashant Sharma	2013-10-10	1	-8/+14
\| \|\ \ \ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: core/src/main/scala/org/apache/spark/scheduler/cluster/ClusterTaskSetManager.scala project/SparkBuild.scala
\| \| * \ \ \	Merge branch 'master' into wip-merge-master	Prashant Sharma	2013-10-08	1	-6/+8
\| \| \|\ \ \ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: bagel/pom.xml core/pom.xml core/src/test/scala/org/apache/spark/ui/UISuite.scala examples/pom.xml mllib/pom.xml pom.xml project/SparkBuild.scala repl/pom.xml streaming/pom.xml tools/pom.xml In scala 2.10, a shorter representation is used for naming artifacts so changed to shorter scala version for artifacts and made it a property in pom.
\| \| * \ \ \ \	Merge branch 'master' into scala-2.10	Prashant Sharma	2013-10-05	1	-0/+3
\| \| \|\ \ \ \ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: core/src/test/scala/org/apache/spark/DistributedSuite.scala project/SparkBuild.scala
\| \| * \ \ \ \ \	Merge branch 'master' into scala-2.10	Prashant Sharma	2013-10-01	1	-2/+3
\| \| \|\ \ \ \ \ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: core/src/main/scala/org/apache/spark/ui/jobs/JobProgressUI.scala docs/_config.yml project/SparkBuild.scala repl/src/main/scala/org/apache/spark/repl/SparkILoop.scala
\| * \| \| \| \| \| \| \|	scala 2.10 requires Java 1.6,	Martin Weindel	2013-10-05	1	-3/+3
\| \|/ / / / / / / \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	using Scala 2.10.3, resolved maven-scala-plugin warning
\| * \| \| \| \| \| \|	Sync with master and some build fixes	Prashant Sharma	2013-09-26	1	-8/+8
\| \|\ \ \ \ \ \ \