spark - Mirror of Apache Spark

	Commit message	Author	Age	Files	Lines
*	fixed job name and usage information for the JavaSparkPi example	Kevin Mader	2014-01-22	1	-2/+2
\|
*	Added StreamingContext.awaitTermination to streaming examples.	Tathagata Das	2014-01-20	4	-0/+4
\|
*	Updated java API docs for streaming, along with very minor changes in the code examples.	Tathagata Das	2014-01-16	1	-2/+1
\|
*	Merge remote-tracking branch 'apache/master' into driver-test	Tathagata Das	2014-01-10	4	-1/+9
\|\	Conflicts: streaming/src/main/scala/org/apache/spark/streaming/DStreamGraph.scala
\| *	Minor clean-up	Patrick Wendell	2014-01-09	1	-1/+1
\| \|
\| *	Set default logging to WARN for Spark streaming examples.	Patrick Wendell	2014-01-09	4	-0/+8
\| \|	This programmatically sets the log level to WARN by default for streaming tests. If the user has already specified a log4j.properties file, the user's file will take precedence over this default.
* \|	Merge branch 'standalone-driver' into driver-test	Tathagata Das	2014-01-09	14	-107/+162
\|\\|	Conflicts:
\| \| \|	core/src/main/scala/org/apache/spark/SparkContext.scala
\| \| \|	core/src/main/scala/org/apache/spark/deploy/worker/DriverRunner.scala
\| \| \|	examples/src/main/java/org/apache/spark/streaming/examples/JavaNetworkWordCount.java
\| \| \|	streaming/src/main/scala/org/apache/spark/streaming/Checkpoint.scala
\| \| \|	streaming/src/main/scala/org/apache/spark/streaming/StreamingContext.scala
\| \| \|	streaming/src/main/scala/org/apache/spark/streaming/api/java/JavaStreamingContext.scala
\| \| \|	streaming/src/main/scala/org/apache/spark/streaming/scheduler/JobGenerator.scala
\| *	Merge pull request #313 from tdas/project-refactor	Patrick Wendell	2014-01-07	2	-8/+9
\| \|\	Refactored the streaming project to separate external libraries like Twitter, Kafka, Flume, etc. At a high level, the changes are as follows.
\| \| \|	1. All the external code was put in `SPARK_HOME/external/` as separate SBT projects and Maven modules. Their artifact names are `spark-streaming-twitter`, `spark-streaming-kafka`, etc. Both SparkBuild.scala and pom.xml files have been updated. References to external libraries and repositories have been removed from the settings of root and streaming projects/modules.
\| \| \|	2. To use the external functionality (say, creating a Twitter stream), the developer has to `import org.apache.spark.streaming.twitter._`. For the Scala API, the developer has to call `TwitterUtils.createStream(streamingContext, ...)`. For the Java API, the developer has to call `TwitterUtils.createStream(javaStreamingContext, ...)`.
\| \| \|	3. Each external project has its own Scala and Java unit tests. Note that the unit tests of each external library use classes of the streaming unit tests (`TestSuiteBase`, `LocalJavaStreamingContext`, etc.). To enable this code sharing among test classes, `dependsOn(streaming % "compile->compile,test->test")` was used in SparkBuild.scala. In streaming/pom.xml, an additional `maven-jar-plugin` was necessary to capture this dependency (see the comment inside the pom.xml for more information).
\| \| \|	4. Jars of the external projects have been added to the examples project but not to the assembly project.
\| \| \|	5. In some files, imports have been rearranged to conform to the Spark coding guidelines.
\| \| *	Removed XYZFunctions and added XYZUtils as a common Scala and Java interface for creating XYZ streams.	Tathagata Das	2014-01-07	2	-8/+6
\| \| \|
\| \| *	Merge remote-tracking branch 'apache/master' into project-refactor	Tathagata Das	2014-01-06	14	-15/+19
\| \| \|\	Conflicts:
\| \| \| \|	examples/src/main/java/org/apache/spark/streaming/examples/JavaFlumeEventCount.java
\| \| \| \|	streaming/src/main/scala/org/apache/spark/streaming/StreamingContext.scala
\| \| \| \|	streaming/src/main/scala/org/apache/spark/streaming/api/java/JavaStreamingContext.scala
\| \| \| \|	streaming/src/test/java/org/apache/spark/streaming/JavaAPISuite.java
\| \| \| \|	streaming/src/test/scala/org/apache/spark/streaming/InputStreamsSuite.scala
\| \| \| \|	streaming/src/test/scala/org/apache/spark/streaming/TestSuiteBase.scala
\| \| * \|	Changed JavaStreamingContextWith* to Function in streaming.api.java.** package.	Tathagata Das	2014-01-06	2	-6/+6
\| \| \| \|	Also fixed packages of Flume and MQTT tests.
\| \| * \|	Refactored kafka, flume, zeromq, mqtt as separate external projects, with their own self-contained scala API, java API, scala unit tests and java unit tests.	Tathagata Das	2013-12-30	2	-6/+9
\| \| \| \|	Updated examples to use the external projects.
\| * \| \|	Issue #318 : minor style updates per review from Reynold Xin	Sean Owen	2014-01-07	10	-33/+2
\| \| \| \|
\| * \| \|	Merge remote-tracking branch 'upstream/master'	Sean Owen	2014-01-06	14	-15/+19
\| \|\ \ \ \| \| \| \|/ \| \| \|/\|
\| \| * \|	Removing SPARK_EXAMPLES_JAR in the code	Patrick Wendell	2014-01-05	14	-14/+18
\| \| \| \|
\| \| * \|	run-example -> bin/run-example	Prashant Sharma	2014-01-02	1	-1/+1
\| \| \|/
\| * /	Suggested small changes to Java code for slightly more standard style, encapsulation and in some cases performance	Sean Owen	2014-01-02	14	-83/+164
\| \|/
\| *	Fixed job name in the java streaming example.	azuryyu	2013-12-24	1	-1/+1
\| \|
* \|	Changed the way StreamingContext finds and reads checkpoint files, and added JavaStreamingContext.getOrCreate.	Tathagata Das	2014-01-09	1	-4/+3
\|/
*	Merge branch 'master' into scala-2.10	Raymond Liu	2013-11-13	1	-0/+98
\|\
\| *	Upgrade Kafka 0.7.2 to Kafka 0.8.0-beta1 for Spark Streaming	jerryshao	2013-10-12	1	-0/+98
\| \|
* \|	fixed some warnings	Martin Weindel	2013-10-05	4	-5/+3
\|/
*	Initial work to rename package to org.apache.spark	Matei Zaharia	2013-09-01	13	-84/+84
\|
*	Merge pull request #762 from shivaram/sgd-cleanup	Evan Sparks	2013-08-11	1	-0/+85
\|\	Refactor SGD options into a new class.
\| *	Add setters for optimizer, gradient in SGD.	Shivaram Venkataraman	2013-08-08	1	-1/+1
\| \|	Also remove java-specific constructor for LabeledPoint.
\| *	Merge branch 'master' of git://github.com/mesos/spark into sgd-cleanup	Shivaram Venkataraman	2013-08-06	1	-0/+116
\| \|\	Conflicts: mllib/src/main/scala/spark/mllib/util/MLUtils.scala
\| * \|	Refactor GLM algorithms and add Java tests	Shivaram Venkataraman	2013-08-06	1	-0/+85
\| \| \|	This change adds Java examples and unit tests for all GLM algorithms to make sure the MLLib interface works from Java. Changes include:
\| \| \|	- Introduce LabeledPoint and avoid using Doubles in train arguments
\| \| \|	- Rename train to run in class methods
\| \| \|	- Make the optimizer a member variable of GLM to make sure the builder pattern works
* \| \|	Fixed path to JavaALS.java and JavaKMeans.java, fixed hadoop2-yarn profile	Alexander Pivovarov	2013-08-10	2	-0/+0
\| \| \|
* \| \|	Merge pull request #786 from shivaram/mllib-java	Matei Zaharia	2013-08-09	2	-0/+168
\|\ \ \	Java fixes, tests and examples for ALS, KMeans
\| * \| \|	Remove Java-specific constructor for Rating.	Shivaram Venkataraman	2013-08-08	1	-3/+3
\| \| \| \|	The Scala constructor works for native Java types. Modify examples to match this.
\| * \| \|	Java examples, tests for KMeans and ALS	Shivaram Venkataraman	2013-08-06	2	-0/+168
\| \| \|/	- Changes ALS to accept RDD[Rating] instead of (Int, Int, Double), making it easier to call from Java
\| \|/\|	- Renames class methods from `train` to `run` to enable static methods to be called from Java
\| \| \|	- Add unit tests which check if both static / class methods can be called
\| \| \|	- Also add examples which port the main() function in ALS, KMeans to the examples project
\| \| \|	Couple of minor changes to existing code:
\| \| \|	- Add a toJavaRDD method in RDD to convert scala RDD to java RDD easily
\| \| \|	- Work around a bug where using double[] from Java leads to class cast exception in KMeans init
* / \|	Optimize JavaPageRank to use reduceByKey instead of groupByKey	Matei Zaharia	2013-08-08	1	-9/+8
\|/ /
* \|	Got rid of unnecessary map function	stayhf	2013-08-06	1	-6/+2
\| \|
* \|	changes as reviewer requested	stayhf	2013-08-06	1	-10/+1
\| \|
* \|	Updated code with reviewer's suggestions	stayhf	2013-08-05	1	-47/+47
\| \|
* \|	Simple PageRank algorithm implementation in Java for SPARK-760	stayhf	2013-08-03	1	-0/+129
\|/
*	Add Apache license headers and LICENSE and NOTICE files	Matei Zaharia	2013-07-16	9	-0/+153
\|
*	Java indentation 4 --> 2 spaces	Nick Pentreath	2013-03-20	3	-200/+200
\|
*	A few cosmetic changes for JavaKMeans	Nick Pentreath	2013-03-19	1	-1/+4
\|
*	Adding Java K-Means example	Nick Pentreath	2013-03-19	1	-0/+111
\|
*	Changes to more closely match line length limit style	Nick Pentreath	2013-03-17	1	-3/+5
\|
*	Adding Java versions of Pi and LogQuery	Nick Pentreath	2013-03-15	2	-0/+160
\|
*	Pass a code JAR to SparkContext in our examples. Fixes SPARK-594.	Matei Zaharia	2013-02-25	6	-9/+17
\|
*	Fixed bugs in examples.	Tathagata Das	2013-02-24	1	-1/+1
\|
*	Changed networkStream to socketStream and pluggableNetworkStream to become networkStream as a way to create streams from arbitrary network receiver.	Tathagata Das	2013-02-18	1	-1/+1
\|
*	Moved Java streaming examples to examples/src/main/java/spark/streaming/... and fixed logging in NetworkInputTracker to highlight errors when receiver deregisters/shuts down.	Tathagata Das	2013-02-14	3	-0/+174
\|
*	Some doc and usability improvements:	Matei Zaharia	2012-10-12	2	-2/+2
\|	- Added a StorageLevels class for easy access to StorageLevel constants in Java
\|	- Added doc comments on Function classes in Java
\|	- Updated Accumulator and HadoopWriter docs slightly
*	Renamed apply() to call() in Java API and allowed it to throw Exceptions	Matei Zaharia	2012-08-12	3	-21/+22
\|
*	Remove StringOps.split() from Java WordCount.	Josh Rosen	2012-07-25	1	-5/+2
\|
*	Minor cleanup and optimizations in Java API.	Josh Rosen	2012-07-24	1	-6/+7
\|	- Add override keywords.
\|	- Cache RDDs and counts in TC example.
\|	- Clean up JavaRDDLike's abstract methods.
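
The 2013-08-08 entry above, "Optimize JavaPageRank to use reduceByKey instead of groupByKey", names a common Spark optimization: reduceByKey can combine values for a key before they are shuffled, while groupByKey collects every value for a key first. The sketch below is not the code from that commit. It is a minimal plain-Python illustration of the two aggregation styles that does not use Spark at all, and the function names and sample data are hypothetical.

```python
from collections import defaultdict


def sum_by_grouping(pairs):
    """groupByKey-style: materialize every value per key, then sum each list."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return {key: sum(values) for key, values in groups.items()}


def sum_by_reducing(pairs):
    """reduceByKey-style: fold each value into one running total per key."""
    totals = {}
    for key, value in pairs:
        totals[key] = totals.get(key, 0.0) + value
    return totals


# Hypothetical rank contributions sent to pages "a" and "b".
contribs = [("a", 0.5), ("b", 0.25), ("a", 0.25), ("b", 0.5), ("a", 0.125)]

# Both styles give the same totals; only the intermediate state differs.
assert sum_by_grouping(contribs) == sum_by_reducing(contribs)
```

The grouping version holds a list of every contribution per key, while the reducing version holds a single number per key, which is why the reducing style tends to move and store less data when the per-key operation is an associative sum.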