spark - Mirror of Apache Spark

	Commit message	Author	Age	Files	Lines
*	sbin/spark-class* -> bin/spark-class*	Prashant Sharma	2014-01-03	14	-15/+15
\|
*	a few left over document change	Prashant Sharma	2014-01-02	3	-4/+4
\|
*	pyspark -> bin/pyspark	Prashant Sharma	2014-01-02	5	-19/+19
\|
*	run-example -> bin/run-example	Prashant Sharma	2014-01-02	19	-31/+31
\|
*	spark-shell -> bin/spark-shell	Prashant Sharma	2014-01-02	9	-15/+15
\|
*	Merge branch 'scripts-reorg' of github.com:shane-huang/incubator-spark into spark-915-segregate-scripts	Prashant Sharma	2014-01-02	41	-96/+90
\|\	Conflicts: bin/spark-shell core/pom.xml core/src/main/scala/org/apache/spark/SparkContext.scala core/src/main/scala/org/apache/spark/scheduler/cluster/mesos/CoarseMesosSchedulerBackend.scala core/src/main/scala/org/apache/spark/ui/UIWorkloadGenerator.scala core/src/test/scala/org/apache/spark/DriverSuite.scala python/run-tests sbin/compute-classpath.sh sbin/spark-class sbin/stop-slaves.sh
\| *	deprecate "spark" script and SPARK_CLASSPATH environment variable	Andrew xia	2013-10-12	7	-99/+4
\| \|
\| *	refactor $FWD variable	Andrew xia	2013-09-29	5	-7/+7
\| \|
\| *	Merge branch 'reorgscripts' into scripts-reorg	shane-huang	2013-09-27	40	-87/+175
\| \|\
\| \| *	rm bin/spark.cmd as we don't have windows test environment. Will add it later if needed	shane-huang	2013-09-26	1	-27/+0
\| \| \|	Signed-off-by: shane-huang <shengsheng.huang@intel.com>
\| \| *	fix paths and change spark to use APP_MEM as application driver memory instead of SPARK_MEM, user should add application jars to SPARK_CLASSPATH	shane-huang	2013-09-26	3	-35/+10
\| \| \|	Signed-off-by: shane-huang <shengsheng.huang@intel.com>
\| \| *	fix path	shane-huang	2013-09-26	1	-1/+1
\| \| \|	Signed-off-by: shane-huang <shengsheng.huang@intel.com>
\| \| *	add scripts in bin	shane-huang	2013-09-23	12	-17/+163
\| \| \|	Signed-off-by: shane-huang <shengsheng.huang@intel.com>
\| \| *	moved user scripts to bin folder	shane-huang	2013-09-23	11	-0/+0
\| \| \|	Signed-off-by: shane-huang <shengsheng.huang@intel.com>
\| \| *	add admin scripts to sbin	shane-huang	2013-09-23	14	-47/+47
\| \| \|	Signed-off-by: shane-huang <shengsheng.huang@intel.com>
\| \| *	added spark-class and spark-executor to sbin	shane-huang	2013-09-23	14	-22/+16
\| \| \|	Signed-off-by: shane-huang <shengsheng.huang@intel.com>
* \| \|	Merge pull request #309 from mateiz/conf2	Patrick Wendell	2014-01-01	140	-941/+1731
\|\ \ \	SPARK-544. Migrate configuration to a SparkConf class
\| \| \| \|	This is still a work in progress based on Prashant and Evan's code. So far I've done the following:
\| \| \| \|	- Got rid of global SparkContext.globalConf
\| \| \| \|	- Passed SparkConf to serializers and compression codecs
\| \| \| \|	- Made SparkConf public instead of private[spark]
\| \| \| \|	- Improved API of SparkContext and SparkConf
\| \| \| \|	- Switched executor environment vars to be passed through SparkConf
\| \| \| \|	- Fixed some places that were still using system properties
\| \| \| \|	- Fixed some tests, though others are still failing
\| \| \| \|	This still fails several tests in core, repl and streaming, likely due to properties not being set or cleared correctly (some of the tests run fine in isolation). But the API at least is hopefully ready for review. Unfortunately there was a lot of global stuff before due to a "SparkContext.globalConf" method that let you set a "default" configuration of sorts, which meant I had to make some pretty big changes.
\| * \| \|	Fix Python code after change of getOrElse	Matei Zaharia	2014-01-01	2	-7/+14
\| \| \| \|
\| * \| \|	Fixed two uses of conf.get with no default value in Mesos	Matei Zaharia	2014-01-01	2	-2/+2
\| \| \| \|
\| * \| \|	Miscellaneous fixes from code review.	Matei Zaharia	2014-01-01	49	-189/+206
\| \| \| \|	Also replaced SparkConf.getOrElse with just a "get" that takes a default value, and added getInt, getLong, etc to make code that uses this simpler later on.
\| * \| \|	Merge remote-tracking branch 'apache/master' into conf2	Matei Zaharia	2014-01-01	19	-37/+52
\| \|\ \ \	Conflicts: core/src/main/scala/org/apache/spark/SparkContext.scala core/src/main/scala/org/apache/spark/metrics/MetricsSystem.scala core/src/main/scala/org/apache/spark/storage/BlockManagerMasterActor.scala
* \| \| \|	Merge pull request #312 from pwendell/log4j-fix-2	Patrick Wendell	2014-01-01	19	-37/+52
\|\ \ \ \	SPARK-1008: Logging improvements
\| \| \| \| \|	1. Adds a default log4j file that gets loaded if users haven't specified a log4j file.
\| \| \| \| \|	2. Isolates use of the tools assembly jar. I found this produced SLF4J warnings after building with SBT (and I've seen similar warnings on the mailing list).
\| * \ \ \	Merge remote-tracking branch 'apache-github/master' into log4j-fix-2	Patrick Wendell	2014-01-01	38	-465/+803
\| \|\ \ \ \	Conflicts: streaming/src/main/scala/org/apache/spark/streaming/scheduler/JobGenerator.scala
\| * \| \| \|	Adding outer checkout when initializing logging	Patrick Wendell	2013-12-31	1	-3/+5
\| \| \| \| \|
\| * \| \| \|	Tiny typo fix	Patrick Wendell	2013-12-31	1	-2/+2
\| \| \| \| \|
\| * \| \| \|	Removing use in test	Patrick Wendell	2013-12-31	1	-2/+0
\| \| \| \| \|
\| * \| \| \|	Minor fixes	Patrick Wendell	2013-12-30	2	-3/+3
\| \| \| \| \|
\| * \| \| \|	Removing initLogging entirely	Patrick Wendell	2013-12-30	17	-32/+21
\| \| \| \| \|
\| * \| \| \|	Response to Shivaram's review	Patrick Wendell	2013-12-30	2	-15/+18
\| \| \| \| \|
\| * \| \| \|	SPARK-1008: Logging improvements	Patrick Wendell	2013-12-29	4	-13/+37
\| \| \| \| \|	1. Adds a default log4j file that gets loaded if users haven't specified a log4j file.
\| \| \| \| \|	2. Isolates use of the tools assembly jar. I found this produced SLF4J warnings after building with SBT (and I've seen similar warnings on the mailing list).
\| \| * \| \|	Merge remote-tracking branch 'apache/master' into conf2	Matei Zaharia	2014-01-01	12	-211/+457
\| \| \|\ \ \	Conflicts: project/SparkBuild.scala
* \| \| \| \|	Merge pull request #314 from witgo/master	Reynold Xin	2013-12-31	2	-1356/+240
\|\ \ \ \ \	restore core/pom.xml file modification
\| * \| \| \| \|	restore core/pom.xml file modification	liguoqiang	2014-01-01	2	-1356/+240
\|/ / / / /
* \| \| \| \|	Merge pull request #73 from falaki/ApproximateDistinctCount	Reynold Xin	2013-12-31	12	-233/+1595
\|\ \ \ \ \	Approximate distinct count
\| \| \| \| \| \|	Added countApproxDistinct() to RDD and countApproxDistinctByKey() to PairRDDFunctions to approximately count the number of distinct elements and the number of distinct values per key, respectively. Both functions use HyperLogLog from stream-lib for counting. Both functions take a parameter that controls the trade-off between accuracy and memory consumption. Also added Scala docs and test suites for both methods.
\| * \| \| \| \|	Made the code more compact and readable	Hossein Falaki	2013-12-31	3	-23/+8
\| \| \| \| \| \|
\| * \| \| \| \|	minor improvements	Hossein Falaki	2013-12-31	2	-4/+5
\| \| \| \| \| \|
\| * \| \| \| \|	Added Java unit tests for countApproxDistinct and countApproxDistinctByKey	Hossein Falaki	2013-12-30	1	-0/+32
\| \| \| \| \| \|
\| * \| \| \| \|	Added Java API for countApproxDistinct	Hossein Falaki	2013-12-30	1	-0/+11
\| \| \| \| \| \|
\| * \| \| \| \|	Added Java API for countApproxDistinctByKey	Hossein Falaki	2013-12-30	1	-0/+36
\| \| \| \| \| \|
\| * \| \| \| \|	Added stream 2.5.1 jar dependency	Hossein Falaki	2013-12-30	1	-1/+2
\| \| \| \| \| \|
\| * \| \| \| \|	Renamed countDistinct and countDistinctByKey methods to include Approx	Hossein Falaki	2013-12-30	5	-15/+15
\| \| \| \| \| \|
\| * \| \| \| \|	Using origin version	Hossein Falaki	2013-12-30	374	-8424/+19051
\| \|\ \ \ \ \
\| * \| \| \| \| \|	Removed superfluous abs call from test cases.	Hossein Falaki	2013-12-10	1	-2/+2
\| \| \| \| \| \| \|
\| * \| \| \| \| \|	Made SerializableHyperLogLog Externalizable and added Kryo tests	Hossein Falaki	2013-10-18	2	-5/+10
\| \| \| \| \| \| \|
\| * \| \| \| \| \|	Added stream-lib dependency to Maven build	Hossein Falaki	2013-10-18	2	-0/+9
\| \| \| \| \| \| \|
\| * \| \| \| \| \|	Improved code style.	Hossein Falaki	2013-10-17	4	-15/+19
\| \| \| \| \| \| \|
\| * \| \| \| \| \|	Fixed document typo	Hossein Falaki	2013-10-17	2	-4/+4
\| \| \| \| \| \| \|
\| * \| \| \| \| \|	Added dependency on stream-lib version 2.4.0 for approximate distinct count support.	Hossein Falaki	2013-10-17	1	-1/+2
\| * \| \| \| \| \|	Added countDistinctByKey to PairRDDFunctions that counts the approximate number of unique values for each key in the RDD.	Hossein Falaki	2013-10-17	2	-0/+81
\| * \| \| \| \| \|	Added a countDistinct method to RDD that takes an accuracy parameter and returns the (approximate) number of distinct elements in the RDD.	Hossein Falaki	2013-10-17	2	-1/+38