spark - Mirror of Apache Spark

	Commit message	Author	Age	Files	Lines
*	Place Spray in front of Cloudera in Maven search path	Matei Zaharia	2012-10-02	1	-2/+2
\|
*	Revert "Place Spray repo ahead of Cloudera in Maven search path"	Matei Zaharia	2012-10-02	6	-230/+180
\|	This reverts commit 42e0a68082327c78dbd0fd313145124d9b8a9d98.
*	Place Spray repo ahead of Cloudera in Maven search path	Matei Zaharia	2012-10-02	6	-180/+230
\|
*	Merge branch 'dev' of github.com:mesos/spark into dev	Matei Zaharia	2012-10-02	1	-1/+1
\|\
\| *	Merge pull request #235 from pwendell/publish-local-maven	Matei Zaharia	2012-10-01	1	-1/+1
\| \|\	publish-local should go to maven + ivy by default
\| \| *	publish-local should go to maven + ivy by default	Patrick Wendell	2012-10-01	1	-1/+1
\| \| \|
* \| \|	Include date in folder name for Spark local dir.	Matei Zaharia	2012-10-01	1	-5/+7
\|/ /
* \|	Merge branch 'dev' of github.com:mesos/spark into dev	Matei Zaharia	2012-10-01	1	-30/+32
\|\ \
\| * \	Merge pull request #233 from rxin/dev	Matei Zaharia	2012-10-01	1	-30/+32
\| \|\ \	Fixed #232: DirectBuffer's cleaner was empty and Spark tried to invoke clean on it.
\| \| \|/
\| \|/\|
\| \| *	Fixed #232: DirectBuffer's cleaner was empty and Spark tried to invoke clean on it.	Reynold Xin	2012-10-01	1	-30/+32
\| \|/
* \|	Some bug fixes and logging fixes for broadcast.	Matei Zaharia	2012-10-01	10	-108/+113
\| \|
* \|	Ignore file spark-tests.log in git	Matei Zaharia	2012-10-01	1	-0/+1
\| \|
* \|	Write all unit test output to a file	Matei Zaharia	2012-10-01	3	-12/+18
\|/
*	Improve log messages from BlockManager	Matei Zaharia	2012-10-01	4	-45/+41
\|
*	Merge branch 'dev' of github.com:mesos/spark into dev	Matei Zaharia	2012-10-01	1	-4/+17
\|\
\| *	Merge pull request #231 from rxin/dev	Matei Zaharia	2012-10-01	1	-4/+17
\| \|\	Added a new command "pl" in sbt to publish to both Maven and Ivy.
\| \| *	Added a new command "pl" in sbt to publish to both Maven and Ivy.	Reynold Xin	2012-10-01	1	-4/+17
\| \|/
* \|	Remove some printlns in tests	Matei Zaharia	2012-10-01	2	-2/+5
\| \|
* \|	Use underscores instead of colons in RDD IDs	Matei Zaharia	2012-10-01	5	-6/+6
\|/
*	Added a (failing) test for LRU with MEMORY_AND_DISK.	Matei Zaharia	2012-09-30	2	-5/+9
\|
*	Simplified Class / ClassLoader test	Matei Zaharia	2012-09-30	1	-1/+1
\|
*	Fixed several bugs that caused weird behavior with files in spark-shell:	Matei Zaharia	2012-09-30	8	-15/+64
\|	- SizeEstimator was following through a ClassLoader field of Hadoop JobConfs, which referenced the whole interpreter, Scala compiler, etc. Chaos ensued, giving an estimated size in the tens of gigabytes.
\|	- Broadcast variables in local mode were only stored as MEMORY_ONLY and never made accessible over a server, so they fell out of the cache when they were deemed too large and couldn't be reloaded.
*	Comment	Matei Zaharia	2012-09-29	2	-2/+2
\|
*	Removed Logging trait from CoalescedRDD since we don't log anything	Matei Zaharia	2012-09-29	1	-2/+1
\|
*	Merge pull request #228 from rxin/dev	Matei Zaharia	2012-09-29	1	-0/+6
\|\	Added mapPartitionsWithSplit to the programming guide.
\| *	Added mapPartitionsWithSplit to the programming guide.	Reynold Xin	2012-09-29	1	-0/+6
\| \|
* \|	Added a CoalescedRDD class for reducing the number of partitions in an RDD.	Matei Zaharia	2012-09-29	2	-0/+75
\| \|
* \|	Comment	Matei Zaharia	2012-09-29	1	-0/+1
\| \|
* \|	Merge branch 'dev' of github.com:mesos/spark into dev	Matei Zaharia	2012-09-29	5	-1/+12
\|\\|
\| *	Merge pull request #227 from JoshRosen/fix/distinct_numsplits	Matei Zaharia	2012-09-28	5	-1/+12
\| \|\	Allow controlling number of splits in distinct().
\| \| *	Use null as dummy value in distinct().	Josh Rosen	2012-09-28	1	-1/+1
\| \| \|
\| \| *	Allow controlling number of splits in distinct().	Josh Rosen	2012-09-28	5	-1/+12
\| \| \|
* \| \|	Made BlockManager unmap memory-mapped files when necessary to reduce the number of open files. Also optimized sending of disk-based blocks.	Matei Zaharia	2012-09-29	10	-118/+279
\|/ /
* \|	Don't create a Cache in SparkEnv because we don't use it	Matei Zaharia	2012-09-28	1	-5/+1
\| \|
* \|	Logging tweaks	Matei Zaharia	2012-09-28	5	-17/+23
\| \|
* \|	Renamed subdirs option	Matei Zaharia	2012-09-28	1	-1/+1
\| \|
* \|	Made subdirs per local dir configurable, and reduced lock usage a bit	Matei Zaharia	2012-09-28	1	-12/+15
\| \|
* \|	Made disk store use multiple directories, deleted ShuffleManager	Matei Zaharia	2012-09-28	6	-417/+345
\| \|
* \|	Print and track user call sites in more places in Spark	Matei Zaharia	2012-09-28	6	-54/+63
\|/
*	Merge pull request #225 from pwendell/dev	Matei Zaharia	2012-09-28	2	-1/+39
\|\	Log message which records RDD origin
\| *	Fixing some whitespace issues	Patrick Wendell	2012-09-28	1	-9/+9
\| \|
\| *	Changes based on Matei's comments	Patrick Wendell	2012-09-28	1	-14/+14
\| \|
\| *	Log message which records RDD origin	Patrick Wendell	2012-09-28	2	-1/+39
\| \|	This adds tracking to determine the "origin" of an RDD. Origin is defined by the boundary between the user's code and the Spark code, during an RDD's instantiation. It is meant to help users understand where a Spark RDD is coming from in their code. This patch also logs origin data when stages are submitted to the scheduler. Finally, it adds a new log message to fix an inconsistency in the way that dependent stages (those missing parents) and independent stages (those without) are logged during submission.
* \|	Changed the way tasks' dependency files are sent to workers so that custom serializers or Kryo registrators can be loaded.	Matei Zaharia	2012-09-28	14	-156/+206
\| \|
* \|	Fixed a bug where isLocal was set to false when using local[K]	Matei Zaharia	2012-09-28	1	-1/+1
\| \|
* \|	Fix a bug in JAR fetcher that made it always fetch the JAR	Matei Zaharia	2012-09-27	3	-13/+9
\| \|
* \|	Added an option to compress blocks in the block store	Matei Zaharia	2012-09-27	5	-8/+56
\| \|
* \|	Renamed storage levels to something cleaner; fixes #223.	Matei Zaharia	2012-09-27	10	-59/+59
\|/
*	Merge branch 'dev' of github.com:mesos/spark into dev	Matei Zaharia	2012-09-26	2	-0/+23
\|\
\| *	Merge pull request #222 from rxin/dev	Matei Zaharia	2012-09-26	2	-0/+23
\| \|\	Added MapPartitionsWithSplitRDD.