aboutsummaryrefslogtreecommitdiff
Commit message (Collapse)AuthorAgeFilesLines
* Place Spray in front of Cloudera in Maven search pathMatei Zaharia2012-10-021-2/+2
|
* Revert "Place Spray repo ahead of Cloudera in Maven search path"Matei Zaharia2012-10-026-230/+180
| | | | This reverts commit 42e0a68082327c78dbd0fd313145124d9b8a9d98.
* Place Spray repo ahead of Cloudera in Maven search pathMatei Zaharia2012-10-026-180/+230
|
* Merge branch 'dev' of github.com:mesos/spark into devMatei Zaharia2012-10-021-1/+1
|\
| * Merge pull request #235 from pwendell/publish-local-mavenMatei Zaharia2012-10-011-1/+1
| |\ | | | | | | publish-local should go to maven + ivy by default
| | * publish-local should go to maven + ivy by defaultPatrick Wendell2012-10-011-1/+1
| | |
* | | Include date in folder name for Spark local dir.Matei Zaharia2012-10-011-5/+7
|/ /
* | Merge branch 'dev' of github.com:mesos/spark into devMatei Zaharia2012-10-011-30/+32
|\ \
| * \ Merge pull request #233 from rxin/devMatei Zaharia2012-10-011-30/+32
| |\ \ | | |/ | |/| Fixed #232: DirectBuffer's cleaner was empty and Spark tried to invoke clean on it.
| | * Fixed #232: DirectBuffer's cleaner was empty and Spark tried to invokeReynold Xin2012-10-011-30/+32
| |/ | | | | | | clean on it.
* | Some bug fixes and logging fixes for broadcast.Matei Zaharia2012-10-0110-108/+113
| |
* | Ignore file spark-tests.log in gitMatei Zaharia2012-10-011-0/+1
| |
* | Write all unit test output to a fileMatei Zaharia2012-10-013-12/+18
|/
* Improve log messages from BlockManagerMatei Zaharia2012-10-014-45/+41
|
* Merge branch 'dev' of github.com:mesos/spark into devMatei Zaharia2012-10-011-4/+17
|\
| * Merge pull request #231 from rxin/devMatei Zaharia2012-10-011-4/+17
| |\ | | | | | | Added a new command "pl" in sbt to publish to both Maven and Ivy.
| | * Added a new command "pl" in sbt to publish to both Maven and Ivy.Reynold Xin2012-10-011-4/+17
| |/
* | Remove some printlns in testsMatei Zaharia2012-10-012-2/+5
| |
* | Use underscores instead of colons in RDD IDsMatei Zaharia2012-10-015-6/+6
|/
* Added a (failing) test for LRU with MEMORY_AND_DISK.Matei Zaharia2012-09-302-5/+9
|
* Simplified Class / ClassLoader testMatei Zaharia2012-09-301-1/+1
|
* Fixed several bugs that caused weird behavior with files in spark-shell:Matei Zaharia2012-09-308-15/+64
| | | | | | | | | - SizeEstimator was following through a ClassLoader field of Hadoop JobConfs, which referenced the whole interpreter, Scala compiler, etc. Chaos ensued, giving an estimated size in the tens of gigabytes. - Broadcast variables in local mode were only stored as MEMORY_ONLY and never made accessible over a server, so they fell out of the cache when they were deemed too large and couldn't be reloaded.
* CommentMatei Zaharia2012-09-292-2/+2
|
* Removed Logging trait from CoalescedRDD since we don't log anythingMatei Zaharia2012-09-291-2/+1
|
* Merge pull request #228 from rxin/devMatei Zaharia2012-09-291-0/+6
|\ | | | | Added mapPartitionsWithSplit to the programming guide.
| * Added mapPartitionsWithSplit to the programming guide.Reynold Xin2012-09-291-0/+6
| |
* | Added a CoalescedRDD class for reducing the number of partitions in an RDD.Matei Zaharia2012-09-292-0/+75
| |
* | CommentMatei Zaharia2012-09-291-0/+1
| |
* | Merge branch 'dev' of github.com:mesos/spark into devMatei Zaharia2012-09-295-1/+12
|\|
| * Merge pull request #227 from JoshRosen/fix/distinct_numsplitsMatei Zaharia2012-09-285-1/+12
| |\ | | | | | | Allow controlling number of splits in distinct().
| | * Use null as dummy value in distinct().Josh Rosen2012-09-281-1/+1
| | |
| | * Allow controlling number of splits in distinct().Josh Rosen2012-09-285-1/+12
| | |
* | | Made BlockManager unmap memory-mapped files when necessary to reduce theMatei Zaharia2012-09-2910-118/+279
|/ / | | | | | | number of open files. Also optimized sending of disk-based blocks.
* | Don't create a Cache in SparkEnv because we don't use itMatei Zaharia2012-09-281-5/+1
| |
* | Logging tweaksMatei Zaharia2012-09-285-17/+23
| |
* | Renamed subdirs optionMatei Zaharia2012-09-281-1/+1
| |
* | Made subdirs per local dir configurable, and reduced lock usage a bitMatei Zaharia2012-09-281-12/+15
| |
* | Made disk store use multiple directories, deleted ShuffleManagerMatei Zaharia2012-09-286-417/+345
| |
* | Print and track user call sites in more places in SparkMatei Zaharia2012-09-286-54/+63
|/
* Merge pull request #225 from pwendell/devMatei Zaharia2012-09-282-1/+39
|\ | | | | Log message which records RDD origin
| * Fixing some whitespace issuesPatrick Wendell2012-09-281-9/+9
| |
| * Changes based on Matei's commentsPatrick Wendell2012-09-281-14/+14
| |
| * Log message which records RDD originPatrick Wendell2012-09-282-1/+39
| | | | | | | | | | | | | | | | | | | | | | | | | | This adds tracking to determine the "origin" of an RDD. Origin is defined by the boundary between the user's code and the spark code, during an RDD's instantiation. It is meant to help users understand where a Spark RDD is coming from in their code. This patch also logs origin data when stages are submitted to the scheduler. Finally, it adds a new log message to fix an inconsitency in the way that dependent stages (those missing parents) and independent stages (those without) are logged during submission.
* | Changed the way tasks' dependency files are sent to workers so thatMatei Zaharia2012-09-2814-156/+206
| | | | | | | | custom serializers or Kryo registrators can be loaded.
* | Fixed a bug where isLocal was set to false when using local[K]Matei Zaharia2012-09-281-1/+1
| |
* | Fix a bug in JAR fetcher that made it always fetch the JARMatei Zaharia2012-09-273-13/+9
| |
* | Added an option to compress blocks in the block storeMatei Zaharia2012-09-275-8/+56
| |
* | Renamed storage levels to something cleaner; fixes #223.Matei Zaharia2012-09-2710-59/+59
|/
* Merge branch 'dev' of github.com:mesos/spark into devMatei Zaharia2012-09-262-0/+23
|\
| * Merge pull request #222 from rxin/devMatei Zaharia2012-09-262-0/+23
| |\ | | | | | | Added MapPartitionsWithSplitRDD.