Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Place Spray in front of Cloudera in Maven search path | Matei Zaharia | 2012-10-02 | 1 | -2/+2 |
| | |||||
* | Revert "Place Spray repo ahead of Cloudera in Maven search path" | Matei Zaharia | 2012-10-02 | 6 | -230/+180 |
| | | | | This reverts commit 42e0a68082327c78dbd0fd313145124d9b8a9d98. | ||||
* | Place Spray repo ahead of Cloudera in Maven search path | Matei Zaharia | 2012-10-02 | 6 | -180/+230 |
| | |||||
* | Merge branch 'dev' of github.com:mesos/spark into dev | Matei Zaharia | 2012-10-02 | 1 | -1/+1 |
|\ | |||||
| * | Merge pull request #235 from pwendell/publish-local-maven | Matei Zaharia | 2012-10-01 | 1 | -1/+1 |
| |\ | | | | | | | publish-local should go to maven + ivy by default | ||||
| | * | publish-local should go to maven + ivy by default | Patrick Wendell | 2012-10-01 | 1 | -1/+1 |
| | | | |||||
* | | | Include date in folder name for Spark local dir. | Matei Zaharia | 2012-10-01 | 1 | -5/+7 |
|/ / | |||||
* | | Merge branch 'dev' of github.com:mesos/spark into dev | Matei Zaharia | 2012-10-01 | 1 | -30/+32 |
|\ \ | |||||
| * \ | Merge pull request #233 from rxin/dev | Matei Zaharia | 2012-10-01 | 1 | -30/+32 |
| |\ \ | | |/ | |/| | Fixed #232: DirectBuffer's cleaner was empty and Spark tried to invoke clean on it. | ||||
| | * | Fixed #232: DirectBuffer's cleaner was empty and Spark tried to invoke | Reynold Xin | 2012-10-01 | 1 | -30/+32 |
| |/ | | | | | | | clean on it. | ||||
* | | Some bug fixes and logging fixes for broadcast. | Matei Zaharia | 2012-10-01 | 10 | -108/+113 |
| | | |||||
* | | Ignore file spark-tests.log in git | Matei Zaharia | 2012-10-01 | 1 | -0/+1 |
| | | |||||
* | | Write all unit test output to a file | Matei Zaharia | 2012-10-01 | 3 | -12/+18 |
|/ | |||||
* | Improve log messages from BlockManager | Matei Zaharia | 2012-10-01 | 4 | -45/+41 |
| | |||||
* | Merge branch 'dev' of github.com:mesos/spark into dev | Matei Zaharia | 2012-10-01 | 1 | -4/+17 |
|\ | |||||
| * | Merge pull request #231 from rxin/dev | Matei Zaharia | 2012-10-01 | 1 | -4/+17 |
| |\ | | | | | | | Added a new command "pl" in sbt to publish to both Maven and Ivy. | ||||
| | * | Added a new command "pl" in sbt to publish to both Maven and Ivy. | Reynold Xin | 2012-10-01 | 1 | -4/+17 |
| |/ | |||||
* | | Remove some printlns in tests | Matei Zaharia | 2012-10-01 | 2 | -2/+5 |
| | | |||||
* | | Use underscores instead of colons in RDD IDs | Matei Zaharia | 2012-10-01 | 5 | -6/+6 |
|/ | |||||
* | Added a (failing) test for LRU with MEMORY_AND_DISK. | Matei Zaharia | 2012-09-30 | 2 | -5/+9 |
| | |||||
* | Simplified Class / ClassLoader test | Matei Zaharia | 2012-09-30 | 1 | -1/+1 |
| | |||||
* | Fixed several bugs that caused weird behavior with files in spark-shell: | Matei Zaharia | 2012-09-30 | 8 | -15/+64 |
| | | | | | | | | | - SizeEstimator was following through a ClassLoader field of Hadoop JobConfs, which referenced the whole interpreter, Scala compiler, etc. Chaos ensued, giving an estimated size in the tens of gigabytes. - Broadcast variables in local mode were only stored as MEMORY_ONLY and never made accessible over a server, so they fell out of the cache when they were deemed too large and couldn't be reloaded. | ||||
* | Comment | Matei Zaharia | 2012-09-29 | 2 | -2/+2 |
| | |||||
* | Removed Logging trait from CoalescedRDD since we don't log anything | Matei Zaharia | 2012-09-29 | 1 | -2/+1 |
| | |||||
* | Merge pull request #228 from rxin/dev | Matei Zaharia | 2012-09-29 | 1 | -0/+6 |
|\ | | | | | Added mapPartitionsWithSplit to the programming guide. | ||||
| * | Added mapPartitionsWithSplit to the programming guide. | Reynold Xin | 2012-09-29 | 1 | -0/+6 |
| | | |||||
* | | Added a CoalescedRDD class for reducing the number of partitions in an RDD. | Matei Zaharia | 2012-09-29 | 2 | -0/+75 |
| | | |||||
* | | Comment | Matei Zaharia | 2012-09-29 | 1 | -0/+1 |
| | | |||||
* | | Merge branch 'dev' of github.com:mesos/spark into dev | Matei Zaharia | 2012-09-29 | 5 | -1/+12 |
|\| | |||||
| * | Merge pull request #227 from JoshRosen/fix/distinct_numsplits | Matei Zaharia | 2012-09-28 | 5 | -1/+12 |
| |\ | | | | | | | Allow controlling number of splits in distinct(). | ||||
| | * | Use null as dummy value in distinct(). | Josh Rosen | 2012-09-28 | 1 | -1/+1 |
| | | | |||||
| | * | Allow controlling number of splits in distinct(). | Josh Rosen | 2012-09-28 | 5 | -1/+12 |
| | | | |||||
* | | | Made BlockManager unmap memory-mapped files when necessary to reduce the | Matei Zaharia | 2012-09-29 | 10 | -118/+279 |
|/ / | | | | | | | number of open files. Also optimized sending of disk-based blocks. | ||||
* | | Don't create a Cache in SparkEnv because we don't use it | Matei Zaharia | 2012-09-28 | 1 | -5/+1 |
| | | |||||
* | | Logging tweaks | Matei Zaharia | 2012-09-28 | 5 | -17/+23 |
| | | |||||
* | | Renamed subdirs option | Matei Zaharia | 2012-09-28 | 1 | -1/+1 |
| | | |||||
* | | Made subdirs per local dir configurable, and reduced lock usage a bit | Matei Zaharia | 2012-09-28 | 1 | -12/+15 |
| | | |||||
* | | Made disk store use multiple directories, deleted ShuffleManager | Matei Zaharia | 2012-09-28 | 6 | -417/+345 |
| | | |||||
* | | Print and track user call sites in more places in Spark | Matei Zaharia | 2012-09-28 | 6 | -54/+63 |
|/ | |||||
* | Merge pull request #225 from pwendell/dev | Matei Zaharia | 2012-09-28 | 2 | -1/+39 |
|\ | | | | | Log message which records RDD origin | ||||
| * | Fixing some whitespace issues | Patrick Wendell | 2012-09-28 | 1 | -9/+9 |
| | | |||||
| * | Changes based on Matei's comments | Patrick Wendell | 2012-09-28 | 1 | -14/+14 |
| | | |||||
| * | Log message which records RDD origin | Patrick Wendell | 2012-09-28 | 2 | -1/+39 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | This adds tracking to determine the "origin" of an RDD. Origin is defined by the boundary between the user's code and the Spark code, during an RDD's instantiation. It is meant to help users understand where a Spark RDD is coming from in their code. This patch also logs origin data when stages are submitted to the scheduler. Finally, it adds a new log message to fix an inconsistency in the way that dependent stages (those missing parents) and independent stages (those without) are logged during submission. | ||||
* | | Changed the way tasks' dependency files are sent to workers so that | Matei Zaharia | 2012-09-28 | 14 | -156/+206 |
| | | | | | | | | custom serializers or Kryo registrators can be loaded. | ||||
* | | Fixed a bug where isLocal was set to false when using local[K] | Matei Zaharia | 2012-09-28 | 1 | -1/+1 |
| | | |||||
* | | Fix a bug in JAR fetcher that made it always fetch the JAR | Matei Zaharia | 2012-09-27 | 3 | -13/+9 |
| | | |||||
* | | Added an option to compress blocks in the block store | Matei Zaharia | 2012-09-27 | 5 | -8/+56 |
| | | |||||
* | | Renamed storage levels to something cleaner; fixes #223. | Matei Zaharia | 2012-09-27 | 10 | -59/+59 |
|/ | |||||
* | Merge branch 'dev' of github.com:mesos/spark into dev | Matei Zaharia | 2012-09-26 | 2 | -0/+23 |
|\ | |||||
| * | Merge pull request #222 from rxin/dev | Matei Zaharia | 2012-09-26 | 2 | -0/+23 |
| |\ | | | | | | | Added MapPartitionsWithSplitRDD. |