path: root/core
Commit message [Author, Date, Files, Lines]
* Merge pull request #525 from stephenh/subtractByKey [Matei Zaharia, 2013-03-16, 4 files, -40/+91]
|\
| | Add PairRDDFunctions.subtractByKey.
| * Fix tabs that snuck in. [Stephen Haberman, 2013-03-14, 1 file, -8/+8]
| |
| * Allow subtractByKey's other argument to have a different value type. [Stephen Haberman, 2013-03-14, 3 files, -8/+8]
| |
| * Finished subtractByKeys. [Stephen Haberman, 2013-03-14, 4 files, -23/+62]
| |
| * Simplify SubtractedRDD in preparation for subtractByKey. [Stephen Haberman, 2013-03-13, 3 files, -46/+58]
| |
* | Merge pull request #516 from squito/fix_local_metrics [Matei Zaharia, 2013-03-15, 8 files, -12/+104]
|\ \
| | | Fix local metrics
| * | increase sleep time [Imran Rashid, 2013-03-10, 1 file, -1/+1]
| | |
| * | add a small wait to one task to make sure some task runtime really is non-zero [Imran Rashid, 2013-03-10, 1 file, -4/+10]
| | |
| * | enable task metrics in local mode, add tests [Imran Rashid, 2013-03-09, 2 files, -2/+88]
| | |
| * | rename remoteFetchWaitTime to fetchWaitTime, since it also includes time from local fetches [Imran Rashid, 2013-03-09, 6 files, -10/+10]
| | |
* | | Add a log4j compile dependency to fix build in IntelliJ [Mikhail Bautin, 2013-03-15, 1 file, -1/+5]
| | | Also rename parent project to spark-parent (otherwise it shows up as "parent" in IntelliJ, which is very confusing).
* | | Merge pull request #521 from stephenh/earlyclose [Matei Zaharia, 2013-03-13, 4 files, -51/+153]
|\ \ \
| | | | Close the reader in HadoopRDD as soon as iteration ends.
| * | | Add a test for NextIterator. [Stephen Haberman, 2013-03-13, 1 file, -0/+68]
| | | |
| * | | Add NextIterator.closeIfNeeded. [Stephen Haberman, 2013-03-13, 2 files, -2/+16]
| | | |
| * | | Remove NextIterator.close default implementation. [Stephen Haberman, 2013-03-12, 2 files, -4/+7]
| | | |
| * | | More quickly call close in HadoopRDD. [Stephen Haberman, 2013-03-11, 3 files, -52/+69]
| | |/
| |/|
| | | This also refactors out the common "gotNext" iterator pattern into a shared utility class.
* / | Send block sizes as longs. [Charles Reiss, 2013-03-11, 1 file, -4/+4]
|/ /
* | Merge remote-tracking branch 'woggling/dag-sched-driver-port' [Matei Zaharia, 2013-03-10, 1 file, -5/+4]
|\ \
| | | Conflicts:
| | |     core/src/test/scala/spark/scheduler/DAGSchedulerSuite.scala
| * | Prevent DAGSchedulerSuite from corrupting driver.port. [Charles Reiss, 2013-03-09, 1 file, -4/+5]
| |/
| | Use the LocalSparkContext abstraction to properly manage clearing spark.driver.port.
* | Merge pull request #512 from patelh/fix-kryo-serializer [Matei Zaharia, 2013-03-10, 2 files, -11/+19]
|\ \
| | | Fix reference bug in Kryo serializer, add test, update version
| * | Fix reference bug in Kryo serializer, add test, update version [Hiral Patel, 2013-03-07, 2 files, -11/+19]
| |/
* | Merge pull request #515 from woggling/deploy-app-death [Matei Zaharia, 2013-03-10, 3 files, -7/+17]
|\ \
| | | Notify standalone deploy client of application death.
| * | Notify standalone deploy client of application death. [Charles Reiss, 2013-03-09, 3 files, -7/+17]
| |/
| | Usually, this isn't necessary since the application will be removed as a result of the deploy client disconnecting, but occasionally the standalone deploy master removes an application for another reason.
| |
| | Also mark applications as FAILED instead of FINISHED when they are killed as a result of their executors failing too many times.
* | Merge remote-tracking branch 'stephenh/nomocks' [Matei Zaharia, 2013-03-10, 9 files, -536/+275]
|\ \
| |/
|/|
| | Conflicts:
| |     core/src/main/scala/spark/storage/BlockManagerMaster.scala
| |     core/src/test/scala/spark/scheduler/DAGSchedulerSuite.scala
| * Fix MapOutputTrackerSuite. [Stephen Haberman, 2013-02-26, 1 file, -2/+4]
| |
| * Override DAGScheduler.runLocally so we can remove the Thread.sleep. [Stephen Haberman, 2013-02-25, 2 files, -19/+27]
| |
| * Merge branch 'master' into nomocks [Stephen Haberman, 2013-02-25, 95 files, -890/+1411]
| |\
| | | Conflicts:
| | |     core/src/test/scala/spark/scheduler/DAGSchedulerSuite.scala
| * | Use stubs instead of mocks for DAGSchedulerSuite. [Stephen Haberman, 2013-02-09, 8 files, -527/+253]
| | |
* | | Fix TaskMetrics not being serializable [Matei Zaharia, 2013-03-04, 1 file, -6/+13]
| | |
* | | Merge pull request #506 from rxin/spark-706 [Matei Zaharia, 2013-03-03, 2 files, -51/+120]
|\ \ \
| | | | Fixed SPARK-706: Failures in block manager put lead to read task hanging.
| * | | Fixed SPARK-706: Failures in block manager put lead to read task hanging. [Reynold Xin, 2013-02-28, 2 files, -51/+120]
| | | |
* | | | minor cleanup based on feedback in review request [Imran Rashid, 2013-03-03, 4 files, -24/+24]
| | | |
* | | | change CleanupIterator to CompletionIterator [Imran Rashid, 2013-03-03, 3 files, -27/+27]
| | | |
* | | | refactoring of TaskMetrics [Imran Rashid, 2013-03-03, 6 files, -59/+110]
| | | |
* | | | Merge branch 'master' into stageInfo [Imran Rashid, 2013-03-03, 29 files, -107/+474]
|\ \ \ \
| * \ \ \ Merge pull request #504 from mosharaf/master [Matei Zaharia, 2013-03-02, 2 files, -2/+2]
| |\ \ \ \
| | | | | | Worker address was getting removed when removing an app.
| | * | | | Fixed master data structure updates after removing an application; and a typo. [Mosharaf Chowdhury, 2013-02-27, 2 files, -2/+2]
| | | | | |
| * | | | | bump version to 0.7.1-SNAPSHOT in the subproject poms to keep the maven build building. [Mark Hamstra, 2013-02-28, 1 file, -1/+1]
| |/ / / /
| * | | | Fix a problem with no hosts being counted as alive in the first job [Matei Zaharia, 2013-02-26, 1 file, -3/+3]
| | | | |
| * | | | Fix overly large thread names in PySpark [Matei Zaharia, 2013-02-26, 1 file, -2/+2]
| | | | |
| * | | | Fixed replication bug in BlockManager [Tathagata Das, 2013-02-25, 2 files, -3/+17]
| | | | |
| * | | | Allow passing sparkHome and JARs to StreamingContext constructor [Matei Zaharia, 2013-02-25, 2 files, -3/+3]
| | | | | Also warns if spark.cleaner.ttl is not set in the version where you pass your own SparkContext.
| * | | | Set spark.deploy.spreadOut to true by default in 0.7 (improves locality) [Matei Zaharia, 2013-02-25, 1 file, -1/+1]
| | | | |
| * | | | Add a config property for Akka lifecycle event logging [Matei Zaharia, 2013-02-25, 1 file, -2/+4]
| | | | |
| * | | | Merge pull request #498 from pwendell/shutup-akka [Matei Zaharia, 2013-02-25, 1 file, -1/+1]
| |\ \ \ \
| | | | | | Disable remote lifecycle logging from Akka.
| | * | | | Disable remote lifecycle logging from Akka. [Patrick Wendell, 2013-02-25, 1 file, -1/+1]
| | | | | | This changes the default setting to `off` for remote lifecycle events. When this is on, it is very chatty at the INFO level. It also prints out several ERROR messages sometimes when sc.stop() is called.
| * | | | | Get spark.default.parallelism on each call to defaultPartitioner, instead of only once, in case the user changes it across Spark uses [Matei Zaharia, 2013-02-25, 1 file, -4/+1]
| | |_|_|/
| |/| | |
| * | | | Merge pull request #459 from stephenh/bettersplits [Matei Zaharia, 2013-02-25, 8 files, -38/+90]
| |\ \ \ \
| | |/ / /
| |/| | | Change defaultPartitioner to use upstream split size.
| | * | | Use default parallelism if it's set. [Stephen Haberman, 2013-02-24, 2 files, -6/+19]
| | | | |
| | * | | Merge branch 'master' into bettersplits [Stephen Haberman, 2013-02-24, 92 files, -793/+1260]
| | |\ \ \
| | | | | | Conflicts:
| | | | | |     core/src/main/scala/spark/RDD.scala
| | | | | |     core/src/main/scala/spark/scheduler/cluster/StandaloneSchedulerBackend.scala
| | | | | |     core/src/test/scala/spark/ShuffleSuite.scala