aboutsummaryrefslogtreecommitdiff
Commit message (Collapse)AuthorAgeFilesLines
...
| | * | | | | | | | | | Add warning message for spillingAndrew Or2013-12-311-5/+9
| | | | | | | | | | | |
| | * | | | | | | | | | Address Aaron's and Jerry's commentsAndrew Or2013-12-312-5/+6
| | |/ / / / / / / / /
| | * | | | | | | | | Fix CheckpointSuite test failAndrew Or2013-12-301-2/+2
| | | | | | | | | | |
| | * | | | | | | | | Simplify merge logic based on the invariant that all spills contain unique keysAndrew Or2013-12-301-37/+22
| | | | | | | | | | |
| | * | | | | | | | | Merge pull request from aarondav: Utilize DiskBlockManager pathway for temp ↵Andrew Or2013-12-303-16/+49
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | file writing This gives us a couple advantages: - Uses spark.local.dir and randomly selects a directory/disk. - Ensure files are deleted on normal DiskBlockManager cleanup. - Availability of same stats as usual DiskBlockObjectWriter (currenty unused). Also enable basic cleanup when iterator is fully drained. Still requires cleanup for operations that fail or don't go through all elements.
| | * | | | | | | | | Merge branch 'master' of github.com:andrewor14/incubator-sparkAndrew Or2013-12-292-9/+11
| | |\ \ \ \ \ \ \ \ \
| | | * | | | | | | | | Use Comparator instead of OrderingAaron Davidson2013-12-292-9/+11
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | lower object creation costs
| | * | | | | | | | | | Add test suite for ExternalAppendOnlyMapAndrew Or2013-12-291-0/+217
| | | | | | | | | | | |
| | * | | | | | | | | | Make serializer a parameter to ExternalAppendOnlyMapAndrew Or2013-12-292-4/+4
| | |/ / / / / / / / /
| | * | | | | | | | | Address Aaron's commentsAndrew Or2013-12-295-88/+188
| | | | | | | | | | |
| | * | | | | | | | | Add Apache headersAaron Davidson2013-12-273-4/+54
| | | | | | | | | | |
| | * | | | | | | | | Rename spark.shuffle.buffer variablesAndrew Or2013-12-271-2/+2
| | | | | | | | | | |
| | * | | | | | | | | Final cleanupAndrew Or2013-12-264-25/+28
| | | | | | | | | | |
| | * | | | | | | | | Use real serializer & manual orderingAaron Davidson2013-12-261-11/+27
| | | | | | | | | | |
| | * | | | | | | | | Return efficient iterator if no spillage happenedAaron Davidson2013-12-263-9/+20
| | | | | | | | | | |
| | * | | | | | | | | Sort AppendOnlyMap in-placeAndrew Or2013-12-262-14/+51
| | | | | | | | | | |
| | * | | | | | | | | Fix streaming JavaAPISuite againAndrew Or2013-12-261-8/+12
| | | | | | | | | | |
| | * | | | | | | | | Fix streaming JavaAPISuite that depended on orderAaron Davidson2013-12-261-11/+16
| | | | | | | | | | |
| | * | | | | | | | | Allow Product2 rather than just tuple kv pairsAaron Davidson2013-12-262-8/+8
| | | | | | | | | | |
| | * | | | | | | | | Move maps to util, and refactor moreAndrew Or2013-12-267-23/+22
| | | | | | | | | | |
| | * | | | | | | | | SamplingSizeTracker + Map + test suiteAaron Davidson2013-12-265-11/+204
| | | | | | | | | | |
| | * | | | | | | | | New minor editsAndrew Or2013-12-263-54/+49
| | | | | | | | | | |
| | * | | | | | | | | Minor cleanup for Scala styleAaron Davidson2013-12-263-55/+55
| | | | | | | | | | |
| | * | | | | | | | | Add toggle for ExternalAppendOnlyMap in Aggregator and CoGroupedRDDAndrew Or2013-12-263-24/+65
| | | | | | | | | | |
| | * | | | | | | | | Provide for cases when mergeCombiners is not specified in ExternalAppendOnlyMapAndrew Or2013-12-262-68/+121
| | | | | | | | | | |
| | * | | | | | | | | Refactor ExternalAppendOnlyMap to take in KVC instead of just KVAndrew Or2013-12-263-76/+78
| | | | | | | | | | |
| | * | | | | | | | | Working ExternalAppendOnlyMap for both CoGroupedRDDs and AggregatorAndrew Or2013-12-264-66/+61
| | | | | | | | | | |
| | * | | | | | | | | Working ExternalAppendOnlyMap for Aggregator, but not for CoGroupedRDDAndrew Or2013-12-264-21/+182
| | | | | | | | | | |
| * | | | | | | | | | Merge pull request #369 from pillis/masterReynold Xin2014-01-102-0/+52
| |\ \ \ \ \ \ \ \ \ \ | | |_|_|_|_|_|/ / / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | SPARK-961 Add a Vector.random() method Added method and testcases
| | * | | | | | | | | SPARK-961. Add a Vector.random() method - update 1Pillis2014-01-102-10/+12
| | | | | | | | | | |
| | * | | | | | | | | SPARK-961 Add a Vector.random() methodPillis2014-01-092-0/+50
| | | | | | | | | | |
* | | | | | | | | | | Added waitForStop and stop to JavaStreamingContext.Tathagata Das2014-01-112-3/+23
| | | | | | | | | | |
* | | | | | | | | | | Converted JobScheduler to use actors for event handling. Changed ↵Tathagata Das2014-01-1117-185/+485
| |_|_|_|_|/ / / / / |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | protected[streaming] to private[streaming] in StreamingContext and DStream. Added waitForStop to StreamingContext, and StreamingContextSuite.
* | | | | | | | | | Merge remote-tracking branch 'apache/master' into driver-testTathagata Das2014-01-1037-105/+205
|\| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Conflicts: streaming/src/main/scala/org/apache/spark/streaming/DStreamGraph.scala
| * | | | | | | | | Merge pull request #371 from tgravescs/yarn_client_addjar_misc_fixesThomas Graves2014-01-108-51/+105
| |\ \ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Yarn client addjar and misc fixes Fix the addJar functionality in yarn-client mode, add support for the other options supported in yarn-standalone mode, set the application type on yarn in hadoop 2.X, add documentation, change heartbeat interval to be same code as the yarn-standalone so it doesn't take so long to get containers and exit.
| | * | | | | | | | | Update based on review commentsThomas Graves2014-01-094-27/+23
| | | | | | | | | | |
| | * | | | | | | | | add application typeThomas Graves2014-01-091-0/+1
| | | | | | | | | | |
| | * | | | | | | | | yarn-client addJar fix and misc otherThomas Graves2014-01-095-37/+94
| | | | | | | | | | |
| * | | | | | | | | | Merge pull request #384 from pwendell/debug-logsPatrick Wendell2014-01-103-4/+8
| |\ \ \ \ \ \ \ \ \ \ | | |_|_|_|_|_|/ / / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Make DEBUG-level logs consummable. Removes two things that caused issues with the debug logs: (a) Internal polling in the DAGScheduler was polluting the logs. (b) The Scala REPL logs were really noisy.
| | * | | | | | | | | Make DEBUG-level logs consummable.Patrick Wendell2014-01-103-4/+8
| | | |_|_|_|_|/ / / | | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Removes two things that caused issues with the debug logs: (a) Internal polling in the DAGScheduler was polluting the logs. (b) The Scala REPL logs were really noisy.
| * | | | | | | | | Merge pull request #375 from mateiz/option-fixMatei Zaharia2014-01-091-2/+2
| |\ \ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Fix bug added when we changed AppDescription.maxCores to an Option The Scala compiler warned about this -- we were comparing an Option against an integer now.
| | * | | | | | | | | Fix bug added when we changed AppDescription.maxCores to an OptionMatei Zaharia2014-01-091-2/+2
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The Scala compiler warned about this -- we were comparing an Option against an integer now.
| * | | | | | | | | | Merge pull request #378 from pwendell/consolidate_onPatrick Wendell2014-01-092-2/+2
| |\ \ \ \ \ \ \ \ \ \ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Enable shuffle consolidation by default. Bump this to being enabled for 0.9.0.
| | * | | | | | | | | | Enable shuffle consolidation by default.Patrick Wendell2014-01-092-2/+2
| | | |/ / / / / / / / | | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Bump this to being enabled for 0.9.0.
| * | | | | | | | | | Merge pull request #363 from pwendell/streaming-logsPatrick Wendell2014-01-0921-35/+81
| |\ \ \ \ \ \ \ \ \ \ | | |_|_|_|_|_|_|/ / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Set default logging to WARN for Spark streaming examples. This programatically sets the log level to WARN by default for streaming tests. If the user has already specified a log4j.properties file, the user's file will take precedence over this default.
| | * | | | | | | | | Minor clean-upPatrick Wendell2014-01-091-1/+1
| | | | | | | | | | |
| | * | | | | | | | | Set default logging to WARN for Spark streaming examples.Patrick Wendell2014-01-0921-34/+80
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This programatically sets the log level to WARN by default for streaming tests. If the user has already specified a log4j.properties file, the user's file will take precedence over this default.
| * | | | | | | | | | Merge pull request #353 from pwendell/ipython-simplifyPatrick Wendell2014-01-092-8/+4
| |\ \ \ \ \ \ \ \ \ \ | | |_|/ / / / / / / / | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Simplify and fix pyspark script. This patch removes compatibility for IPython < 1.0 but fixes the launch script and makes it much simpler. I tested this using the three commands in the PySpark documentation page: 1. IPYTHON=1 ./pyspark 2. IPYTHON_OPTS="notebook" ./pyspark 3. IPYTHON_OPTS="notebook --pylab inline" ./pyspark There are two changes: - We rely on PYTHONSTARTUP env var to start PySpark - Removed the quotes around $IPYTHON_OPTS... having quotes gloms them together as a single argument passed to `exec` which seemed to cause ipython to fail (it instead expects them as multiple arguments).
| | * | | | | | | | | Small fix suggested by joshPatrick Wendell2014-01-091-0/+1
| | | | | | | | | | |
| | * | | | | | | | | Simplify and fix pyspark script.Patrick Wendell2014-01-072-9/+4
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This patch removes compatibility for IPython < 1.0 but fixes the launch script and makes it much simpler. I tested this using the three commands in the PySpark documentation page: 1. IPYTHON=1 ./pyspark 2. IPYTHON_OPTS="notebook" ./pyspark 3. IPYTHON_OPTS="notebook --pylab inline" ./pyspark There are two changes: - We rely on PYTHONSTARTUP env var to start PySpark - Removed the quotes around $IPYTHON_OPTS... having quotes gloms them together as a single argument passed to `exec` which seemed to cause ipython to fail (it instead expects them as multiple arguments).