Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Merge branch 'streaming' into ScrapCodes-streaming-actor | Tathagata Das | 2013-02-19 | 46 | -1056/+2026 |
|\ | | | | | | | | | | | | | | | | | | | Conflicts: docs/plugin-custom-receiver.md streaming/src/main/scala/spark/streaming/StreamingContext.scala streaming/src/main/scala/spark/streaming/dstream/KafkaInputDStream.scala streaming/src/main/scala/spark/streaming/dstream/PluggableInputDStream.scala streaming/src/main/scala/spark/streaming/receivers/ActorReceiver.scala streaming/src/test/scala/spark/streaming/InputStreamsSuite.scala | ||||
| * | Changed networkStream to socketStream and pluggableNetworkStream to become ↵ | Tathagata Das | 2013-02-18 | 4 | -21/+22 |
| | | | | | | | | networkStream as a way to create streams from arbitrary network receiver. | ||||
| * | Merge branch 'streaming' into ScrapCode-streaming | Tathagata Das | 2013-02-18 | 45 | -1027/+2015 |
| |\ | | | | | | | | | | | | | | | | Conflicts: streaming/src/main/scala/spark/streaming/dstream/KafkaInputDStream.scala streaming/src/main/scala/spark/streaming/dstream/NetworkInputDStream.scala | ||||
| | * | Added checkpointing and fault-tolerance semantics to the programming guide. ↵ | Tathagata Das | 2013-02-18 | 6 | -6/+11 |
| | | | | | | | | | | | | Fixed default checkpoint interval to being a multiple of slide duration. Fixed visibility of some classes and objects to clean up docs. | ||||
| | * | Many changes to ensure better 2nd recovery if 2nd failure happens while | Tathagata Das | 2013-02-17 | 18 | -97/+208 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | recovering from 1st failure - Made the scheduler to checkpoint after clearing old metadata which ensures that a new checkpoint is written as soon as at least one batch gets computed while recovering from a failure. This ensures that if there is a 2nd failure while recovering from 1st failure, the system start 2nd recovery from a newer checkpoint. - Modified Checkpoint writer to write checkpoint in a different thread. - Added a check to make sure that compute for InputDStreams gets called only for strictly increasing times. - Changed implementation of slice to call getOrCompute on parent DStream in time-increasing order. - Added testcase to test slice. - Fixed testGroupByKeyAndWindow testcase in JavaAPISuite to verify results with expected output in an order-independent manner. | ||||
| | * | Made MasterFailureTest more robust. | Tathagata Das | 2013-02-15 | 1 | -4/+22 |
| | | | |||||
| | * | Moved Java streaming examples to examples/src/main/java/spark/streaming/... ↵ | Tathagata Das | 2013-02-14 | 1 | -1/+1 |
| | | | | | | | | | | | | and fixed logging in NetworkInputTracker to highlight errors when receiver deregisters/shuts down. | ||||
| | * | Added TwitterInputDStream from example to StreamingContext. Renamed example ↵ | Tathagata Das | 2013-02-14 | 2 | -17/+105 |
| | | | | | | | | | | | | TwitterBasic to TwitterPopularTags. | ||||
| | * | Removed countByKeyAndWindow on paired DStreams, and added ↵ | Tathagata Das | 2013-02-14 | 8 | -183/+226 |
| | | | | | | | | | | | | countByValueAndWindow for all DStreams. Updated both scala and java API and testsuites. | ||||
| | * | Changes functions comments to make them more consistent. | Tathagata Das | 2013-02-13 | 2 | -45/+45 |
| | | | |||||
| | * | Added filter functionality to reduceByKeyAndWindow with inverse. ↵ | Tathagata Das | 2013-02-13 | 7 | -81/+102 |
| | | | | | | | | | | | | Consolidated reduceByKeyAndWindow's many functions into smaller number of functions with optional parameters. | ||||
| | * | Changed scheduler and file input stream to fix bugs in the driver fault ↵ | Tathagata Das | 2013-02-13 | 18 | -452/+693 |
| | | | | | | | | | | | | tolerance. Added MasterFailureTest to rigorously test master fault tolerance with file input stream. | ||||
| | * | Fixed bugs in FileInputDStream and Scheduler that occasionally failed to ↵ | Tathagata Das | 2013-02-10 | 6 | -91/+221 |
| | | | | | | | | | | | | reprocess old files after recovering from master failure. Completely modified spark.streaming.FailureTest to test multiple master failures using file input stream. | ||||
| | * | Added an initial spark job to ensure worker nodes are initialized. | Tathagata Das | 2013-02-09 | 2 | -2/+7 |
| | | | |||||
| | * | Merge branch 'mesos-master' into streaming | Tathagata Das | 2013-02-07 | 22 | -15/+195 |
| | |\ | |||||
| | | * | Streaming constructor which takes JavaSparkContext | Patrick Wendell | 2013-02-05 | 1 | -0/+8 |
| | | | | | | | | | | | | | | | | | | | | It's sometimes helpful to directly pass a JavaSparkContext, and take advantage of the various constructors available for that. | ||||
| | | * | Remove activation of profiles by default | Mikhail Bautin | 2013-01-31 | 1 | -11/+0 |
| | | | | | | | | | | | | | | | | | | | | See the discussion at https://github.com/mesos/spark/pull/355 for why default profile activation is a problem. | ||||
| | | * | Merge pull request #415 from stephenh/driver | Matei Zaharia | 2013-01-29 | 7 | -8/+8 |
| | | |\ | | | | | | | | | | | Replace old 'master' term with 'driver'. | ||||
| | | | * | Replace old 'master' term with 'driver'. | Stephen Haberman | 2013-01-25 | 7 | -8/+8 |
| | | | | | |||||
| | | * | | Fix code that depended on metadata cleaner interval being in minutes | Matei Zaharia | 2013-01-28 | 2 | -5/+5 |
| | | |/ | |||||
| | | * | Refactor daemon thread pool creation. | Josh Rosen | 2013-01-21 | 1 | -2/+3 |
| | | | | |||||
| | | * | Move JavaAPISuite into spark.streaming. | Stephen Haberman | 2013-01-21 | 2 | -0/+0 |
| | | | | |||||
| | | * | Add Maven build file for streaming, and fix some issues in SBT file | Matei Zaharia | 2013-01-20 | 10 | -0/+182 |
| | | | | | | | | | | | | | | | | | | | | | | | | As part of this, changed our Scala 2.9.2 Kafka library to be available as a local Maven repository, following the example in (http://blog.dub.podval.org/2010/01/maven-in-project-repository.html) | ||||
| | * | | Updated JavaStreamingContext with updated kafkaStream API. | Tathagata Das | 2013-02-07 | 1 | -17/+9 |
| | | | | |||||
| | * | | Merge branch 'mesos-streaming' into streaming | Tathagata Das | 2013-02-07 | 5 | -88/+91 |
| | |\ \ | |||||
| | | * \ | Merge pull request #372 from Reinvigorate/sm-kafka | Tathagata Das | 2013-02-07 | 2 | -91/+13 |
| | | |\ \ | | | | | | | | | | | | | Removing offset management code that is non-existent in kafka 0.7.0+ | ||||
| | | | * | | kafkaStream API cleanup. A quorum of zookeepers can now be specified | seanm | 2013-01-18 | 2 | -15/+10 |
| | | | | | | |||||
| | | | * | | further KafkaInputDStream cleanup (removing unused and commented out code ↵ | seanm | 2013-01-18 | 1 | -69/+3 |
| | | | | | | | | | | | | | | | | | | | | | | | | relating to offset management) | ||||
| | | | * | | Removing offset management code that is non-existent in kafka 0.7.0+ | seanm | 2013-01-14 | 1 | -7/+0 |
| | | | | | | |||||
| | | * | | | Merge pull request #373 from Reinvigorate/sm-updateStateByKey | Tathagata Das | 2013-02-07 | 4 | -6/+78 |
| | | |\ \ \ | | | | |_|/ | | | |/| | | StateDStream changes to give updateStateByKey consistent behavior | ||||
| | | | * | | Splitting StreamingContext.queueStream into two methods | seanm | 2013-01-20 | 1 | -4/+18 |
| | | | | | | |||||
| | | | * | | adding updateStateByKey object lifecycle test | seanm | 2013-01-20 | 2 | -0/+50 |
| | | | | | | |||||
| | | | * | | Merge branch 'streaming' into sm-updateStateByKey | seanm | 2013-01-15 | 3 | -8/+8 |
| | | | |\ \ | |||||
| | | | * | | | StateDStream changes to give updateStateByKey consistent behavior | seanm | 2013-01-14 | 1 | -2/+10 |
| | | | | |/ | | | | |/| | |||||
| | * | | | | Fixed checkpoint testcases | Tathagata Das | 2013-01-23 | 3 | -172/+129 |
| | | | | | | |||||
| | * | | | | Added support for rescheduling unprocessed batches on master failure. | Tathagata Das | 2013-01-23 | 5 | -12/+53 |
| | | | | | | |||||
| | * | | | | Added support for saving input files of FileInputDStream to graph ↵ | Tathagata Das | 2013-01-22 | 6 | -66/+159 |
| | | | | | | | | | | | | | | | | | | | | | | | | checkpoints. Modified 'file input stream with checkpoint' testcase to test recovery of pre-master-failure input files. | ||||
| | * | | | | Refactored DStreamCheckpointData. | Tathagata Das | 2013-01-22 | 4 | -64/+99 |
| | |/ / / | |||||
| * | | | | Plug in actor as stream receiver API | Prashant Sharma | 2013-01-19 | 3 | -0/+152 |
| | | | | | |||||
| * | | | | Changed method name of createReceiver to getReceiver as it is not intended ↵ | Prashant Sharma | 2013-01-19 | 6 | -28/+28 |
| | | | | | | | | | | | | | | | | | | | | to be a factory. | ||||
* | | | | | actor as receiver | Prashant Sharma | 2013-01-22 | 4 | -0/+269 |
| | | | | | |||||
* | | | | | Changed method name of createReceiver to getReceiver as it is not intended ↵ | Prashant Sharma | 2013-01-21 | 6 | -8/+8 |
| |/ / / |/| | | | | | | | | | | | to be a factory. | ||||
* | | | | Fixed streaming testsuite bugs | Tathagata Das | 2013-01-20 | 7 | -6/+24 |
| | | | | |||||
* | | | | Merge branch 'mesos-streaming' into streaming | Tathagata Das | 2013-01-20 | 8 | -120/+2452 |
|\| | | | | | | | | | | | | | | | | | | | | | | | | | | | Conflicts: core/src/main/scala/spark/api/java/JavaRDDLike.scala core/src/main/scala/spark/api/java/JavaSparkContext.scala core/src/test/scala/spark/JavaAPISuite.java | ||||
| * | | | Moving tests to appropriate directory | Patrick Wendell | 2013-01-17 | 2 | -0/+0 |
| | | | | |||||
| * | | | Adding queueStream and some slight refactoring | Patrick Wendell | 2013-01-17 | 2 | -81/+163 |
| | | | | |||||
| * | | | Merge branch 'streaming' into streaming-java-api | Patrick Wendell | 2013-01-17 | 1 | -3/+3 |
| |\ \ \ | | | |/ | | |/| | |||||
| * | | | Import fixup | Patrick Wendell | 2013-01-17 | 3 | -3/+0 |
| | | | | |||||
| * | | | Style cleanup | Patrick Wendell | 2013-01-17 | 3 | -3/+26 |
| | | | | |||||
| * | | | Checkpointing in Streaming java API | Patrick Wendell | 2013-01-17 | 3 | -3/+93 |
| | | | |