Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Move some classes to more appropriate packages: | Matei Zaharia | 2013-09-01 | 1 | -2/+3 |
| | | | | | | * RDD, *RDDFunctions -> org.apache.spark.rdd * Utils, ClosureCleaner, SizeEstimator -> org.apache.spark.util * JavaSerializer, KryoSerializer -> org.apache.spark.serializer | ||||
* | Initial work to rename package to org.apache.spark | Matei Zaharia | 2013-09-01 | 8 | -47/+48 |
| | |||||
* | Add Apache license headers and LICENSE and NOTICE files | Matei Zaharia | 2013-07-16 | 9 | -2/+155 |
| | |||||
* | Initialize Twitter4J OAuth from system properties instead of prompting | Matei Zaharia | 2013-06-29 | 1 | -1/+1 |
| | |||||
* | Merge branch 'master' into streaming | Tathagata Das | 2013-06-24 | 4 | -14/+25 |
|\ | | | | | | | | | Conflicts: .gitignore | ||||
| * | Remove debug statements | Mridul Muralidharan | 2013-04-29 | 1 | -2/+0 |
| | | |||||
| * | Attempt to fix streaming test failures after yarn branch merge | Mridul Muralidharan | 2013-04-28 | 5 | -1/+8 |
| | | |||||
| * | Move streaming test initialization into 'before' blocks | Jey Kottalam | 2013-03-28 | 2 | -4/+8 |
| | | |||||
| * | Instead of failing to bind to a fixed, already-in-use port, let the OS ↵ | Mark Hamstra | 2013-03-01 | 1 | -8/+10 |
| | | | | | | | | choose an available port for TestServer. | ||||
| * | Changed Flume test to use the same port as other tests, so that can be ↵ | Tathagata Das | 2013-02-25 | 1 | -2/+2 |
| | | | | | | | | controlled centrally. | ||||
* | | Merge pull request #571 from Reinvigorate/sm-kafka-serializers | Tathagata Das | 2013-06-24 | 2 | -4/+19 |
|\ \ | | | | | | | Surfacing decoders on KafkaInputDStream | ||||
| * | | fixing kafkaStream Java API and adding test | seanm | 2013-05-10 | 1 | -0/+6 |
| | | | |||||
| * | | adding kafkaStream API tests | seanm | 2013-05-10 | 2 | -2/+13 |
| | | | |||||
| * | | Surfacing decoders on KafkaInputDStream | seanm | 2013-04-16 | 1 | -4/+2 |
| |/ | |||||
* / | fixing Spark Streaming count() so that 0 will be emitted when there is ↵ | seanm | 2013-04-15 | 1 | -2/+2 |
|/ | | | | nothing to count | ||||
* | Fixed differences in APIs of StreamingContext and JavaStreamingContext. ↵ | Tathagata Das | 2013-02-23 | 2 | -6/+33 |
| | | | | Change rawNetworkStream to rawSocketStream, and added twitter, actor, zeroMQ streams to JavaStreamingContext. Also added them to JavaAPISuite. | ||||
* | Merge branch 'mesos-streaming' into streaming | Tathagata Das | 2013-02-20 | 2 | -4/+48 |
|\ | | | | | | | | | Conflicts: streaming/src/test/java/spark/streaming/JavaAPISuite.java | ||||
| * | Small changes that were missing in merge | Patrick Wendell | 2013-02-19 | 1 | -0/+1 |
| | | |||||
| * | Use RDD type for `transform` operator in Java. | Patrick Wendell | 2013-02-19 | 1 | -2/+87 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This is an improved implementation of the `transform` operator in Java. The main difference is that this allows all four possible types of transform functions 1. JavaRDD -> JavaRDD 2. JavaRDD -> JavaPairRDD 3. JavaPairRDD -> JavaPairRDD 4. JavaPairRDD -> JavaRDD whereas previously only (1) and (3) were possible. Conflicts: streaming/src/test/java/spark/streaming/JavaAPISuite.java | ||||
| * | Use RDD type for `foreach` operator in Java. | Patrick Wendell | 2013-02-19 | 2 | -2/+4 |
| | | |||||
* | | Merge branch 'mesos-master' into streaming | Tathagata Das | 2013-02-20 | 1 | -3/+183 |
|\ \ | |/ |/| | | | | | | | Conflicts: core/src/main/scala/spark/rdd/CheckpointRDD.scala streaming/src/main/scala/spark/streaming/dstream/ReducedWindowedDStream.scala | ||||
| * | STREAMING-50: Support transform workaround in JavaPairDStream | Patrick Wendell | 2013-02-12 | 1 | -0/+45 |
| | | | | | | | | | | | | This ports a useful workaround (the `transform` function) to JavaPairDStream. It is necessary to do things like sorting which are not supported yet in the core streaming API. | ||||
| * | Using tuple swap() | Patrick Wendell | 2013-02-11 | 1 | -2/+2 |
| | | |||||
| * | small fix | Patrick Wendell | 2013-02-11 | 1 | -2/+2 |
| | | |||||
| * | Fix for MapPartitions | Patrick Wendell | 2013-02-11 | 1 | -15/+52 |
| | | |||||
| * | Fix for flatmap | Patrick Wendell | 2013-02-11 | 1 | -0/+42 |
| | | |||||
| * | Indentation fix | Patrick Wendell | 2013-02-11 | 1 | -10/+10 |
| | | |||||
| * | Initial cut at replacing K, V in Java files | Patrick Wendell | 2013-02-11 | 1 | -0/+56 |
| | | |||||
* | | Merge branch 'streaming' into ScrapCodes-streaming-actor | Tathagata Das | 2013-02-19 | 9 | -440/+443 |
|\ \ | | | | | | | | | | | | | | | | | | | | | | | | | | | | Conflicts: docs/plugin-custom-receiver.md streaming/src/main/scala/spark/streaming/StreamingContext.scala streaming/src/main/scala/spark/streaming/dstream/KafkaInputDStream.scala streaming/src/main/scala/spark/streaming/dstream/PluggableInputDStream.scala streaming/src/main/scala/spark/streaming/receivers/ActorReceiver.scala streaming/src/test/scala/spark/streaming/InputStreamsSuite.scala | ||||
| * | | Changed networkStream to socketStream and pluggableNetworkStream to become ↵ | Tathagata Das | 2013-02-18 | 2 | -4/+3 |
| | | | | | | | | | | | | networkStream as a way to create streams from arbitrary network receiver. | ||||
| * | | Added checkpointing and fault-tolerance semantics to the programming guide. ↵ | Tathagata Das | 2013-02-18 | 1 | -1/+1 |
| | | | | | | | | | | | | Fixed default checkpoint interval to being a multiple of slide duration. Fixed visibility of some classes and objects to clean up docs. | ||||
| * | | Many changes to ensure better 2nd recovery if 2nd failure happens while | Tathagata Das | 2013-02-17 | 7 | -29/+67 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | recovering from 1st failure - Made the scheduler to checkpoint after clearing old metadata which ensures that a new checkpoint is written as soon as at least one batch gets computed while recovering from a failure. This ensures that if there is a 2nd failure while recovering from 1st failure, the system start 2nd recovery from a newer checkpoint. - Modified Checkpoint writer to write checkpoint in a different thread. - Added a check to make sure that compute for InputDStreams gets called only for strictly increasing times. - Changed implementation of slice to call getOrCompute on parent DStream in time-increasing order. - Added testcase to test slice. - Fixed testGroupByKeyAndWindow testcase in JavaAPISuite to verify results with expected output in an order-independent manner. | ||||
| * | | Removed countByKeyAndWindow on paired DStreams, and added ↵ | Tathagata Das | 2013-02-14 | 3 | -55/+53 |
| | | | | | | | | | | | | countByValueAndWindow for all DStreams. Updated both scala and java API and testsuites. | ||||
| * | | Added filter functionality to reduceByKeyAndWindow with inverse. ↵ | Tathagata Das | 2013-02-13 | 2 | -17/+34 |
| | | | | | | | | | | | | Consolidated reduceByKeyAndWindow's many functions into smaller number of functions with optional parameters. | ||||
| * | | Changed scheduler and file input stream to fix bugs in the driver fault ↵ | Tathagata Das | 2013-02-13 | 8 | -348/+136 |
| | | | | | | | | | | | | tolerance. Added MasterFailureTest to rigorously test master fault tolerance with file input stream. | ||||
| * | | Fixed bugs in FileInputDStream and Scheduler that occasionally failed to ↵ | Tathagata Das | 2013-02-10 | 1 | -81/+200 |
| | | | | | | | | | | | | reprocess old files after recovering from master failure. Completely modified spark.streaming.FailureTest to test multiple master failures using file input stream. | ||||
| * | | Added an initial spark job to ensure worker nodes are initialized. | Tathagata Das | 2013-02-09 | 1 | -1/+1 |
| | | | |||||
| * | | Merge branch 'mesos-master' into streaming | Tathagata Das | 2013-02-07 | 7 | -6/+6 |
| |\| | |||||
| | * | Replace old 'master' term with 'driver'. | Stephen Haberman | 2013-01-25 | 6 | -6/+6 |
| | | | |||||
| | * | Move JavaAPISuite into spark.streaming. | Stephen Haberman | 2013-01-21 | 2 | -0/+0 |
| | | | |||||
| * | | Merge branch 'mesos-streaming' into streaming | Tathagata Das | 2013-02-07 | 2 | -0/+50 |
| |\ \ | |||||
| | * \ | Merge pull request #373 from Reinvigorate/sm-updateStateByKey | Tathagata Das | 2013-02-07 | 2 | -0/+50 |
| | |\ \ | | | |/ | | |/| | StateDStream changes to give updateStateByKey consistent behavior | ||||
| | | * | adding updateStateByKey object lifecycle test | seanm | 2013-01-20 | 2 | -0/+50 |
| | | | | |||||
| * | | | Fixed checkpoint testcases | Tathagata Das | 2013-01-23 | 3 | -172/+129 |
| | | | | |||||
| * | | | Added support for rescheduling unprocessed batches on master failure. | Tathagata Das | 2013-01-23 | 1 | -7/+16 |
| | | | | |||||
| * | | | Added support for saving input files of FileInputDStream to graph ↵ | Tathagata Das | 2013-01-22 | 1 | -20/+44 |
| | | | | | | | | | | | | | | | | checkpoints. Modified 'file input stream with checkpoint' testcase to test recovery of pre-master-failure input files. | ||||
| * | | | Refactored DStreamCheckpointData. | Tathagata Das | 2013-01-22 | 1 | -6/+6 |
| |/ / | |||||
* / / | actor as receiver | Prashant Sharma | 2013-01-22 | 1 | -0/+68 |
|/ / | |||||
* | | Fixed streaming testsuite bugs | Tathagata Das | 2013-01-20 | 7 | -6/+24 |
| | | |||||
* | | Moving tests to appropriate directory | Patrick Wendell | 2013-01-17 | 2 | -0/+0 |
| | |