aboutsummaryrefslogtreecommitdiff
path: root/streaming
Commit message (Collapse)AuthorAgeFilesLines
...
* | | | | | | Fixed bug in the number of splits in RDD after checkpointing. Modified ↵Tathagata Das2012-11-194-3/+11
| |_|_|_|_|/ |/| | | | | | | | | | | | | | | | | reduceByKeyAndWindow (naive) computation from window+reduceByKey to reduceByKey+window+reduceByKey.
* | | | | | Merge branch 'dev' of github.com:radlab/spark into devTathagata Das2012-11-172-0/+170
|\ \ \ \ \ \
| * \ \ \ \ \ Merge pull request #4 from radlab/streaming-examplePatrick Wendell2012-11-162-0/+170
| |\ \ \ \ \ \ | | |_|_|_|_|/ | |/| | | | | A "streaming page view" example.
| | * | | | | A "streaming page view" example.Patrick Wendell2012-11-162-0/+170
| | |/ / / /
* | / / / / Fixed checkpointing bug in CoGroupedRDD. CoGroupSplits kept around the RDD ↵Tathagata Das2012-11-172-3/+3
|/ / / / / | | | | | | | | | | | | | | | splits of its parent RDDs, thus checkpointing its parents did not release the references to the parent splits.
* | | | | Fixed bug in MappedValuesRDD, and set default graph checkpoint interval to ↵Tathagata Das2012-11-133-5/+22
| | | | | | | | | | | | | | | | | | | | be batch duration.
* | | | | Replaced StateRDD in StateDStream with MapPartitionsRDD.Tathagata Das2012-11-131-12/+3
| | | | |
* | | | | Optimized checkpoint writing by reusing FileSystem object. Fixed bug in ↵Tathagata Das2012-11-137-106/+128
| |_|_|/ |/| | | | | | | | | | | updating of checkpoint data in DStream where the checkpointed RDDs, upon recovery, were not recognized as checkpointed RDDs and therefore deleted from HDFS. Made InputStreamsSuite more robust to timing delays.
* | | | Speeded up CheckpointSuiteTathagata Das2012-11-121-12/+14
| | | |
* | | | Changed default level on calling DStream.persist() to be MEMORY_ONLY_SER. ↵Tathagata Das2012-11-122-2/+2
| | | | | | | | | | | | | | | | Also changed the persist level of StateDStream to be MEMORY_ONLY_SER.
* | | | Fixed bugs in RawNetworkInputDStream and in its examples. Made the ↵Tathagata Das2012-11-1211-376/+167
| | | | | | | | | | | | | | | | ReducedWindowedDStream persist RDDs to MEMOERY_SER_ONLY by default. Removed unncessary examples. Added streaming-env.sh.template to add recommended setting for streaming.
* | | | Merge branch 'dev' of github.com:radlab/spark into devtdas2012-11-112-22/+148
|\ \ \ \
| * | | | Fixed bug in FileInputDStream that allowed it to miss new files. Added tests ↵Tathagata Das2012-11-112-22/+148
| | |_|/ | |/| | | | | | | | | | in the InputStreamsSuite to test checkpointing of file and network streams.
* / | | Removed unnecessary files.tdas2012-11-0810-1643/+0
|/ / /
* | | Fixed bug in InputStreamsSuitetdas2012-11-081-0/+2
| | |
* | | Added FailureTestsuite for testing multiple, repeated master failures.Tathagata Das2012-11-067-49/+256
| | |
* | | Added a few more comments to the checkpoint-related functions.Tathagata Das2012-11-051-0/+8
| |/ |/|
* | Made more bug fixes for checkpointing.Tathagata Das2012-11-056-66/+97
| |
* | Fixed major bugs in checkpointing.Tathagata Das2012-11-058-85/+213
| |
* | Made checkpointing of dstream graph to work with checkpointing of RDDs. For ↵Tathagata Das2012-11-0415-188/+337
|/ | | | streams requiring checkpointing of its RDD, the default checkpoint interval is set to 10 seconds.
* Fixed serialization bug in countByWindow, added countByKey and ↵Tathagata Das2012-11-025-48/+186
| | | | countByKeyAndWindow, and added testcases for them.
* Added save operations to DStreams.Tathagata Das2012-10-273-3/+84
|
* Merge branch 'dev' of github.com:radlab/spark into devTathagata Das2012-10-2516-23/+29
|\
| * Merge remote-tracking branch 'public/master' into devMatei Zaharia2012-10-2416-23/+29
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Conflicts: core/src/main/scala/spark/BlockStoreShuffleFetcher.scala core/src/main/scala/spark/KryoSerializer.scala core/src/main/scala/spark/MapOutputTracker.scala core/src/main/scala/spark/RDD.scala core/src/main/scala/spark/SparkContext.scala core/src/main/scala/spark/executor/Executor.scala core/src/main/scala/spark/network/Connection.scala core/src/main/scala/spark/network/ConnectionManagerTest.scala core/src/main/scala/spark/rdd/BlockRDD.scala core/src/main/scala/spark/rdd/NewHadoopRDD.scala core/src/main/scala/spark/scheduler/ShuffleMapTask.scala core/src/main/scala/spark/scheduler/cluster/StandaloneSchedulerBackend.scala core/src/main/scala/spark/storage/BlockManager.scala core/src/main/scala/spark/storage/BlockMessage.scala core/src/main/scala/spark/storage/BlockStore.scala core/src/main/scala/spark/storage/StorageLevel.scala core/src/main/scala/spark/util/AkkaUtils.scala project/SparkBuild.scala run
* | Added tests for the file input stream.Tathagata Das2012-10-242-8/+64
|/
* Minor fixes.Tathagata Das2012-10-244-27/+27
|
* Added tests for testing network input stream.Tathagata Das2012-10-245-23/+134
|
* Renamed the streaming testsuites.Tathagata Das2012-10-234-6/+8
|
* Renamed APIs to create queueStream and fileStream.Tathagata Das2012-10-235-10/+10
|
* Updated StateDStream api to use Options instead of nulls.Tathagata Das2012-10-233-18/+18
|
* Renamed the network input streams.Tathagata Das2012-10-237-35/+40
|
* Modified API of NetworkInputDStreams and got ObjectInputDStream and ↵Tathagata Das2012-10-238-382/+360
| | | | RawInputDStream working.
* Renamed examplesTathagata Das2012-10-222-7/+7
|
* Added MapValueDStream, FlatMappedValuesDStream and CoGroupedDStream, and ↵Tathagata Das2012-10-219-35/+293
| | | | therefore DStream operations mapValue, flatMapValues, cogroup, and join. Also, added tests for DStream operations filter, glom, mapPartitions, groupByKey, mapValues, flatMapValues, cogroup, and join.
* Fixed some bugs in tests for forgetting RDDs, and made sure that use of ↵Tathagata Das2012-10-2110-89/+100
| | | | manual clock leads to a zeroTime of 0 in the DStreams (more intuitive).
* Added functionality to forget RDDs from DStreams.Tathagata Das2012-10-1911-79/+224
|
* Minor modifications.Tathagata Das2012-10-153-13/+14
|
* Refactored DStreamSuiteBase to create CheckpointSuite- testsuite for testing ↵Tathagata Das2012-10-1412-135/+290
| | | | checkpointing under different operations.
* Fixed bugs in the streaming testsuites.Tathagata Das2012-10-132-7/+18
|
* Implemented checkpointing of StreamingContext and DStream graph.Tathagata Das2012-10-1313-163/+534
|
* Merge branch 'dev' of github.com:radlab/spark into devTathagata Das2012-09-175-31/+46
|\
| * Minor tweaksTathagata Das2012-09-142-22/+24
| |
| * Merge branch 'dev' of github.com:radlab/spark into devTathagata Das2012-09-0720-404/+824
| |\ | | | | | | | | | | | | Conflicts: streaming/src/main/scala/spark/streaming/Scheduler.scala
| | * add warmup code for TopKWordCountRaw.scalahaoyuan2012-09-061-5/+14
| | |
| * | Fixed bugs in streaming Scheduler and optimized QueueInputDStream.Tathagata Das2012-09-073-4/+8
| | |
* | | Allowed StreamingContext to be created from existing SparkContextTathagata Das2012-09-171-8/+5
| |/ |/|
* | Merge branch 'dev' of github.com:radlab/spark into devhaoyuan2012-09-079-313/+424
|\ \
| * | Fixed bugs and added testcases for naive reduceByKeyAndWindow.Tathagata Das2012-09-065-87/+140
| | |
| * | Merge branch 'dev' of github.com:radlab/spark into devTathagata Das2012-09-062-5/+5
| |\ \
| * | | Re-implemented ReducedWindowedDSteam to simplify and fix bugs. Added slice ↵Tathagata Das2012-09-067-276/+334
| | | | | | | | | | | | | | | | operator to DStream. Also, refactored DStream testsuites and added tests for reduceByKeyAndWindow.