aboutsummaryrefslogtreecommitdiff
path: root/streaming/src/main
diff options
context:
space:
mode:
authorGabriele Nizzoli <mail@nizzoli.net>2014-07-08 14:23:38 -0700
committerTathagata Das <tathagata.das1565@gmail.com>2014-07-08 14:24:50 -0700
commit885489112c82eb909df7efbf0515fd7abfae41a4 (patch)
treea8804cd9f62deb26ad6d81f72a2f4a21f7441548 /streaming/src/main
parent1c12b0b5ca7cffa6fff4341ebc2823938601f71e (diff)
downloadspark-885489112c82eb909df7efbf0515fd7abfae41a4.tar.gz
spark-885489112c82eb909df7efbf0515fd7abfae41a4.tar.bz2
spark-885489112c82eb909df7efbf0515fd7abfae41a4.zip
[SPARK-2362] Fix for newFilesOnly logic in file DStream
The newFilesOnly logic should be inverted: the logic should be that if the flag newFilesOnly==true then only start reading files older than current time. As the code is now if newFilesOnly==true then it will start to read files that are older than 0L (that is: every file in the directory). Author: Gabriele Nizzoli <mail@nizzoli.net> Closes #1077 from gabrielenizzoli/master and squashes the following commits: 4f1d261 [Gabriele Nizzoli] Fix for newFilesOnly logic in file DStream (cherry picked from commit e6f7bfcfbf6aff7a9f8cd8e0a2166d0bf62b0912) Signed-off-by: Tathagata Das <tathagata.das1565@gmail.com>
Diffstat (limited to 'streaming/src/main')
-rw-r--r--streaming/src/main/scala/org/apache/spark/streaming/dstream/FileInputDStream.scala2
1 files changed, 1 insertions, 1 deletions
diff --git a/streaming/src/main/scala/org/apache/spark/streaming/dstream/FileInputDStream.scala b/streaming/src/main/scala/org/apache/spark/streaming/dstream/FileInputDStream.scala
index e878285f6a..9eecbfaef3 100644
--- a/streaming/src/main/scala/org/apache/spark/streaming/dstream/FileInputDStream.scala
+++ b/streaming/src/main/scala/org/apache/spark/streaming/dstream/FileInputDStream.scala
@@ -45,7 +45,7 @@ class FileInputDStream[K: ClassTag, V: ClassTag, F <: NewInputFormat[K,V] : Clas
// Files with mod time earlier than this is ignored. This is updated every interval
// such that in the current interval, files older than any file found in the
// previous interval will be ignored. Obviously this time keeps moving forward.
- private var ignoreTime = if (newFilesOnly) 0L else System.currentTimeMillis()
+ private var ignoreTime = if (newFilesOnly) System.currentTimeMillis() else 0L
// Latest file mod time seen till any point of time
@transient private var path_ : Path = null