index : spark
Mirror of Apache Spark
path: root/python/pyspark/sql/dataframe.py
| Commit message | Author | Age | Files | Lines |
|---|---|---|---|---|
| [SPARK-19454][PYTHON][SQL] DataFrame.replace improvements | zero323 | 2017-04-05 | 1 | -25/+56 |
| [SPARK-20041][DOC] Update docs for NaN handling in approxQuantile | Zheng RuiFeng | 2017-03-21 | 1 | -1/+2 |
| [SPARK-19497][SS] Implement streaming deduplication | Shixiong Zhu | 2017-02-23 | 1 | -0/+6 |
| [SPARK-19399][SPARKR] Add R coalesce API for DataFrame and Column | Felix Cheung | 2017-02-15 | 1 | -1/+9 |
| [SPARK-19453][PYTHON][SQL][DOC] Correct and extend DataFrame.replace docstring | zero323 | 2017-02-14 | 1 | -6/+12 |
| [SPARK-14352][SQL] approxQuantile should support multi columns | Zheng RuiFeng | 2017-02-01 | 1 | -7/+30 |
| [SPARK-19126][DOCS] Update Join Documentation Across Languages | anabranch | 2017-01-08 | 1 | -2/+3 |
| [SPARK-18447][DOCS] Fix the markdown for `Note:`/`NOTE:`/`Note that` across P... | hyukjinkwon | 2016-11-22 | 1 | -15/+13 |
| [SPARK-18493] Add missing python APIs: withWatermark and checkpoint to dataframe | Burak Yavuz | 2016-11-21 | 1 | -3/+54 |
| [SPARK-18365][DOCS] Improve Sample Method Documentation | anabranch | 2016-11-17 | 1 | -0/+5 |
| [SPARK-17946][PYSPARK] Python crossJoin API similar to Scala | Srinath Shankar | 2016-10-14 | 1 | -5/+21 |
| [SPARK-16063][SQL] Add storageLevel to Dataset | Nick Pentreath | 2016-10-14 | 1 | -6/+30 |
| [SPARK-14761][SQL] Reject invalid join methods when join columns are not spec... | Bijay Pathak | 2016-10-12 | 1 | -16/+15 |
| [SPARK-17338][SQL] add global temp view | Wenchen Fan | 2016-10-10 | 1 | -2/+23 |
| [MINOR][PYSPARK][DOCS] Fix examples in PySpark documentation | hyukjinkwon | 2016-09-28 | 1 | -1/+1 |
| [SPARK-17514] df.take(1) and df.limit(1).collect() should perform the same in... | Josh Rosen | 2016-09-14 | 1 | -4/+1 |
| [SPARK-17298][SQL] Require explicit CROSS join for cartesian products | Srinath Shankar | 2016-09-03 | 1 | -1/+1 |
| [SPARK-16772] Correct API doc references to PySpark classes + formatting fixes | Nicholas Chammas | 2016-07-28 | 1 | -1/+1 |
| [SPARK-16651][PYSPARK][DOC] Make `withColumnRenamed/drop` description more co... | Dongjoon Hyun | 2016-07-22 | 1 | -0/+2 |
| [DOC] improve python doc for rdd.histogram and dataframe.join | Mortada Mehyar | 2016-07-18 | 1 | -5/+5 |
| [SPARK-16546][SQL][PYSPARK] update python dataframe.drop | WeichenXu | 2016-07-14 | 1 | -8/+19 |
| [SPARK-16429][SQL] Include `StringType` columns in `describe()` | Dongjoon Hyun | 2016-07-08 | 1 | -4/+4 |
| [SPARK-16052][SQL] Improve `CollapseRepartition` optimizer for Repartition/Re... | Dongjoon Hyun | 2016-07-08 | 1 | -2/+2 |
| [MINOR][PYSPARK][DOC] Fix wrongly formatted examples in PySpark documentation | hyukjinkwon | 2016-07-06 | 1 | -4/+4 |
| [SPARK-16266][SQL][STREAING] Moved DataStreamReader/Writer from pyspark.sql t... | Tathagata Das | 2016-06-28 | 1 | -1/+2 |
| [MINOR][DOCS][STRUCTURED STREAMING] Minor doc fixes around `DataFrameWriter` ... | Burak Yavuz | 2016-06-28 | 1 | -2/+2 |
| [SPARK-16128][SQL] Allow setting length of characters to be truncated to, in ... | Prashant Sharma | 2016-06-28 | 1 | -3/+15 |
| [SPARK-15953][WIP][STREAMING] Renamed ContinuousQuery to StreamingQuery | Tathagata Das | 2016-06-15 | 1 | -1/+1 |
| [SPARK-15933][SQL][STREAMING] Refactored DF reader-writer to use readStream a... | Tathagata Das | 2016-06-14 | 1 | -2/+16 |
| [SPARK-15392][SQL] fix default value of size estimation of logical plan | Davies Liu | 2016-05-19 | 1 | -1/+1 |
| [SPARK-14603][SQL][FOLLOWUP] Verification of Metadata Operations by Session C... | gatorsmile | 2016-05-19 | 1 | -2/+1 |
| [SPARK-15171][SQL] Deprecate registerTempTable and add dataset.createTempView | Sean Zhong | 2016-05-12 | 1 | -3/+48 |
| [SPARK-15278] [SQL] Remove experimental tag from Python DataFrame | Reynold Xin | 2016-05-11 | 1 | -2/+2 |
| [MINOR] remove dead code | Davies Liu | 2016-05-04 | 1 | -9/+0 |
| [SPARK-14555] First cut of Python API for Structured Streaming | Burak Yavuz | 2016-04-20 | 1 | -0/+12 |
| [SPARK-14717] [PYTHON] Scala, Python APIs for Dataset.unpersist differ in def... | felixcheung | 2016-04-19 | 1 | -1/+3 |
| [SPARK-14573][PYSPARK][BUILD] Fix PyDoc Makefile & highlighting issues | Holden Karau | 2016-04-14 | 1 | -1/+1 |
| [SPARK-14334] [SQL] add toLocalIterator for Dataset/DataFrame | Davies Liu | 2016-04-04 | 1 | -0/+14 |
| [SPARK-14142][SQL] Replace internal use of unionAll with union | Reynold Xin | 2016-03-24 | 1 | -2/+2 |
| [SPARK-14088][SQL] Some Dataset API touch-up | Reynold Xin | 2016-03-22 | 1 | -2/+12 |
| [SPARK-10380][SQL] Fix confusing documentation examples for astype/drop_dupli... | Reynold Xin | 2016-03-14 | 1 | -5/+15 |
| [SPARK-13671] [SPARK-13311] [SQL] Use different physical plans for RDD and da... | Davies Liu | 2016-03-12 | 1 | -2/+1 |
| [SPARK-13594][SQL] remove typed operations(e.g. map, flatMap) from python Dat... | Wenchen Fan | 2016-03-02 | 1 | -40/+2 |
| [SPARK-13479][SQL][PYTHON] Added Python API for approxQuantile | Joseph K. Bradley | 2016-02-24 | 1 | -0/+54 |
| [SPARK-13250] [SQL] Update PhysicallRDD to convert to UnsafeRow if using the ... | Nong Li | 2016-02-24 | 1 | -1/+2 |
| [SPARK-13329] [SQL] considering output for statistics of logical plan | Davies Liu | 2016-02-23 | 1 | -2/+2 |
| [SPARK-13296][SQL] Move UserDefinedFunction into sql.expressions. | Reynold Xin | 2016-02-13 | 1 | -1/+1 |
| [SPARK-12706] [SQL] grouping() and grouping_id() | Davies Liu | 2016-02-10 | 1 | -11/+11 |
| [SPARK-5865][API DOC] Add doc warnings for methods that return local data str... | Tommy YU | 2016-02-06 | 1 | -0/+6 |
| [SPARK-12756][SQL] use hash expression in Exchange | Wenchen Fan | 2016-01-13 | 1 | -13/+13 |