aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark/sql/dataframe.py
Commit message (Expand)AuthorAgeFilesLines
* [SPARK-19454][PYTHON][SQL] DataFrame.replace improvementszero3232017-04-051-25/+56
* [SPARK-20041][DOC] Update docs for NaN handling in approxQuantileZheng RuiFeng2017-03-211-1/+2
* [SPARK-19497][SS] Implement streaming deduplicationShixiong Zhu2017-02-231-0/+6
* [SPARK-19399][SPARKR] Add R coalesce API for DataFrame and ColumnFelix Cheung2017-02-151-1/+9
* [SPARK-19453][PYTHON][SQL][DOC] Correct and extend DataFrame.replace docstringzero3232017-02-141-6/+12
* [SPARK-14352][SQL] approxQuantile should support multi columnsZheng RuiFeng2017-02-011-7/+30
* [SPARK-19126][DOCS] Update Join Documentation Across Languagesanabranch2017-01-081-2/+3
* [SPARK-18447][DOCS] Fix the markdown for `Note:`/`NOTE:`/`Note that` across P...hyukjinkwon2016-11-221-15/+13
* [SPARK-18493] Add missing python APIs: withWatermark and checkpoint to dataframeBurak Yavuz2016-11-211-3/+54
* [SPARK-18365][DOCS] Improve Sample Method Documentationanabranch2016-11-171-0/+5
* [SPARK-17946][PYSPARK] Python crossJoin API similar to ScalaSrinath Shankar2016-10-141-5/+21
* [SPARK-16063][SQL] Add storageLevel to DatasetNick Pentreath2016-10-141-6/+30
* [SPARK-14761][SQL] Reject invalid join methods when join columns are not spec...Bijay Pathak2016-10-121-16/+15
* [SPARK-17338][SQL] add global temp viewWenchen Fan2016-10-101-2/+23
* [MINOR][PYSPARK][DOCS] Fix examples in PySpark documentationhyukjinkwon2016-09-281-1/+1
* [SPARK-17514] df.take(1) and df.limit(1).collect() should perform the same in...Josh Rosen2016-09-141-4/+1
* [SPARK-17298][SQL] Require explicit CROSS join for cartesian productsSrinath Shankar2016-09-031-1/+1
* [SPARK-16772] Correct API doc references to PySpark classes + formatting fixesNicholas Chammas2016-07-281-1/+1
* [SPARK-16651][PYSPARK][DOC] Make `withColumnRenamed/drop` description more co...Dongjoon Hyun2016-07-221-0/+2
* [DOC] improve python doc for rdd.histogram and dataframe.joinMortada Mehyar2016-07-181-5/+5
* [SPARK-16546][SQL][PYSPARK] update python dataframe.dropWeichenXu2016-07-141-8/+19
* [SPARK-16429][SQL] Include `StringType` columns in `describe()`Dongjoon Hyun2016-07-081-4/+4
* [SPARK-16052][SQL] Improve `CollapseRepartition` optimizer for Repartition/Re...Dongjoon Hyun2016-07-081-2/+2
* [MINOR][PYSPARK][DOC] Fix wrongly formatted examples in PySpark documentationhyukjinkwon2016-07-061-4/+4
* [SPARK-16266][SQL][STREAING] Moved DataStreamReader/Writer from pyspark.sql t...Tathagata Das2016-06-281-1/+2
* [MINOR][DOCS][STRUCTURED STREAMING] Minor doc fixes around `DataFrameWriter` ...Burak Yavuz2016-06-281-2/+2
* [SPARK-16128][SQL] Allow setting length of characters to be truncated to, in ...Prashant Sharma2016-06-281-3/+15
* [SPARK-15953][WIP][STREAMING] Renamed ContinuousQuery to StreamingQueryTathagata Das2016-06-151-1/+1
* [SPARK-15933][SQL][STREAMING] Refactored DF reader-writer to use readStream a...Tathagata Das2016-06-141-2/+16
* [SPARK-15392][SQL] fix default value of size estimation of logical planDavies Liu2016-05-191-1/+1
* [SPARK-14603][SQL][FOLLOWUP] Verification of Metadata Operations by Session C...gatorsmile2016-05-191-2/+1
* [SPARK-15171][SQL] Deprecate registerTempTable and add dataset.createTempViewSean Zhong2016-05-121-3/+48
* [SPARK-15278] [SQL] Remove experimental tag from Python DataFrameReynold Xin2016-05-111-2/+2
* [MINOR] remove dead codeDavies Liu2016-05-041-9/+0
* [SPARK-14555] First cut of Python API for Structured StreamingBurak Yavuz2016-04-201-0/+12
* [SPARK-14717] [PYTHON] Scala, Python APIs for Dataset.unpersist differ in def...felixcheung2016-04-191-1/+3
* [SPARK-14573][PYSPARK][BUILD] Fix PyDoc Makefile & highlighting issuesHolden Karau2016-04-141-1/+1
* [SPARK-14334] [SQL] add toLocalIterator for Dataset/DataFrameDavies Liu2016-04-041-0/+14
* [SPARK-14142][SQL] Replace internal use of unionAll with unionReynold Xin2016-03-241-2/+2
* [SPARK-14088][SQL] Some Dataset API touch-upReynold Xin2016-03-221-2/+12
* [SPARK-10380][SQL] Fix confusing documentation examples for astype/drop_dupli...Reynold Xin2016-03-141-5/+15
* [SPARK-13671] [SPARK-13311] [SQL] Use different physical plans for RDD and da...Davies Liu2016-03-121-2/+1
* [SPARK-13594][SQL] remove typed operations(e.g. map, flatMap) from python Dat...Wenchen Fan2016-03-021-40/+2
* [SPARK-13479][SQL][PYTHON] Added Python API for approxQuantileJoseph K. Bradley2016-02-241-0/+54
* [SPARK-13250] [SQL] Update PhysicallRDD to convert to UnsafeRow if using the ...Nong Li2016-02-241-1/+2
* [SPARK-13329] [SQL] considering output for statistics of logical planDavies Liu2016-02-231-2/+2
* [SPARK-13296][SQL] Move UserDefinedFunction into sql.expressions.Reynold Xin2016-02-131-1/+1
* [SPARK-12706] [SQL] grouping() and grouping_id()Davies Liu2016-02-101-11/+11
* [SPARK-5865][API DOC] Add doc warnings for methods that return local data str...Tommy YU2016-02-061-0/+6
* [SPARK-12756][SQL] use hash expression in ExchangeWenchen Fan2016-01-131-13/+13