aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark/sql/dataframe.py
Commit message (Expand)AuthorAgeFilesLines
* [SPARK-17514] df.take(1) and df.limit(1).collect() should perform the same in...Josh Rosen2016-09-141-4/+1
* [SPARK-17298][SQL] Require explicit CROSS join for cartesian productsSrinath Shankar2016-09-031-1/+1
* [SPARK-16772] Correct API doc references to PySpark classes + formatting fixesNicholas Chammas2016-07-281-1/+1
* [SPARK-16651][PYSPARK][DOC] Make `withColumnRenamed/drop` description more co...Dongjoon Hyun2016-07-221-0/+2
* [DOC] improve python doc for rdd.histogram and dataframe.joinMortada Mehyar2016-07-181-5/+5
* [SPARK-16546][SQL][PYSPARK] update python dataframe.dropWeichenXu2016-07-141-8/+19
* [SPARK-16429][SQL] Include `StringType` columns in `describe()`Dongjoon Hyun2016-07-081-4/+4
* [SPARK-16052][SQL] Improve `CollapseRepartition` optimizer for Repartition/Re...Dongjoon Hyun2016-07-081-2/+2
* [MINOR][PYSPARK][DOC] Fix wrongly formatted examples in PySpark documentationhyukjinkwon2016-07-061-4/+4
* [SPARK-16266][SQL][STREAING] Moved DataStreamReader/Writer from pyspark.sql t...Tathagata Das2016-06-281-1/+2
* [MINOR][DOCS][STRUCTURED STREAMING] Minor doc fixes around `DataFrameWriter` ...Burak Yavuz2016-06-281-2/+2
* [SPARK-16128][SQL] Allow setting length of characters to be truncated to, in ...Prashant Sharma2016-06-281-3/+15
* [SPARK-15953][WIP][STREAMING] Renamed ContinuousQuery to StreamingQueryTathagata Das2016-06-151-1/+1
* [SPARK-15933][SQL][STREAMING] Refactored DF reader-writer to use readStream a...Tathagata Das2016-06-141-2/+16
* [SPARK-15392][SQL] fix default value of size estimation of logical planDavies Liu2016-05-191-1/+1
* [SPARK-14603][SQL][FOLLOWUP] Verification of Metadata Operations by Session C...gatorsmile2016-05-191-2/+1
* [SPARK-15171][SQL] Deprecate registerTempTable and add dataset.createTempViewSean Zhong2016-05-121-3/+48
* [SPARK-15278] [SQL] Remove experimental tag from Python DataFrameReynold Xin2016-05-111-2/+2
* [MINOR] remove dead codeDavies Liu2016-05-041-9/+0
* [SPARK-14555] First cut of Python API for Structured StreamingBurak Yavuz2016-04-201-0/+12
* [SPARK-14717] [PYTHON] Scala, Python APIs for Dataset.unpersist differ in def...felixcheung2016-04-191-1/+3
* [SPARK-14573][PYSPARK][BUILD] Fix PyDoc Makefile & highlighting issuesHolden Karau2016-04-141-1/+1
* [SPARK-14334] [SQL] add toLocalIterator for Dataset/DataFrameDavies Liu2016-04-041-0/+14
* [SPARK-14142][SQL] Replace internal use of unionAll with unionReynold Xin2016-03-241-2/+2
* [SPARK-14088][SQL] Some Dataset API touch-upReynold Xin2016-03-221-2/+12
* [SPARK-10380][SQL] Fix confusing documentation examples for astype/drop_dupli...Reynold Xin2016-03-141-5/+15
* [SPARK-13671] [SPARK-13311] [SQL] Use different physical plans for RDD and da...Davies Liu2016-03-121-2/+1
* [SPARK-13594][SQL] remove typed operations(e.g. map, flatMap) from python Dat...Wenchen Fan2016-03-021-40/+2
* [SPARK-13479][SQL][PYTHON] Added Python API for approxQuantileJoseph K. Bradley2016-02-241-0/+54
* [SPARK-13250] [SQL] Update PhysicallRDD to convert to UnsafeRow if using the ...Nong Li2016-02-241-1/+2
* [SPARK-13329] [SQL] considering output for statistics of logical planDavies Liu2016-02-231-2/+2
* [SPARK-13296][SQL] Move UserDefinedFunction into sql.expressions.Reynold Xin2016-02-131-1/+1
* [SPARK-12706] [SQL] grouping() and grouping_id()Davies Liu2016-02-101-11/+11
* [SPARK-5865][API DOC] Add doc warnings for methods that return local data str...Tommy YU2016-02-061-0/+6
* [SPARK-12756][SQL] use hash expression in ExchangeWenchen Fan2016-01-131-13/+13
* [SPARK-12600][SQL] Remove deprecated methods in Spark SQLReynold Xin2016-01-041-47/+1
* [SPARK-12520] [PYSPARK] Correct Descriptions and Add Use Cases in Equi-Joingatorsmile2015-12-271-1/+4
* [SQL] Fix mistake doc of join type for dataframe.joinYanbo Liang2015-12-191-1/+1
* [SPARK-12091] [PYSPARK] Deprecate the JAVA-specific deserialized storage levelsgatorsmile2015-12-181-3/+3
* [SPARK-12012][SQL] Show more comprehensive PhysicalRDD metadata when visualiz...Cheng Lian2015-12-091-1/+1
* [SPARK-11969] [SQL] [PYSPARK] visualization of SQL query for pysparkDavies Liu2015-11-251-1/+1
* [SPARK-11720][SQL][ML] Handle edge cases when count = 0 or 1 for Stats functionJihongMa2015-11-181-1/+1
* [SPARK-11420] Updating Stddev support via Imperative AggregateJihongMa2015-11-121-1/+1
* [SPARK-9830][SQL] Remove AggregateExpression1 and Aggregate Operator used to ...Yin Huai2015-11-101-1/+1
* [SPARK-11410] [PYSPARK] Add python bindings for repartition and sortW…Nong Li2015-11-061-16/+101
* [SPARK-10116][CORE] XORShiftRandom.hashSeed is random in high bitsImran Rashid2015-11-061-3/+3
* [SPARK-11279][PYSPARK] Add DataFrame#toDF in PySparkJeff Zhang2015-10-261-0/+12
* [SPARK-11205][PYSPARK] Delegate to scala DataFrame API rather than p…Jeff Zhang2015-10-201-1/+2
* [SPARK-10782] [PYTHON] Update dropDuplicates documentationasokadiggs2015-09-291-0/+2
* [SPARK-10731] [SQL] Delegate to Scala's DataFrame.take implementation in Pyth...Reynold Xin2015-09-231-1/+4