index
:
spark
2.12/build
SPARK-10001-hotfix
SPARK-10001-sigint
SPARK-14511-genjavadoc
SPARK-17647
WIP-SPARK-17647
escape
genjavadoc
macros
master
packages
scala-2.12
value-classes
Mirror of Apache Spark
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
python
/
pyspark
/
sql
/
dataframe.py
Commit message (
Expand
)
Author
Age
Files
Lines
*
[SPARK-6548] Adding stddev to DataFrame functions
JihongMa
2015-09-12
1
-18
/
+18
*
[SPARK-10373] [PYSPARK] move @since into pyspark from sql
Davies Liu
2015-09-08
1
-1
/
+1
*
[SPARK-9613] [CORE] Ban use of JavaConversions and migrate all existing uses ...
Sean Owen
2015-08-25
1
-2
/
+2
*
[SPARK-10073] [SQL] Python withColumn should replace the old column
Davies Liu
2015-08-19
1
-6
/
+6
*
[SPARK-8670] [SQL] Nested columns can't be referenced in pyspark
Wenchen Fan
2015-08-14
1
-2
/
+0
*
[SPARK-9726] [PYTHON] PySpark DF join no longer accepts on=None
Brennan Ashton
2015-08-12
1
-2
/
+1
*
[SPARK-9733][SQL] Improve physical plan explain for data sources
Reynold Xin
2015-08-07
1
-3
/
+1
*
[SPARK-7157][SQL] add sampleBy to DataFrame
Xiangrui Meng
2015-07-30
1
-0
/
+41
*
[SPARK-9243] [Documentation] null -> zero in crosstab doc
Xiangrui Meng
2015-07-23
1
-1
/
+1
*
[SPARK-7902] [SPARK-6289] [SPARK-8685] [SQL] [PYSPARK] Refactor of serializat...
Davies Liu
2015-07-09
1
-13
/
+3
*
[SPARK-8770][SQL] Create BinaryOperator abstract class.
Reynold Xin
2015-07-01
1
-5
/
+5
*
[SPARK-8766] support non-ascii character in column names
Davies Liu
2015-07-01
1
-2
/
+1
*
[SPARK-8434][SQL]Add a "pretty" parameter to the "show" method to display lon...
zsxwing
2015-06-29
1
-2
/
+5
*
Revert "[SPARK-7157][SQL] add sampleBy to DataFrame"
Reynold Xin
2015-06-23
1
-40
/
+0
*
[SPARK-7157][SQL] add sampleBy to DataFrame
Xiangrui Meng
2015-06-23
1
-0
/
+40
*
[SPARK-6390] [SQL] [MLlib] Port MatrixUDT to PySpark
MechCoder
2015-06-17
1
-1
/
+5
*
[SPARK-7886] Add built-in expressions to FunctionRegistry.
Reynold Xin
2015-06-09
1
-1
/
+1
*
[SPARK-7990][SQL] Add methods to facilitate equi-join on multiple joining keys
Liang-Chi Hsieh
2015-06-08
1
-13
/
+32
*
[SPARK-8146] DataFrame Python API: Alias replace in df.na
Reynold Xin
2015-06-07
1
-25
/
+22
*
[SPARK-7991] [PySpark] Adding support for passing lists to describe.
amey
2015-06-05
1
-0
/
+12
*
[SPARK-7969] [SQL] Added a DataFrame.drop function that accepts a Column refe...
Mike Dusenberry
2015-06-04
1
-3
/
+18
*
[SPARK-8060] Improve DataFrame Python test coverage and documentation.
Reynold Xin
2015-06-03
1
-55
/
+27
*
[minor doc] Add exploratory data analysis warning for DataFrame.stat.freqItem...
Reynold Xin
2015-06-01
1
-0
/
+3
*
[SPARK-7840] add insertInto() to Writer
Davies Liu
2015-05-23
1
-1
/
+1
*
[SPARK-7322, SPARK-7836, SPARK-7822][SQL] DataFrame window function related u...
Davies Liu
2015-05-23
1
-0
/
+2
*
[SPARK-7783] [SQL] [PySpark] add DataFrame.rollup/cube in Python
Davies Liu
2015-05-21
1
-2
/
+46
*
[SPARK-7606] [SQL] [PySpark] add version to Python SQL API docs
Davies Liu
2015-05-20
1
-1
/
+67
*
[SPARK-7738] [SQL] [PySpark] add reader and writer API in Python
Davies Liu
2015-05-19
1
-35
/
+32
*
[SPARK-6657] [PYSPARK] Fix doc warnings
Xiangrui Meng
2015-05-18
1
-0
/
+1
*
[SPARK-7543] [SQL] [PySpark] split dataframe.py into multiple files
Davies Liu
2015-05-15
1
-446
/
+3
*
[SPARK-7548] [SQL] Add explode function for DataFrames
Michael Armbrust
2015-05-14
1
-3
/
+9
*
[SPARK-7321][SQL] Add Column expression for conditional statements (when/othe...
Reynold Xin
2015-05-12
1
-0
/
+31
*
[SPARK-6876] [PySpark] [SQL] add DataFrame na.replace in pyspark
Daoyuan Wang
2015-05-12
1
-0
/
+85
*
[SPARK-7509][SQL] DataFrame.drop in Python for dropping columns.
Reynold Xin
2015-05-11
1
-1
/
+13
*
[SPARK-7324] [SQL] DataFrame.dropDuplicates
Reynold Xin
2015-05-11
1
-2
/
+34
*
[SPARK-7462] By default retain group by columns in aggregate
Reynold Xin
2015-05-11
1
-1
/
+1
*
[SPARK-7133] [SQL] Implement struct, array, and map field accessor
Wenchen Fan
2015-05-08
1
-12
/
+12
*
[SPARK-7295][SQL] bitwise operations for DataFrame DSL
Shiti
2015-05-07
1
-0
/
+5
*
[SPARK-7294][SQL] ADD BETWEEN
云峤
2015-05-05
1
-0
/
+7
*
[SPARK-7243][SQL] Reduce size for Contingency Tables in DataFrames
Burak Yavuz
2015-05-05
1
-4
/
+5
*
[SPARK-7243][SQL] Contingency Tables for DataFrames
Burak Yavuz
2015-05-04
1
-0
/
+25
*
[SPARK-7319][SQL] Improve the output from DataFrame.show()
云峤
2015-05-04
1
-36
/
+69
*
[SPARK-7241] Pearson correlation for DataFrames
Burak Yavuz
2015-05-03
1
-0
/
+26
*
[SPARK-3444] Fix typo in Dataframes.py introduced in []
Dean Chen
2015-05-02
1
-1
/
+1
*
[SPARK-7242] added python api for freqItems in DataFrames
Burak Yavuz
2015-05-01
1
-0
/
+25
*
[SPARK-3444] Provide an easy way to change log level
Holden Karau
2015-05-01
1
-1
/
+1
*
[SPARK-7240][SQL] Single pass covariance calculation for dataframes
Burak Yavuz
2015-05-01
1
-1
/
+35
*
[SPARK-7156][SQL] Addressed follow up comments for randomSplit
Burak Yavuz
2015-04-29
1
-1
/
+6
*
[SPARK-7156][SQL] support RandomSplit in DataFrames
Burak Yavuz
2015-04-29
1
-1
/
+17
*
Better error message on access to non-existing attribute
ksonj
2015-04-29
1
-1
/
+2
[next]