index
:
spark
2.12/build
SPARK-10001-hotfix
SPARK-10001-sigint
SPARK-14511-genjavadoc
SPARK-17647
WIP-SPARK-17647
escape
genjavadoc
macros
master
packages
scala-2.12
value-classes
Mirror of Apache Spark
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
python
/
pyspark
/
sql
/
dataframe.py
Commit message (
Expand
)
Author
Age
Files
Lines
*
Revert "[SPARK-7157][SQL] add sampleBy to DataFrame"
Reynold Xin
2015-06-23
1
-40
/
+0
*
[SPARK-7157][SQL] add sampleBy to DataFrame
Xiangrui Meng
2015-06-23
1
-0
/
+40
*
[SPARK-6390] [SQL] [MLlib] Port MatrixUDT to PySpark
MechCoder
2015-06-17
1
-1
/
+5
*
[SPARK-7886] Add built-in expressions to FunctionRegistry.
Reynold Xin
2015-06-09
1
-1
/
+1
*
[SPARK-7990][SQL] Add methods to facilitate equi-join on multiple joining keys
Liang-Chi Hsieh
2015-06-08
1
-13
/
+32
*
[SPARK-8146] DataFrame Python API: Alias replace in df.na
Reynold Xin
2015-06-07
1
-25
/
+22
*
[SPARK-7991] [PySpark] Adding support for passing lists to describe.
amey
2015-06-05
1
-0
/
+12
*
[SPARK-7969] [SQL] Added a DataFrame.drop function that accepts a Column refe...
Mike Dusenberry
2015-06-04
1
-3
/
+18
*
[SPARK-8060] Improve DataFrame Python test coverage and documentation.
Reynold Xin
2015-06-03
1
-55
/
+27
*
[minor doc] Add exploratory data analysis warning for DataFrame.stat.freqItem...
Reynold Xin
2015-06-01
1
-0
/
+3
*
[SPARK-7840] add insertInto() to Writer
Davies Liu
2015-05-23
1
-1
/
+1
*
[SPARK-7322, SPARK-7836, SPARK-7822][SQL] DataFrame window function related u...
Davies Liu
2015-05-23
1
-0
/
+2
*
[SPARK-7783] [SQL] [PySpark] add DataFrame.rollup/cube in Python
Davies Liu
2015-05-21
1
-2
/
+46
*
[SPARK-7606] [SQL] [PySpark] add version to Python SQL API docs
Davies Liu
2015-05-20
1
-1
/
+67
*
[SPARK-7738] [SQL] [PySpark] add reader and writer API in Python
Davies Liu
2015-05-19
1
-35
/
+32
*
[SPARK-6657] [PYSPARK] Fix doc warnings
Xiangrui Meng
2015-05-18
1
-0
/
+1
*
[SPARK-7543] [SQL] [PySpark] split dataframe.py into multiple files
Davies Liu
2015-05-15
1
-446
/
+3
*
[SPARK-7548] [SQL] Add explode function for DataFrames
Michael Armbrust
2015-05-14
1
-3
/
+9
*
[SPARK-7321][SQL] Add Column expression for conditional statements (when/othe...
Reynold Xin
2015-05-12
1
-0
/
+31
*
[SPARK-6876] [PySpark] [SQL] add DataFrame na.replace in pyspark
Daoyuan Wang
2015-05-12
1
-0
/
+85
*
[SPARK-7509][SQL] DataFrame.drop in Python for dropping columns.
Reynold Xin
2015-05-11
1
-1
/
+13
*
[SPARK-7324] [SQL] DataFrame.dropDuplicates
Reynold Xin
2015-05-11
1
-2
/
+34
*
[SPARK-7462] By default retain group by columns in aggregate
Reynold Xin
2015-05-11
1
-1
/
+1
*
[SPARK-7133] [SQL] Implement struct, array, and map field accessor
Wenchen Fan
2015-05-08
1
-12
/
+12
*
[SPARK-7295][SQL] bitwise operations for DataFrame DSL
Shiti
2015-05-07
1
-0
/
+5
*
[SPARK-7294][SQL] ADD BETWEEN
云峤
2015-05-05
1
-0
/
+7
*
[SPARK-7243][SQL] Reduce size for Contingency Tables in DataFrames
Burak Yavuz
2015-05-05
1
-4
/
+5
*
[SPARK-7243][SQL] Contingency Tables for DataFrames
Burak Yavuz
2015-05-04
1
-0
/
+25
*
[SPARK-7319][SQL] Improve the output from DataFrame.show()
云峤
2015-05-04
1
-36
/
+69
*
[SPARK-7241] Pearson correlation for DataFrames
Burak Yavuz
2015-05-03
1
-0
/
+26
*
[SPARK-3444] Fix typo in Dataframes.py introduced in []
Dean Chen
2015-05-02
1
-1
/
+1
*
[SPARK-7242] added python api for freqItems in DataFrames
Burak Yavuz
2015-05-01
1
-0
/
+25
*
[SPARK-3444] Provide an easy way to change log level
Holden Karau
2015-05-01
1
-1
/
+1
*
[SPARK-7240][SQL] Single pass covariance calculation for dataframes
Burak Yavuz
2015-05-01
1
-1
/
+35
*
[SPARK-7156][SQL] Addressed follow up comments for randomSplit
Burak Yavuz
2015-04-29
1
-1
/
+6
*
[SPARK-7156][SQL] support RandomSplit in DataFrames
Burak Yavuz
2015-04-29
1
-1
/
+17
*
Better error message on access to non-existing attribute
ksonj
2015-04-29
1
-1
/
+2
*
[SPARK-7204] [SQL] Fix callSite for Dataframe and SQL operations
Patrick Wendell
2015-04-29
1
-1
/
+2
*
[SPARK-7060][SQL] Add alias function to python dataframe
Yin Huai
2015-04-23
1
-0
/
+14
*
[SPARK-7059][SQL] Create a DataFrame join API to facilitate equijoin.
Reynold Xin
2015-04-22
1
-1
/
+8
*
[SPARK-6949] [SQL] [PySpark] Support Date/Timestamp in Column expression
Davies Liu
2015-04-21
1
-14
/
+4
*
[SPARK-6661] Python type errors should print type, not object
Elisey Zanko
2015-04-20
1
-1
/
+1
*
Minor fix to SPARK-6958: Improve Python docstring for DataFrame.sort.
Reynold Xin
2015-04-17
1
-3
/
+4
*
[SPARK-6957] [SPARK-6958] [SQL] improve API compatibility to pandas
Davies Liu
2015-04-17
1
-30
/
+66
*
[SPARK-6911] [SQL] improve accessor for nested types
Davies Liu
2015-04-16
1
-5
/
+44
*
[SPARK-4897] [PySpark] Python 3 support
Davies Liu
2015-04-16
1
-17
/
+46
*
[SPARK-6638] [SQL] Improve performance of StringType in SQL
Davies Liu
2015-04-15
1
-5
/
+5
*
[SPARK-6781] [SQL] use sqlContext in python shell
Davies Liu
2015-04-08
1
-3
/
+3
*
[Doc] Improve Python DataFrame documentation
Reynold Xin
2015-03-31
1
-124
/
+125
*
[SPARK-6623][SQL] Alias DataFrame.na.drop and DataFrame.na.fill in Python.
Reynold Xin
2015-03-31
1
-2
/
+39
[next]