index
:
spark
2.12/build
SPARK-10001-hotfix
SPARK-10001-sigint
SPARK-14511-genjavadoc
SPARK-17647
WIP-SPARK-17647
escape
genjavadoc
macros
master
packages
scala-2.12
value-classes
Mirror of Apache Spark
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
python
/
pyspark
/
sql
/
dataframe.py
Commit message (
Expand
)
Author
Age
Files
Lines
*
[SPARK-7243][SQL] Reduce size for Contingency Tables in DataFrames
Burak Yavuz
2015-05-05
1
-4
/
+5
*
[SPARK-7243][SQL] Contingency Tables for DataFrames
Burak Yavuz
2015-05-04
1
-0
/
+25
*
[SPARK-7319][SQL] Improve the output from DataFrame.show()
云峤
2015-05-04
1
-36
/
+69
*
[SPARK-7241] Pearson correlation for DataFrames
Burak Yavuz
2015-05-03
1
-0
/
+26
*
[SPARK-3444] Fix typo in Dataframes.py introduced in []
Dean Chen
2015-05-02
1
-1
/
+1
*
[SPARK-7242] added python api for freqItems in DataFrames
Burak Yavuz
2015-05-01
1
-0
/
+25
*
[SPARK-3444] Provide an easy way to change log level
Holden Karau
2015-05-01
1
-1
/
+1
*
[SPARK-7240][SQL] Single pass covariance calculation for dataframes
Burak Yavuz
2015-05-01
1
-1
/
+35
*
[SPARK-7156][SQL] Addressed follow up comments for randomSplit
Burak Yavuz
2015-04-29
1
-1
/
+6
*
[SPARK-7156][SQL] support RandomSplit in DataFrames
Burak Yavuz
2015-04-29
1
-1
/
+17
*
Better error message on access to non-existing attribute
ksonj
2015-04-29
1
-1
/
+2
*
[SPARK-7204] [SQL] Fix callSite for Dataframe and SQL operations
Patrick Wendell
2015-04-29
1
-1
/
+2
*
[SPARK-7060][SQL] Add alias function to python dataframe
Yin Huai
2015-04-23
1
-0
/
+14
*
[SPARK-7059][SQL] Create a DataFrame join API to facilitate equijoin.
Reynold Xin
2015-04-22
1
-1
/
+8
*
[SPARK-6949] [SQL] [PySpark] Support Date/Timestamp in Column expression
Davies Liu
2015-04-21
1
-14
/
+4
*
[SPARK-6661] Python type errors should print type, not object
Elisey Zanko
2015-04-20
1
-1
/
+1
*
Minor fix to SPARK-6958: Improve Python docstring for DataFrame.sort.
Reynold Xin
2015-04-17
1
-3
/
+4
*
[SPARK-6957] [SPARK-6958] [SQL] improve API compatibility to pandas
Davies Liu
2015-04-17
1
-30
/
+66
*
[SPARK-6911] [SQL] improve accessor for nested types
Davies Liu
2015-04-16
1
-5
/
+44
*
[SPARK-4897] [PySpark] Python 3 support
Davies Liu
2015-04-16
1
-17
/
+46
*
[SPARK-6638] [SQL] Improve performance of StringType in SQL
Davies Liu
2015-04-15
1
-5
/
+5
*
[SPARK-6781] [SQL] use sqlContext in python shell
Davies Liu
2015-04-08
1
-3
/
+3
*
[Doc] Improve Python DataFrame documentation
Reynold Xin
2015-03-31
1
-124
/
+125
*
[SPARK-6623][SQL] Alias DataFrame.na.drop and DataFrame.na.fill in Python.
Reynold Xin
2015-03-31
1
-2
/
+39
*
[SPARK-6119][SQL] DataFrame support for missing data handling
Reynold Xin
2015-03-30
1
-0
/
+86
*
[DOC] Improvements to Python docs.
Reynold Xin
2015-03-28
1
-8
/
+1
*
[SPARK-6117] [SQL] Improvements to DataFrame.describe()
Reynold Xin
2015-03-26
1
-0
/
+19
*
[SPARK-6536] [PySpark] Column.inSet() in Python
Davies Liu
2015-03-26
1
-0
/
+17
*
[SPARK-6366][SQL] In Python API, the default save mode for save and saveAsTab...
Yin Huai
2015-03-18
1
-2
/
+2
*
[SPARK-6210] [SQL] use prettyString as column name in agg()
Davies Liu
2015-03-14
1
-16
/
+16
*
[SPARK-6194] [SPARK-677] [PySpark] fix memory leak in collect()
Davies Liu
2015-03-09
1
-11
/
+3
*
[SPARK-6055] [PySpark] fix incorrect __eq__ of DataType
Davies Liu
2015-02-27
1
-1
/
+3
*
[SPARK-6007][SQL] Add numRows param in DataFrame.show()
Jacky Li
2015-02-26
1
-3
/
+3
*
[SPARK-5994] [SQL] Python DataFrame documentation fixes
Davies Liu
2015-02-24
1
-28
/
+28
*
[SPARK-5985][SQL] DataFrame sortBy -> orderBy in Python.
Reynold Xin
2015-02-24
1
-3
/
+8
*
[SPARK-5904][SQL] DataFrame API fixes.
Reynold Xin
2015-02-19
1
-36
/
+20
*
[SPARK-5722] [SQL] [PySpark] infer int as LongType
Davies Liu
2015-02-18
1
-6
/
+8
*
[SPARK-5878] fix DataFrame.repartition() in Python
Davies Liu
2015-02-18
1
-1
/
+7
*
[SPARK-5871] output explain in Python
Davies Liu
2015-02-17
1
-3
/
+20
*
[SPARK-5859] [PySpark] [SQL] fix DataFrame Python API
Davies Liu
2015-02-17
1
-11
/
+54
*
[SPARK-5799][SQL] Compute aggregation function on specified numeric columns
Liang-Chi Hsieh
2015-02-16
1
-15
/
+59
*
[SPARK-5752][SQL] Don't implicitly convert RDDs directly to DataFrames
Reynold Xin
2015-02-13
1
-170
/
+51
*
[SQL] Move SaveMode to SQL package.
Yin Huai
2015-02-12
1
-1
/
+1
*
[SPARK-5677] [SPARK-5734] [SQL] [PySpark] Python DataFrame API remaining tasks
Davies Liu
2015-02-11
1
-3
/
+39
*
[SPARK-5658][SQL] Finalize DDL and write support APIs
Yin Huai
2015-02-10
1
-3
/
+69
*
[SQL] Add toString to DataFrame/Column
Michael Armbrust
2015-02-10
1
-1
/
+1
*
[SPARK-5469] restructure pyspark.sql into multiple files
Davies Liu
2015-02-09
1
-0
/
+974