aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark/sql
Commit message (Expand)AuthorAgeFilesLines
* [SPARK-7509][SQL] DataFrame.drop in Python for dropping columns.Reynold Xin2015-05-111-1/+13
* [SPARK-7324] [SQL] DataFrame.dropDuplicatesReynold Xin2015-05-111-2/+34
* [SPARK-7462][SQL] Update documentation for retaining grouping columns in Data...Reynold Xin2015-05-111-0/+2
* [SPARK-7462] By default retain group by columns in aggregateReynold Xin2015-05-111-1/+1
* [SPARK-7133] [SQL] Implement struct, array, and map field accessorWenchen Fan2015-05-082-12/+19
* [SPARK-7118] [Python] Add the coalesce Spark SQL function available in PySparkOlivier Girardot2015-05-071-0/+37
* [SPARK-7295][SQL] bitwise operations for DataFrame DSLShiti2015-05-073-0/+20
* [SPARK-7358][SQL] Move DataFrame mathfunctions into functionsBurak Yavuz2015-05-053-102/+53
* [SPARK-7294][SQL] ADD BETWEEN云峤2015-05-052-0/+15
* [SPARK-7333] [MLLIB] Add BinaryClassificationEvaluator to PySparkXiangrui Meng2015-05-051-1/+1
* [SPARK-7243][SQL] Reduce size for Contingency Tables in DataFramesBurak Yavuz2015-05-051-4/+5
* [SPARK-7243][SQL] Contingency Tables for DataFramesBurak Yavuz2015-05-042-0/+34
* [SPARK-7319][SQL] Improve the output from DataFrame.show()云峤2015-05-041-36/+69
* [SPARK-7241] Pearson correlation for DataFramesBurak Yavuz2015-05-032-0/+32
* [SPARK-3444] Fix typo in Dataframes.py introduced in []Dean Chen2015-05-021-1/+1
* [SPARK-7242] added python api for freqItems in DataFramesBurak Yavuz2015-05-012-0/+32
* [SPARK-3444] Provide an easy way to change log levelHolden Karau2015-05-011-1/+1
* [SPARK-7240][SQL] Single pass covariance calculation for dataframesBurak Yavuz2015-05-013-2/+43
* [SPARK-7274] [SQL] Create Column expression for array/struct creation.Reynold Xin2015-05-011-19/+61
* [SPARK-7248] implemented random number generators for DataFramesBurak Yavuz2015-04-302-1/+34
* [SPARK-7156][SQL] Addressed follow up comments for randomSplitBurak Yavuz2015-04-291-1/+6
* [SPARK-7156][SQL] support RandomSplit in DataFramesBurak Yavuz2015-04-291-1/+17
* Better error message on access to non-existing attributeksonj2015-04-291-1/+2
* [SPARK-7204] [SQL] Fix callSite for Dataframe and SQL operationsPatrick Wendell2015-04-291-1/+2
* [SPARK-7188] added python support for math DataFrame functionsBurak Yavuz2015-04-293-1/+131
* [SPARK-7135][SQL] DataFrame expression for monotonically increasing IDs.Reynold Xin2015-04-281-1/+21
* [SPARK-7152][SQL] Add a Column expression for partition ID.Reynold Xin2015-04-261-9/+21
* [SPARK-7060][SQL] Add alias function to python dataframeYin Huai2015-04-231-0/+14
* [SPARK-7059][SQL] Create a DataFrame join API to facilitate equijoin.Reynold Xin2015-04-221-1/+8
* [SPARK-6953] [PySpark] speed up python testsReynold Xin2015-04-211-2/+2
* [SPARK-6949] [SQL] [PySpark] Support Date/Timestamp in Column expressionDavies Liu2015-04-214-23/+46
* [SPARK-6661] Python type errors should print type, not objectElisey Zanko2015-04-203-11/+11
* Minor fix to SPARK-6958: Improve Python docstring for DataFrame.sort.Reynold Xin2015-04-171-3/+4
* [SPARK-6957] [SPARK-6958] [SQL] improve API compatibility to pandasDavies Liu2015-04-173-39/+70
* [SPARK-6911] [SQL] improve accessor for nested typesDavies Liu2015-04-162-5/+62
* [SPARK-4897] [PySpark] Python 3 supportDavies Liu2015-04-166-56/+120
* [SPARK-6638] [SQL] Improve performance of StringType in SQLDavies Liu2015-04-151-5/+5
* [SPARK-6677] [SQL] [PySpark] fix cached classesDavies Liu2015-04-111-19/+20
* [SPARK-6696] [SQL] Adds HiveContext.refreshTable to PySparkCheng Lian2015-04-081-0/+9
* [SPARK-6781] [SQL] use sqlContext in python shellDavies Liu2015-04-084-46/+45
* [SPARK-6553] [pyspark] Support functools.partial as UDFksonj2015-04-012-1/+33
* [Doc] Improve Python DataFrame documentationReynold Xin2015-03-315-390/+250
* [SPARK-6623][SQL] Alias DataFrame.na.drop and DataFrame.na.fill in Python.Reynold Xin2015-03-312-6/+45
* [SPARK-6119][SQL] DataFrame support for missing data handlingReynold Xin2015-03-302-0/+182
* [SPARK-6603] [PySpark] [SQL] add SQLContext.udf and deprecate inferSchema() a...Davies Liu2015-03-301-27/+60
* [DOC] Improvements to Python docs.Reynold Xin2015-03-282-14/+9
* [SPARK-6117] [SQL] Improvements to DataFrame.describe()Reynold Xin2015-03-261-0/+19
* [SPARK-6536] [PySpark] Column.inSet() in PythonDavies Liu2015-03-261-0/+17
* [SPARK-6366][SQL] In Python API, the default save mode for save and saveAsTab...Yin Huai2015-03-181-2/+2
* [SPARK-6210] [SQL] use prettyString as column name in agg()Davies Liu2015-03-141-16/+16