path: root/python/pyspark/sql/dataframe.py
Commit message | Author | Age | Files | Lines
* [SPARK-6876] [PySpark] [SQL] add DataFrame na.replace in pyspark | Daoyuan Wang | 2015-05-12 | 1 | -0/+85
* [SPARK-7509][SQL] DataFrame.drop in Python for dropping columns. | Reynold Xin | 2015-05-11 | 1 | -1/+13
* [SPARK-7324] [SQL] DataFrame.dropDuplicates | Reynold Xin | 2015-05-11 | 1 | -2/+34
* [SPARK-7462] By default retain group by columns in aggregate | Reynold Xin | 2015-05-11 | 1 | -1/+1
* [SPARK-7133] [SQL] Implement struct, array, and map field accessor | Wenchen Fan | 2015-05-08 | 1 | -12/+12
* [SPARK-7295][SQL] bitwise operations for DataFrame DSL | Shiti | 2015-05-07 | 1 | -0/+5
* [SPARK-7294][SQL] ADD BETWEEN | 云峤 | 2015-05-05 | 1 | -0/+7
* [SPARK-7243][SQL] Reduce size for Contingency Tables in DataFrames | Burak Yavuz | 2015-05-05 | 1 | -4/+5
* [SPARK-7243][SQL] Contingency Tables for DataFrames | Burak Yavuz | 2015-05-04 | 1 | -0/+25
* [SPARK-7319][SQL] Improve the output from DataFrame.show() | 云峤 | 2015-05-04 | 1 | -36/+69
* [SPARK-7241] Pearson correlation for DataFrames | Burak Yavuz | 2015-05-03 | 1 | -0/+26
* [SPARK-3444] Fix typo in Dataframes.py introduced in [] | Dean Chen | 2015-05-02 | 1 | -1/+1
* [SPARK-7242] added python api for freqItems in DataFrames | Burak Yavuz | 2015-05-01 | 1 | -0/+25
* [SPARK-3444] Provide an easy way to change log level | Holden Karau | 2015-05-01 | 1 | -1/+1
* [SPARK-7240][SQL] Single pass covariance calculation for dataframes | Burak Yavuz | 2015-05-01 | 1 | -1/+35
* [SPARK-7156][SQL] Addressed follow up comments for randomSplit | Burak Yavuz | 2015-04-29 | 1 | -1/+6
* [SPARK-7156][SQL] support RandomSplit in DataFrames | Burak Yavuz | 2015-04-29 | 1 | -1/+17
* Better error message on access to non-existing attribute | ksonj | 2015-04-29 | 1 | -1/+2
* [SPARK-7204] [SQL] Fix callSite for Dataframe and SQL operations | Patrick Wendell | 2015-04-29 | 1 | -1/+2
* [SPARK-7060][SQL] Add alias function to python dataframe | Yin Huai | 2015-04-23 | 1 | -0/+14
* [SPARK-7059][SQL] Create a DataFrame join API to facilitate equijoin. | Reynold Xin | 2015-04-22 | 1 | -1/+8
* [SPARK-6949] [SQL] [PySpark] Support Date/Timestamp in Column expression | Davies Liu | 2015-04-21 | 1 | -14/+4
* [SPARK-6661] Python type errors should print type, not object | Elisey Zanko | 2015-04-20 | 1 | -1/+1
* Minor fix to SPARK-6958: Improve Python docstring for DataFrame.sort. | Reynold Xin | 2015-04-17 | 1 | -3/+4
* [SPARK-6957] [SPARK-6958] [SQL] improve API compatibility to pandas | Davies Liu | 2015-04-17 | 1 | -30/+66
* [SPARK-6911] [SQL] improve accessor for nested types | Davies Liu | 2015-04-16 | 1 | -5/+44
* [SPARK-4897] [PySpark] Python 3 support | Davies Liu | 2015-04-16 | 1 | -17/+46
* [SPARK-6638] [SQL] Improve performance of StringType in SQL | Davies Liu | 2015-04-15 | 1 | -5/+5
* [SPARK-6781] [SQL] use sqlContext in python shell | Davies Liu | 2015-04-08 | 1 | -3/+3
* [Doc] Improve Python DataFrame documentation | Reynold Xin | 2015-03-31 | 1 | -124/+125
* [SPARK-6623][SQL] Alias DataFrame.na.drop and DataFrame.na.fill in Python. | Reynold Xin | 2015-03-31 | 1 | -2/+39
* [SPARK-6119][SQL] DataFrame support for missing data handling | Reynold Xin | 2015-03-30 | 1 | -0/+86
* [DOC] Improvements to Python docs. | Reynold Xin | 2015-03-28 | 1 | -8/+1
* [SPARK-6117] [SQL] Improvements to DataFrame.describe() | Reynold Xin | 2015-03-26 | 1 | -0/+19
* [SPARK-6536] [PySpark] Column.inSet() in Python | Davies Liu | 2015-03-26 | 1 | -0/+17
* [SPARK-6366][SQL] In Python API, the default save mode for save and saveAsTab... | Yin Huai | 2015-03-18 | 1 | -2/+2
* [SPARK-6210] [SQL] use prettyString as column name in agg() | Davies Liu | 2015-03-14 | 1 | -16/+16
* [SPARK-6194] [SPARK-677] [PySpark] fix memory leak in collect() | Davies Liu | 2015-03-09 | 1 | -11/+3
* [SPARK-6055] [PySpark] fix incorrect __eq__ of DataType | Davies Liu | 2015-02-27 | 1 | -1/+3
* [SPARK-6007][SQL] Add numRows param in DataFrame.show() | Jacky Li | 2015-02-26 | 1 | -3/+3
* [SPARK-5994] [SQL] Python DataFrame documentation fixes | Davies Liu | 2015-02-24 | 1 | -28/+28
* [SPARK-5985][SQL] DataFrame sortBy -> orderBy in Python. | Reynold Xin | 2015-02-24 | 1 | -3/+8
* [SPARK-5904][SQL] DataFrame API fixes. | Reynold Xin | 2015-02-19 | 1 | -36/+20
* [SPARK-5722] [SQL] [PySpark] infer int as LongType | Davies Liu | 2015-02-18 | 1 | -6/+8
* [SPARK-5878] fix DataFrame.repartition() in Python | Davies Liu | 2015-02-18 | 1 | -1/+7
* [SPARK-5871] output explain in Python | Davies Liu | 2015-02-17 | 1 | -3/+20
* [SPARK-5859] [PySpark] [SQL] fix DataFrame Python API | Davies Liu | 2015-02-17 | 1 | -11/+54
* [SPARK-5799][SQL] Compute aggregation function on specified numeric columns | Liang-Chi Hsieh | 2015-02-16 | 1 | -15/+59
* [SPARK-5752][SQL] Don't implicitly convert RDDs directly to DataFrames | Reynold Xin | 2015-02-13 | 1 | -170/+51
* [SQL] Move SaveMode to SQL package. | Yin Huai | 2015-02-12 | 1 | -1/+1