aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark/sql.py
Commit message (Expand)AuthorAgeFilesLines
* [SPARK-5469] restructure pyspark.sql into multiple filesDavies Liu2015-02-091-2736/+0
* [SPARK-5678] Convert DataFrame to pandas.DataFrame and SeriesDavies Liu2015-02-091-0/+25
* [SPARK-5182] [SPARK-5528] [SPARK-5509] [SPARK-3575] [SQL] Parquet data source...Cheng Lian2015-02-051-2/+7
* [SQL][DataFrame] Minor cleanup.Reynold Xin2015-02-041-11/+0
* [SPARK-5605][SQL][DF] Allow using String to specify colum name in DSL aggrega...Reynold Xin2015-02-041-5/+8
* [SPARK-5577] Python udf for DataFrameDavies Liu2015-02-041-104/+91
* [SPARK-5588] [SQL] support select/filter by SQL expressionDavies Liu2015-02-041-10/+43
* [SPARK-5579][SQL][DataFrame] Support for project/filter using SQL expressionsReynold Xin2015-02-031-3/+2
* [SPARK-5554] [SQL] [PySpark] add more tests for DataFrame Python APIDavies Liu2015-02-031-186/+281
* [SQL] Improve DataFrame API error reportingReynold Xin2015-02-021-23/+52
* [SPARK-5464] Fix help() for Python DataFrame instancesJosh Rosen2015-01-291-3/+3
* [SPARK-5445][SQL] Consolidate Java and Scala DSL static methods.Reynold Xin2015-01-291-2/+2
* [SPARK-5445][SQL] Made DataFrame dsl usable in JavaReynold Xin2015-01-281-16/+22
* [SPARK-4586][MLLIB] Python API for ML pipeline and parametersXiangrui Meng2015-01-281-14/+0
* [SPARK-5097][SQL] DataFrameReynold Xin2015-01-271-263/+704
* [SPARK-5193][SQL] Remove Spark SQL Java-specific API.Reynold Xin2015-01-161-36/+12
* [SPARK-5274][SQL] Reconcile Java and Scala UDFRegistration.Reynold Xin2015-01-151-8/+8
* [SPARK-5138][SQL] Ensure schema can be inferred from a namedtupleGabe Mulley2015-01-121-4/+14
* [SPARK-4501][Core] - Create build/mvn to automatically download maven/zinc/sc...Brennon York2014-12-271-1/+1
* [SPARK-4860][pyspark][sql] speeding up `sample()` and `takeSample()`jbencook2014-12-231-0/+28
* [SPARK-4822] Use sphinx tags for Python doc annotationslewuathe2014-12-171-1/+1
* [SPARK-4866] support StructType as key in MapTypeDavies Liu2014-12-161-7/+10
* [SPARK-4578] fix asDict() with nested Row()Davies Liu2014-11-241-1/+1
* [SPARK-4228][SQL] SchemaRDD to JSONDan McClary2014-11-201-1/+16
* [SPARK-3886] [PySpark] simplify serializer, use AutoBatchedSerializer by defa...Davies Liu2014-11-031-11/+7
* [SPARK-4192][SQL] Internal API for Python UDTXiangrui Meng2014-11-031-2/+204
* [SPARK-3594] [PySpark] [SQL] take more rows to infer schema or samplingDavies Liu2014-11-031-68/+128
* [SPARK-3930] [SPARK-3933] Support fixed-precision decimal in SQL, and some op...Matei Zaharia2014-11-011-3/+32
* [SPARK-3569][SQL] Add metadata field to StructFieldXiangrui Meng2014-11-011-4/+11
* [SPARK-3826][SQL]enable hive-thriftserver to support hive-0.13.1wangfei2014-10-311-27/+0
* [SPARK-3988][SQL] add public API for date typeDaoyuan Wang2014-10-281-18/+39
* [SPARK-4051] [SQL] [PySpark] Convert Row into dictionaryDavies Liu2014-10-241-0/+12
* [SPARK-3909][PySpark][Doc] A corrupted format in Sphinx documents and buildin...cocoatomo2014-10-111-5/+5
* [SPARK-3713][SQL] Uses JSON to serialize DataType objectsCheng Lian2014-10-081-78/+75
* [SPARK-3412] [PySpark] Replace Epydoc with Sphinx to generate Python API docsDavies Liu2014-10-071-12/+21
* [SPARK-2461] [PySpark] Add a toString method to GeneralizedLinearModelSandy Ryza2014-10-061-1/+1
* [SPARK-3749] [PySpark] fix bugs in broadcast large closure of RDDDavies Liu2014-10-011-1/+1
* [SPARK-3478] [PySpark] Profile the Python tasksDavies Liu2014-09-301-1/+1
* [SPARK-3681] [SQL] [PySpark] fix serialization of List and Map in SchemaRDDDavies Liu2014-09-271-27/+13
* Revert "[SPARK-3478] [PySpark] Profile the Python tasks"Josh Rosen2014-09-261-1/+1
* [SPARK-3478] [PySpark] Profile the Python tasksDavies Liu2014-09-261-1/+1
* [SPARK-3592] [SQL] [PySpark] support applySchema to RDD of RowDavies Liu2014-09-191-3/+10
* [SPARK-3554] [PySpark] use broadcast automatically for large closureDavies Liu2014-09-181-2/+6
* [SPARK-3430] [PySpark] [Doc] generate PySpark API docs using SphinxDavies Liu2014-09-161-4/+8
* [SPARK-2314][SQL] Override collect and take in python library, and count in j...Aaron Staple2014-09-161-5/+42
* [SPARK-3519] add distinct(n) to PySparkMatthew Farrellee2014-09-161-2/+5
* [SPARK-3500] [SQL] use JavaSchemaRDD as SchemaRDD._jschema_rddDavies Liu2014-09-121-20/+18
* [SPARK-3417] Use new-style classes in PySparkMatthew Rocklin2014-09-081-1/+1
* [SPARK-2334] fix AttributeError when call PipelineRDD.id()Davies Liu2014-09-061-4/+5
* Spark-3406 add a default storage level to python RDD persist APIHolden Karau2014-09-061-1/+2