index
:
spark
2.12/build
SPARK-10001-hotfix
SPARK-10001-sigint
SPARK-14511-genjavadoc
SPARK-17647
WIP-SPARK-17647
escape
genjavadoc
macros
master
packages
scala-2.12
value-classes
Mirror of Apache Spark
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
python
/
pyspark
/
sql.py
Commit message (
Expand
)
Author
Age
Files
Lines
*
[SPARK-5469] restructure pyspark.sql into multiple files
Davies Liu
2015-02-09
1
-2736
/
+0
*
[SPARK-5678] Convert DataFrame to pandas.DataFrame and Series
Davies Liu
2015-02-09
1
-0
/
+25
*
[SPARK-5182] [SPARK-5528] [SPARK-5509] [SPARK-3575] [SQL] Parquet data source...
Cheng Lian
2015-02-05
1
-2
/
+7
*
[SQL][DataFrame] Minor cleanup.
Reynold Xin
2015-02-04
1
-11
/
+0
*
[SPARK-5605][SQL][DF] Allow using String to specify colum name in DSL aggrega...
Reynold Xin
2015-02-04
1
-5
/
+8
*
[SPARK-5577] Python udf for DataFrame
Davies Liu
2015-02-04
1
-104
/
+91
*
[SPARK-5588] [SQL] support select/filter by SQL expression
Davies Liu
2015-02-04
1
-10
/
+43
*
[SPARK-5579][SQL][DataFrame] Support for project/filter using SQL expressions
Reynold Xin
2015-02-03
1
-3
/
+2
*
[SPARK-5554] [SQL] [PySpark] add more tests for DataFrame Python API
Davies Liu
2015-02-03
1
-186
/
+281
*
[SQL] Improve DataFrame API error reporting
Reynold Xin
2015-02-02
1
-23
/
+52
*
[SPARK-5464] Fix help() for Python DataFrame instances
Josh Rosen
2015-01-29
1
-3
/
+3
*
[SPARK-5445][SQL] Consolidate Java and Scala DSL static methods.
Reynold Xin
2015-01-29
1
-2
/
+2
*
[SPARK-5445][SQL] Made DataFrame dsl usable in Java
Reynold Xin
2015-01-28
1
-16
/
+22
*
[SPARK-4586][MLLIB] Python API for ML pipeline and parameters
Xiangrui Meng
2015-01-28
1
-14
/
+0
*
[SPARK-5097][SQL] DataFrame
Reynold Xin
2015-01-27
1
-263
/
+704
*
[SPARK-5193][SQL] Remove Spark SQL Java-specific API.
Reynold Xin
2015-01-16
1
-36
/
+12
*
[SPARK-5274][SQL] Reconcile Java and Scala UDFRegistration.
Reynold Xin
2015-01-15
1
-8
/
+8
*
[SPARK-5138][SQL] Ensure schema can be inferred from a namedtuple
Gabe Mulley
2015-01-12
1
-4
/
+14
*
[SPARK-4501][Core] - Create build/mvn to automatically download maven/zinc/sc...
Brennon York
2014-12-27
1
-1
/
+1
*
[SPARK-4860][pyspark][sql] speeding up `sample()` and `takeSample()`
jbencook
2014-12-23
1
-0
/
+28
*
[SPARK-4822] Use sphinx tags for Python doc annotations
lewuathe
2014-12-17
1
-1
/
+1
*
[SPARK-4866] support StructType as key in MapType
Davies Liu
2014-12-16
1
-7
/
+10
*
[SPARK-4578] fix asDict() with nested Row()
Davies Liu
2014-11-24
1
-1
/
+1
*
[SPARK-4228][SQL] SchemaRDD to JSON
Dan McClary
2014-11-20
1
-1
/
+16
*
[SPARK-3886] [PySpark] simplify serializer, use AutoBatchedSerializer by defa...
Davies Liu
2014-11-03
1
-11
/
+7
*
[SPARK-4192][SQL] Internal API for Python UDT
Xiangrui Meng
2014-11-03
1
-2
/
+204
*
[SPARK-3594] [PySpark] [SQL] take more rows to infer schema or sampling
Davies Liu
2014-11-03
1
-68
/
+128
*
[SPARK-3930] [SPARK-3933] Support fixed-precision decimal in SQL, and some op...
Matei Zaharia
2014-11-01
1
-3
/
+32
*
[SPARK-3569][SQL] Add metadata field to StructField
Xiangrui Meng
2014-11-01
1
-4
/
+11
*
[SPARK-3826][SQL]enable hive-thriftserver to support hive-0.13.1
wangfei
2014-10-31
1
-27
/
+0
*
[SPARK-3988][SQL] add public API for date type
Daoyuan Wang
2014-10-28
1
-18
/
+39
*
[SPARK-4051] [SQL] [PySpark] Convert Row into dictionary
Davies Liu
2014-10-24
1
-0
/
+12
*
[SPARK-3909][PySpark][Doc] A corrupted format in Sphinx documents and buildin...
cocoatomo
2014-10-11
1
-5
/
+5
*
[SPARK-3713][SQL] Uses JSON to serialize DataType objects
Cheng Lian
2014-10-08
1
-78
/
+75
*
[SPARK-3412] [PySpark] Replace Epydoc with Sphinx to generate Python API docs
Davies Liu
2014-10-07
1
-12
/
+21
*
[SPARK-2461] [PySpark] Add a toString method to GeneralizedLinearModel
Sandy Ryza
2014-10-06
1
-1
/
+1
*
[SPARK-3749] [PySpark] fix bugs in broadcast large closure of RDD
Davies Liu
2014-10-01
1
-1
/
+1
*
[SPARK-3478] [PySpark] Profile the Python tasks
Davies Liu
2014-09-30
1
-1
/
+1
*
[SPARK-3681] [SQL] [PySpark] fix serialization of List and Map in SchemaRDD
Davies Liu
2014-09-27
1
-27
/
+13
*
Revert "[SPARK-3478] [PySpark] Profile the Python tasks"
Josh Rosen
2014-09-26
1
-1
/
+1
*
[SPARK-3478] [PySpark] Profile the Python tasks
Davies Liu
2014-09-26
1
-1
/
+1
*
[SPARK-3592] [SQL] [PySpark] support applySchema to RDD of Row
Davies Liu
2014-09-19
1
-3
/
+10
*
[SPARK-3554] [PySpark] use broadcast automatically for large closure
Davies Liu
2014-09-18
1
-2
/
+6
*
[SPARK-3430] [PySpark] [Doc] generate PySpark API docs using Sphinx
Davies Liu
2014-09-16
1
-4
/
+8
*
[SPARK-2314][SQL] Override collect and take in python library, and count in j...
Aaron Staple
2014-09-16
1
-5
/
+42
*
[SPARK-3519] add distinct(n) to PySpark
Matthew Farrellee
2014-09-16
1
-2
/
+5
*
[SPARK-3500] [SQL] use JavaSchemaRDD as SchemaRDD._jschema_rdd
Davies Liu
2014-09-12
1
-20
/
+18
*
[SPARK-3417] Use new-style classes in PySpark
Matthew Rocklin
2014-09-08
1
-1
/
+1
*
[SPARK-2334] fix AttributeError when call PipelineRDD.id()
Davies Liu
2014-09-06
1
-4
/
+5
*
Spark-3406 add a default storage level to python RDD persist API
Holden Karau
2014-09-06
1
-1
/
+2
[next]