spark - Mirror of Apache Spark

	Commit message	Author	Age	Files	Lines
*	[SPARK-5469] restructure pyspark.sql into multiple files	Davies Liu	2015-02-09	1	-2736/+0
*	[SPARK-5678] Convert DataFrame to pandas.DataFrame and Series	Davies Liu	2015-02-09	1	-0/+25
*	[SPARK-5182] [SPARK-5528] [SPARK-5509] [SPARK-3575] [SQL] Parquet data source...	Cheng Lian	2015-02-05	1	-2/+7
*	[SQL][DataFrame] Minor cleanup.	Reynold Xin	2015-02-04	1	-11/+0
*	[SPARK-5605][SQL][DF] Allow using String to specify colum name in DSL aggrega...	Reynold Xin	2015-02-04	1	-5/+8
*	[SPARK-5577] Python udf for DataFrame	Davies Liu	2015-02-04	1	-104/+91
*	[SPARK-5588] [SQL] support select/filter by SQL expression	Davies Liu	2015-02-04	1	-10/+43
*	[SPARK-5579][SQL][DataFrame] Support for project/filter using SQL expressions	Reynold Xin	2015-02-03	1	-3/+2
*	[SPARK-5554] [SQL] [PySpark] add more tests for DataFrame Python API	Davies Liu	2015-02-03	1	-186/+281
*	[SQL] Improve DataFrame API error reporting	Reynold Xin	2015-02-02	1	-23/+52
*	[SPARK-5464] Fix help() for Python DataFrame instances	Josh Rosen	2015-01-29	1	-3/+3
*	[SPARK-5445][SQL] Consolidate Java and Scala DSL static methods.	Reynold Xin	2015-01-29	1	-2/+2
*	[SPARK-5445][SQL] Made DataFrame dsl usable in Java	Reynold Xin	2015-01-28	1	-16/+22
*	[SPARK-4586][MLLIB] Python API for ML pipeline and parameters	Xiangrui Meng	2015-01-28	1	-14/+0
*	[SPARK-5097][SQL] DataFrame	Reynold Xin	2015-01-27	1	-263/+704
*	[SPARK-5193][SQL] Remove Spark SQL Java-specific API.	Reynold Xin	2015-01-16	1	-36/+12
*	[SPARK-5274][SQL] Reconcile Java and Scala UDFRegistration.	Reynold Xin	2015-01-15	1	-8/+8
*	[SPARK-5138][SQL] Ensure schema can be inferred from a namedtuple	Gabe Mulley	2015-01-12	1	-4/+14
*	[SPARK-4501][Core] - Create build/mvn to automatically download maven/zinc/sc...	Brennon York	2014-12-27	1	-1/+1
*	[SPARK-4860][pyspark][sql] speeding up `sample()` and `takeSample()`	jbencook	2014-12-23	1	-0/+28
*	[SPARK-4822] Use sphinx tags for Python doc annotations	lewuathe	2014-12-17	1	-1/+1
*	[SPARK-4866] support StructType as key in MapType	Davies Liu	2014-12-16	1	-7/+10
*	[SPARK-4578] fix asDict() with nested Row()	Davies Liu	2014-11-24	1	-1/+1
*	[SPARK-4228][SQL] SchemaRDD to JSON	Dan McClary	2014-11-20	1	-1/+16
*	[SPARK-3886] [PySpark] simplify serializer, use AutoBatchedSerializer by defa...	Davies Liu	2014-11-03	1	-11/+7
*	[SPARK-4192][SQL] Internal API for Python UDT	Xiangrui Meng	2014-11-03	1	-2/+204
*	[SPARK-3594] [PySpark] [SQL] take more rows to infer schema or sampling	Davies Liu	2014-11-03	1	-68/+128
*	[SPARK-3930] [SPARK-3933] Support fixed-precision decimal in SQL, and some op...	Matei Zaharia	2014-11-01	1	-3/+32
*	[SPARK-3569][SQL] Add metadata field to StructField	Xiangrui Meng	2014-11-01	1	-4/+11
*	[SPARK-3826][SQL]enable hive-thriftserver to support hive-0.13.1	wangfei	2014-10-31	1	-27/+0
*	[SPARK-3988][SQL] add public API for date type	Daoyuan Wang	2014-10-28	1	-18/+39
*	[SPARK-4051] [SQL] [PySpark] Convert Row into dictionary	Davies Liu	2014-10-24	1	-0/+12
*	[SPARK-3909][PySpark][Doc] A corrupted format in Sphinx documents and buildin...	cocoatomo	2014-10-11	1	-5/+5
*	[SPARK-3713][SQL] Uses JSON to serialize DataType objects	Cheng Lian	2014-10-08	1	-78/+75
*	[SPARK-3412] [PySpark] Replace Epydoc with Sphinx to generate Python API docs	Davies Liu	2014-10-07	1	-12/+21
*	[SPARK-2461] [PySpark] Add a toString method to GeneralizedLinearModel	Sandy Ryza	2014-10-06	1	-1/+1
*	[SPARK-3749] [PySpark] fix bugs in broadcast large closure of RDD	Davies Liu	2014-10-01	1	-1/+1
*	[SPARK-3478] [PySpark] Profile the Python tasks	Davies Liu	2014-09-30	1	-1/+1
*	[SPARK-3681] [SQL] [PySpark] fix serialization of List and Map in SchemaRDD	Davies Liu	2014-09-27	1	-27/+13
*	Revert "[SPARK-3478] [PySpark] Profile the Python tasks"	Josh Rosen	2014-09-26	1	-1/+1
*	[SPARK-3478] [PySpark] Profile the Python tasks	Davies Liu	2014-09-26	1	-1/+1
*	[SPARK-3592] [SQL] [PySpark] support applySchema to RDD of Row	Davies Liu	2014-09-19	1	-3/+10
*	[SPARK-3554] [PySpark] use broadcast automatically for large closure	Davies Liu	2014-09-18	1	-2/+6
*	[SPARK-3430] [PySpark] [Doc] generate PySpark API docs using Sphinx	Davies Liu	2014-09-16	1	-4/+8
*	[SPARK-2314][SQL] Override collect and take in python library, and count in j...	Aaron Staple	2014-09-16	1	-5/+42
*	[SPARK-3519] add distinct(n) to PySpark	Matthew Farrellee	2014-09-16	1	-2/+5
*	[SPARK-3500] [SQL] use JavaSchemaRDD as SchemaRDD._jschema_rdd	Davies Liu	2014-09-12	1	-20/+18
*	[SPARK-3417] Use new-style classes in PySpark	Matthew Rocklin	2014-09-08	1	-1/+1
*	[SPARK-2334] fix AttributeError when call PipelineRDD.id()	Davies Liu	2014-09-06	1	-4/+5
*	Spark-3406 add a default storage level to python RDD persist API	Holden Karau	2014-09-06	1	-1/+2