spark - Mirror of Apache Spark

	Commit message (Expand)	Author	Age	Files	Lines
...
*	[SPARK-5469] restructure pyspark.sql into multiple files	Davies Liu	2015-02-09	11	-2755/+2961
*	[SPARK-5678] Convert DataFrame to pandas.DataFrame and Series	Davies Liu	2015-02-09	1	-0/+25
*	SPARK-5633 pyspark saveAsTextFile support for compression codec	Vladimir Vladimirov	2015-02-06	1	-2/+20
*	[SPARK-5182] [SPARK-5528] [SPARK-5509] [SPARK-3575] [SQL] Parquet data source...	Cheng Lian	2015-02-05	1	-2/+7
*	[SQL][DataFrame] Minor cleanup.	Reynold Xin	2015-02-04	1	-11/+0
*	[SPARK-5605][SQL][DF] Allow using String to specify colum name in DSL aggrega...	Reynold Xin	2015-02-04	1	-5/+8
*	[SPARK-5577] Python udf for DataFrame	Davies Liu	2015-02-04	2	-120/+113
*	[SPARK-5588] [SQL] support select/filter by SQL expression	Davies Liu	2015-02-04	1	-10/+43
*	[SPARK-5585] Flaky test in MLlib python	Davies Liu	2015-02-04	1	-1/+1
*	[SPARK-5379][Streaming] Add awaitTerminationOrTimeout	zsxwing	2015-02-04	1	-0/+9
*	[SPARK-4969][STREAMING][PYTHON] Add binaryRecords to streaming	freeman	2015-02-03	2	-1/+30
*	[SPARK-5579][SQL][DataFrame] Support for project/filter using SQL expressions	Reynold Xin	2015-02-03	1	-3/+2
*	[SPARK-5554] [SQL] [PySpark] add more tests for DataFrame Python API	Davies Liu	2015-02-03	4	-447/+581
*	[SPARK-5536] replace old ALS implementation by the new one	Xiangrui Meng	2015-02-02	1	-8/+8
*	[SPARK-5012][MLLib][PySpark]Python API for Gaussian Mixture Model	FlytxtRnD	2015-02-02	4	-5/+147
*	[SPARK-5154] [PySpark] [Streaming] Kafka streaming support in Python	Davies Liu	2015-02-02	3	-2/+100
*	[SQL] Improve DataFrame API error reporting	Reynold Xin	2015-02-02	2	-25/+56
*	Make sure only owner can read / write to directories created for the job.	Marcelo Vanzin	2015-02-02	1	-1/+2
*	[SPARK-5094][MLlib] Add Python API for Gradient Boosted Trees	Kazuki Taniguchi	2015-01-30	2	-53/+209
*	[SPARK-5464] Fix help() for Python DataFrame instances	Josh Rosen	2015-01-29	2	-3/+13
*	[SPARK-5445][SQL] Consolidate Java and Scala DSL static methods.	Reynold Xin	2015-01-29	1	-2/+2
*	[SPARK-5477] refactor stat.py	Xiangrui Meng	2015-01-29	4	-54/+96
*	[SPARK-5445][SQL] Made DataFrame dsl usable in Java	Reynold Xin	2015-01-28	1	-16/+22
*	[SPARK-5430] move treeReduce and treeAggregate from mllib to core	Xiangrui Meng	2015-01-28	1	-1/+90
*	[SPARK-4586][MLLIB] Python API for ML pipeline and parameters	Xiangrui Meng	2015-01-28	16	-16/+1124
*	[SPARK-4387][PySpark] Refactoring python profiling code to make it extensible	Yandu Oppacher	2015-01-28	8	-71/+232
*	[SPARK-5440][pyspark] Add toLocalIterator to pyspark rdd	Michael Nazario	2015-01-28	1	-0/+14
*	SPARK-5458. Refer to aggregateByKey instead of combineByKey in docs	Sandy Ryza	2015-01-28	1	-2/+2
*	[SPARK-5361]Multiple Java RDD <-> Python RDD conversions not working correctly	Winston Chen	2015-01-28	1	-0/+19
*	[SPARK-5097][SQL] DataFrame	Reynold Xin	2015-01-27	3	-336/+793
*	[SPARK-5063] More helpful error messages for several invalid operations	Josh Rosen	2015-01-23	2	-0/+19
*	[SPARK-4749] [mllib]: Allow initializing KMeans clusters using a seed	nate.crosswhite	2015-01-21	2	-3/+18
*	SPARK-5270 [CORE] Provide isEmpty() function in RDD API	Sean Owen	2015-01-19	1	-0/+12
*	[SPARK-5193][SQL] Remove Spark SQL Java-specific API.	Reynold Xin	2015-01-16	1	-36/+12
*	[SPARK-5274][SQL] Reconcile Java and Scala UDFRegistration.	Reynold Xin	2015-01-15	1	-8/+8
*	[SPARK-5224] [PySpark] improve performance of parallelize list/ndarray	Davies Liu	2015-01-15	2	-1/+5
*	[SPARK-2909] [MLlib] [PySpark] SparseVector in pyspark now supports indexing	MechCoder	2015-01-14	2	-0/+29
*	[SPARK-5223] [MLlib] [PySpark] fix MapConverter and ListConverter in MLlib	Davies Liu	2015-01-13	1	-4/+2
*	[SPARK-5138][SQL] Ensure schema can be inferred from a namedtuple	Gabe Mulley	2015-01-12	1	-4/+14
*	[SPARK-4891][PySpark][MLlib] Add gamma/log normal/exp dist sampling to P...	RJ Nowling	2015-01-08	1	-0/+187
*	[SPARK-5089][PYSPARK][MLLIB] Fix vector convert	freeman	2015-01-05	2	-1/+11
*	[SPARK-3325][Streaming] Add a parameter to the method print in class DStream	Yadong Qi	2015-01-02	1	-5/+7
*	[SPARK-4501][Core] - Create build/mvn to automatically download maven/zinc/sc...	Brennon York	2014-12-27	1	-1/+1
*	[SPARK-4860][pyspark][sql] speeding up `sample()` and `takeSample()`	jbencook	2014-12-23	1	-0/+28
*	[SPARK-4822] Use sphinx tags for Python doc annotations	lewuathe	2014-12-17	6	-17/+17
*	[SPARK-4821] [mllib] [python] [docs] Fix for pyspark.mllib.rand doc	Joseph K. Bradley	2014-12-17	3	-30/+5
*	[SPARK-4866] support StructType as key in MapType	Davies Liu	2014-12-16	2	-7/+18
*	[SPARK-4855][mllib] testing the Chi-squared hypothesis test	jbencook	2014-12-16	1	-1/+99
*	[SPARK-4841] fix zip with textFile()	Davies Liu	2014-12-15	3	-14/+26
*	[SPARK-4494][mllib] IDFModel.transform() add support for single vector	Yuu ISHIKAWA	2014-12-15	1	-7/+15