spark - Mirror of Apache Spark

	Commit message	Author	Age	Files	Lines
*	[SPARK-4586][MLLIB] Python API for ML pipeline and parameters	Xiangrui Meng	2015-01-28	16	-16/+1124
*	[SPARK-4387][PySpark] Refactoring python profiling code to make it extensible	Yandu Oppacher	2015-01-28	8	-71/+232
*	[SPARK-5440][pyspark] Add toLocalIterator to pyspark rdd	Michael Nazario	2015-01-28	1	-0/+14
*	SPARK-5458. Refer to aggregateByKey instead of combineByKey in docs	Sandy Ryza	2015-01-28	1	-2/+2
*	[SPARK-5361]Multiple Java RDD <-> Python RDD conversions not working correctly	Winston Chen	2015-01-28	1	-0/+19
*	[SPARK-5097][SQL] DataFrame	Reynold Xin	2015-01-27	3	-336/+793
*	[SPARK-5063] More helpful error messages for several invalid operations	Josh Rosen	2015-01-23	2	-0/+19
*	[SPARK-4749] [mllib]: Allow initializing KMeans clusters using a seed	nate.crosswhite	2015-01-21	2	-3/+18
*	SPARK-5270 [CORE] Provide isEmpty() function in RDD API	Sean Owen	2015-01-19	1	-0/+12
*	[SPARK-5193][SQL] Remove Spark SQL Java-specific API.	Reynold Xin	2015-01-16	1	-36/+12
*	[SPARK-5274][SQL] Reconcile Java and Scala UDFRegistration.	Reynold Xin	2015-01-15	1	-8/+8
*	[SPARK-5224] [PySpark] improve performance of parallelize list/ndarray	Davies Liu	2015-01-15	2	-1/+5
*	[SPARK-2909] [MLlib] [PySpark] SparseVector in pyspark now supports indexing	MechCoder	2015-01-14	2	-0/+29
*	[SPARK-5223] [MLlib] [PySpark] fix MapConverter and ListConverter in MLlib	Davies Liu	2015-01-13	1	-4/+2
*	[SPARK-5138][SQL] Ensure schema can be inferred from a namedtuple	Gabe Mulley	2015-01-12	1	-4/+14
*	[SPARK-4891][PySpark][MLlib] Add gamma/log normal/exp dist sampling to P...	RJ Nowling	2015-01-08	1	-0/+187
*	[SPARK-5089][PYSPARK][MLLIB] Fix vector convert	freeman	2015-01-05	2	-1/+11
*	[SPARK-3325][Streaming] Add a parameter to the method print in class DStream	Yadong Qi	2015-01-02	1	-5/+7
*	[SPARK-4501][Core] - Create build/mvn to automatically download maven/zinc/sc...	Brennon York	2014-12-27	1	-1/+1
*	[SPARK-4860][pyspark][sql] speeding up `sample()` and `takeSample()`	jbencook	2014-12-23	1	-0/+28
*	[SPARK-4822] Use sphinx tags for Python doc annotations	lewuathe	2014-12-17	6	-17/+17
*	[SPARK-4821] [mllib] [python] [docs] Fix for pyspark.mllib.rand doc	Joseph K. Bradley	2014-12-17	3	-30/+5
*	[SPARK-4866] support StructType as key in MapType	Davies Liu	2014-12-16	2	-7/+18
*	[SPARK-4855][mllib] testing the Chi-squared hypothesis test	jbencook	2014-12-16	1	-1/+99
*	[SPARK-4841] fix zip with textFile()	Davies Liu	2014-12-15	3	-14/+26
*	[SPARK-4494][mllib] IDFModel.transform() add support for single vector	Yuu ISHIKAWA	2014-12-15	1	-7/+15
*	[SPARK-4580] [SPARK-4610] [mllib] [docs] Documentation for tree ensembles + D...	Joseph K. Bradley	2014-12-04	1	-3/+3
*	[SPARK-4548] [SPARK-4517] improve performance of python broadcast	Davies Liu	2014-11-24	5	-233/+80
*	[SPARK-4578] fix asDict() with nested Row()	Davies Liu	2014-11-24	2	-4/+5
*	[SPARK-4562] [MLlib] speedup vector	Davies Liu	2014-11-24	2	-26/+53
*	[SPARK-4531] [MLlib] cache serialized java object	Davies Liu	2014-11-21	4	-13/+8
*	[SPARK-4477] [PySpark] remove numpy from RDDSampler	Davies Liu	2014-11-20	2	-69/+40
*	[SPARK-4439] [MLlib] add python api for random forest	Davies Liu	2014-11-20	2	-23/+221
*	[SPARK-4228][SQL] SchemaRDD to JSON	Dan McClary	2014-11-20	1	-1/+16
*	[SPARK-4384] [PySpark] improve sort spilling	Davies Liu	2014-11-19	1	-1/+10
*	[DOC][PySpark][Streaming] Fix docstring for sphinx	Ken Takagiwa	2014-11-19	1	-2/+2
*	[SPARK-4327] [PySpark] Python API for RDD.randomSplit()	Davies Liu	2014-11-18	2	-3/+41
*	[SPARK-3721] [PySpark] broadcast objects larger than 2G	Davies Liu	2014-11-18	6	-17/+239
*	[SPARK-4306] [MLlib] Python API for LogisticRegressionWithLBFGS	Davies Liu	2014-11-18	1	-4/+53
*	[SPARK-4396] allow lookup by index in Python's Rating	Xiangrui Meng	2014-11-18	1	-11/+15
*	[SPARK-4435] [MLlib] [PySpark] improve classification	Davies Liu	2014-11-18	1	-29/+106
*	[SPARK-4415] [PySpark] JVM should exit after Python exit	Davies Liu	2014-11-14	1	-1/+3
*	[SPARK-4398][PySpark] specialize sc.parallelize(xrange)	Xiangrui Meng	2014-11-14	1	-4/+21
*	[SPARK-4372][MLLIB] Make LR and SVM's default parameters consistent in Scala ...	Xiangrui Meng	2014-11-13	2	-35/+37
*	[SPARK-4348] [PySpark] [MLlib] rename random.py to rand.py	Davies Liu	2014-11-13	6	-20/+38
*	[SPARK-4369] [MLLib] fix TreeModel.predict() with RDD	Davies Liu	2014-11-12	1	-12/+14
*	[SPARK-4324] [PySpark] [MLlib] support numpy.array for all MLlib API	Davies Liu	2014-11-10	7	-32/+105
*	[MLLIB] [PYTHON] SPARK-4221: Expose nonnegative ALS in the python API	Michelangelo D'Agostino	2014-11-07	1	-15/+25
*	[SPARK-4304] [PySpark] Fix sort on empty RDD	Davies Liu	2014-11-07	2	-0/+5
*	[SPARK-4186] add binaryFiles and binaryRecords in Python	Davies Liu	2014-11-06	2	-1/+50