aboutsummaryrefslogtreecommitdiff
path: root/python
Commit message (Expand)AuthorAgeFilesLines
...
* SPARK-4022 [CORE] [MLLIB] Replace colt dependency (LGPL) with commons-mathSean Owen2014-10-271-1/+1
* [SPARK-4088] [PySpark] Python worker should exit after socket is closed by JVMDavies Liu2014-10-251-5/+7
* [SPARK-4051] [SQL] [PySpark] Convert Row into dictionaryDavies Liu2014-10-242-0/+21
* [SPARK-2652] [PySpark] donot use KyroSerializer as default serializerDavies Liu2014-10-231-1/+0
* [SPARK-3993] [PySpark] fix bug while reuse worker after take()Davies Liu2014-10-234-4/+32
* Fix for sampling error in NumPy v1.9 [SPARK-3995][PYSPARK]freeman2014-10-222-2/+8
* SPARK-3770: Make userFeatures accessible from pythonMichelangelo D'Agostino2014-10-211-0/+31
* replace awaitTransformation with awaitTermination in scaladoc/javadocHolden Karau2014-10-211-1/+1
* [SPARK-4023] [MLlib] [PySpark] convert rdd into RDD of VectorDavies Liu2014-10-212-4/+24
* [SPARK-3207][MLLIB]Choose splits for continuous features in DecisionTree more...Qiping Li2014-10-201-2/+2
* [SPARK-3952] [Streaming] [PySpark] add Python examples in Streaming Programmi...Davies Liu2014-10-182-3/+15
* [SPARK-3855][SQL] Preserve the result attribute of python UDFs though transfo...Michael Armbrust2014-10-171-0/+6
* [SPARK-3971] [MLLib] [PySpark] hotfix: Customized pickler should work in clus...Davies Liu2014-10-1611-25/+37
* [SPARK-3943] Some scripts bin\*.cmd pollutes environment variables in WindowsMasayoshi TSUZUKI2014-10-142-239/+246
* [SPARK-3946] gitignore in /python includes wrong directoryMasayoshi TSUZUKI2014-10-141-1/+1
* [Spark] RDD take() method: overestimate too muchyingjieMiao2014-10-131-1/+4
* Add echo "Run streaming tests ..."Ken Takagiwa2014-10-121-0/+1
* [SPARK-2377] Python API for Streaminggiwa2014-10-1211-6/+1658
* [SPARK-3909][PySpark][Doc] A corrupted format in Sphinx documents and buildin...cocoatomo2014-10-114-7/+9
* [SPARK-3867][PySpark] ./python/run-tests failed when it run with Python 2.6 a...cocoatomo2014-10-112-2/+10
* [SPARK-3886] [PySpark] use AutoBatchedSerializer by defaultDavies Liu2014-10-102-6/+9
* [SPARK-3868][PySpark] Hard to recognize which module is tested from unit-test...cocoatomo2014-10-091-3/+4
* [SPARK-3713][SQL] Uses JSON to serialize DataType objectsCheng Lian2014-10-081-78/+75
* [SPARK-3412] [PySpark] Replace Epydoc with Sphinx to generate Python API docsDavies Liu2014-10-0712-193/+150
* [SPARK-3486][MLlib][PySpark] PySpark support for Word2VecLiquan Pei2014-10-073-0/+202
* [SPARK-3773][PySpark][Doc] Sphinx build warningcocoatomo2014-10-066-23/+28
* [SPARK-3786] [PySpark] speedup testsDavies Liu2014-10-064-91/+82
* [SPARK-2461] [PySpark] Add a toString method to GeneralizedLinearModelSandy Ryza2014-10-063-4/+7
* [SPARK-3749] [PySpark] fix bugs in broadcast large closure of RDDDavies Liu2014-10-013-6/+16
* [SPARK-3751] [mllib] DecisionTree: example update + print optionsJoseph K. Bradley2014-10-011-2/+8
* [SPARK-3478] [PySpark] Profile the Python tasksDavies Liu2014-09-306-7/+108
* [SPARK-3701][MLLIB] update python linalg api and small fixesXiangrui Meng2014-09-301-29/+121
* [SPARK-3681] [SQL] [PySpark] fix serialization of List and Map in SchemaRDDDavies Liu2014-09-272-27/+34
* Revert "[SPARK-3478] [PySpark] Profile the Python tasks"Josh Rosen2014-09-266-108/+7
* [SPARK-3478] [PySpark] Profile the Python tasksDavies Liu2014-09-266-7/+108
* [SPARK-546] Add full outer join to RDD and DStream.Aaron Staple2014-09-242-2/+39
* [SPARK-3679] [PySpark] pickle the exact globals of functionsDavies Liu2014-09-242-6/+54
* [SPARK-3634] [PySpark] User's module should take precedence over system modulesDavies Liu2014-09-244-8/+48
* [PySpark] remove unnecessary use of numSlices from pyspark testsMatthew Farrellee2014-09-201-2/+2
* [SPARK-3592] [SQL] [PySpark] support applySchema to RDD of RowDavies Liu2014-09-192-4/+20
* [SPARK-3491] [MLlib] [PySpark] use pickle to serialize data in MLlibDavies Liu2014-09-1916-917/+650
* [SPARK-3554] [PySpark] use broadcast automatically for large closureDavies Liu2014-09-184-3/+19
* [SPARK-3430] [PySpark] [Doc] generate PySpark API docs using SphinxDavies Liu2014-09-1613-5/+944
* [SPARK-2314][SQL] Override collect and take in python library, and count in j...Aaron Staple2014-09-161-5/+42
* [SPARK-3519] add distinct(n) to PySparkMatthew Farrellee2014-09-163-4/+24
* [SPARK-1087] Move python traceback utilities into new traceback_utils.py file.Aaron Staple2014-09-153-61/+83
* [SPARK-2951] [PySpark] support unpickle array.array for Python 2.6Davies Liu2014-09-152-2/+1
* [SPARK-3516] [mllib] DecisionTree: Add minInstancesPerNode, minInfoGain param...qiping.lqp2014-09-151-4/+12
* [SPARK-3463] [PySpark] aggregate and show spilled bytes in PythonDavies Liu2014-09-133-14/+34
* [SPARK-3030] [PySpark] Reuse Python workerDavies Liu2014-09-135-28/+70