aboutsummaryrefslogtreecommitdiff
path: root/python
Commit message (Expand)AuthorAgeFilesLines
* [SPARK-2739][SQL] Rename registerAsTable to registerTempTableMichael Armbrust2014-08-021-4/+8
* [SPARK-2797] [SQL] SchemaRDDs don't support unpersist()Yin Huai2014-08-021-2/+2
* [SPARK-2097][SQL] UDF SupportMichael Armbrust2014-08-021-1/+38
* [SPARK-2478] [mllib] DecisionTree Python APIJoseph K. Bradley2014-08-025-18/+291
* [SPARK-2454] Do not ship spark home to WorkersAndrew Or2014-08-021-1/+1
* StatCounter on NumPy arrays [PYSPARK][SPARK-2012]Jeremy Freeman2014-08-012-8/+37
* [SPARK-2550][MLLIB][APACHE SPARK] Support regularization and intercept in pys...Michael Giannakopoulos2014-08-011-4/+28
* [SPARK-2764] Simplify daemon.py process structureJosh Rosen2014-08-011-108/+71
* [SPARK-2010] [PySpark] [SQL] support nested structure in SchemaRDDDavies Liu2014-08-012-347/+919
* [SPARK-2786][mllib] Python correlationsDoris Xin2014-08-012-1/+109
* [SPARK-2724] Python version of RandomRDDGeneratorsDoris Xin2014-07-314-0/+197
* SPARK-2282: Reuse Socket for sending accumulator updates to PysparkAaron Davidson2014-07-311-7/+27
* [SPARK-2397][SQL] Deprecate LocalHiveContextMichael Armbrust2014-07-311-0/+6
* SPARK-2341 [MLLIB] loadLibSVMFile doesn't handle regression datasetsSean Owen2014-07-301-11/+12
* [SPARK-2024] Add saveAsSequenceFile to PySparkKan Zhang2014-07-303-28/+454
* Avoid numerical instabilityNaftali Harris2014-07-301-1/+2
* [SPARK-2179][SQL] Public API for DataTypes and SchemaYin Huai2014-07-301-14/+553
* [SPARK-2305] [PySpark] Update Py4J to version 0.8.2.1Josh Rosen2014-07-292-0/+0
* [SPARK-2674] [SQL] [PySpark] support datetime type for SchemaRDDDavies Liu2014-07-291-10/+12
* [SPARK-791] [PySpark] fix pickle itemgetter with cloudpickleDavies Liu2014-07-292-2/+9
* [SPARK-2580] [PySpark] keep silent in worker if JVM close the socketDavies Liu2014-07-292-8/+19
* [SPARK-1550] [PySpark] Allow SparkContext creation after failed attemptsJosh Rosen2014-07-272-6/+18
* [SPARK-2679] [MLLib] Ser/De for DoubleDoris Xin2014-07-271-3/+45
* [SPARK-2601] [PySpark] Fix Py4J error when transforming pickleFilesJosh Rosen2014-07-262-3/+10
* [SPARK-2652] [PySpark] Turning some default configs for PySparkDavies Liu2014-07-261-1/+12
* [SPARK-1458] [PySpark] Expose sc.version in Java and PySparkJosh Rosen2014-07-261-0/+7
* [SPARK-2656] Python version of stratified samplingDoris Xin2014-07-242-5/+50
* [SPARK-2538] [PySpark] Hash based disk spilling aggregationDavies Liu2014-07-246-23/+597
* [SPARK-2014] Make PySpark store RDDs in MEMORY_ONLY_SER with compression by d...Prashant Sharma2014-07-243-3/+9
* [SPARK-2470] PEP8 fixes to PySparkNicholas Chammas2014-07-2118-97/+127
* [SPARK-2494] [PySpark] make hash of None consistant cross machinesDavies Liu2014-07-211-3/+32
* [SPARK-2552][MLLIB] stabilize logistic function in pysparkXiangrui Meng2014-07-201-1/+4
* follow pep8 None should be compared using is or is notKen Takagiwa2014-07-154-7/+7
* Made rdd.py pep8 complaint by using Autopep8 and a little manual editing.Prashant Sharma2014-07-141-58/+92
* [Minor] Remove unused val in MasterAndrew Or2014-07-111-1/+1
* [SPARK-2376][SQL] Selecting list values inside nested JSON objects raises jav...Yin Huai2014-07-071-10/+14
* [SPARK-1394] Remove SIGCHLD handler in worker subprocessMatthew Farrellee2014-06-281-0/+1
* [SPARK-2242] HOTFIX: pyspark shell hangs on simple jobAndrew Or2014-06-251-8/+13
* [SPARK-2061] Made splits deprecated in JavaRDDLikeAnant2014-06-202-3/+3
* SPARK-1868: Users should be allowed to cogroup at least 4 RDDsAllan Douglas R. de Oliveira2014-06-202-17/+25
* SPARK-2203: PySpark defaults to use same num reduce partitions as map sideAaron Davidson2014-06-201-3/+18
* [SPARK-1466] Raise exception if pyspark Gateway process doesn't start.Kay Ousterhout2014-06-181-4/+11
* [SPARK-2060][SQL] Querying JSON Datasets with SQL and DSL in Spark SQLYin Huai2014-06-171-2/+62
* SPARK-2146. Fix takeOrdered docSandy Ryza2014-06-171-1/+1
* SPARK-1063 Add .sortBy(f) method on RDDAndrew Ash2014-06-171-0/+12
* [SPARK-2130] End-user friendly String repr for StorageLevel in PythonKan Zhang2014-06-162-0/+12
* [SPARK-2010] Support for nested data in PySpark SQLKan Zhang2014-06-161-1/+21
* [SPARK-2079] Support batching when serializing SchemaRDD to PythonKan Zhang2014-06-141-1/+3
* SPARK-1939 Refactor takeSample method in RDD to use ScaSRSDoris Xin2014-06-121-61/+106
* SPARK-554. Add aggregateByKey.Sandy Ryza2014-06-122-1/+33