aboutsummaryrefslogtreecommitdiff
path: root/python/pyspark/rdd.py
Commit message (Expand)AuthorAgeFilesLines
...
* [SPARK-2010] [PySpark] [SQL] support nested structure in SchemaRDDDavies Liu2014-08-011-4/+4
* [SPARK-2024] Add saveAsSequenceFile to PySparkKan Zhang2014-07-301-0/+114
* [SPARK-2601] [PySpark] Fix Py4J error when transforming pickleFilesJosh Rosen2014-07-261-3/+1
* [SPARK-2656] Python version of stratified samplingDoris Xin2014-07-241-2/+23
* [SPARK-2538] [PySpark] Hash based disk spilling aggregationDavies Liu2014-07-241-21/+71
* [SPARK-2014] Make PySpark store RDDs in MEMORY_ONLY_SER with compression by d...Prashant Sharma2014-07-241-2/+2
* [SPARK-2494] [PySpark] make hash of None consistant cross machinesDavies Liu2014-07-211-3/+32
* Made rdd.py pep8 complaint by using Autopep8 and a little manual editing.Prashant Sharma2014-07-141-58/+92
* [SPARK-2061] Made splits deprecated in JavaRDDLikeAnant2014-06-201-2/+2
* SPARK-1868: Users should be allowed to cogroup at least 4 RDDsAllan Douglas R. de Oliveira2014-06-201-7/+15
* SPARK-2203: PySpark defaults to use same num reduce partitions as map sideAaron Davidson2014-06-201-3/+18
* SPARK-2146. Fix takeOrdered docSandy Ryza2014-06-171-1/+1
* SPARK-1063 Add .sortBy(f) method on RDDAndrew Ash2014-06-171-0/+12
* [SPARK-2130] End-user friendly String repr for StorageLevel in PythonKan Zhang2014-06-161-0/+3
* SPARK-1939 Refactor takeSample method in RDD to use ScaSRSDoris Xin2014-06-121-61/+106
* SPARK-554. Add aggregateByKey.Sandy Ryza2014-06-121-1/+18
* fixed typo in docstring for min()Jeff Thompson2014-06-121-1/+1
* [SPARK-1308] Add getNumPartitions to pyspark RDDSyed Hashmi2014-06-091-18/+27
* [SPARK-1161] Add saveAsPickleFile and SparkContext.pickleFile in PythonKan Zhang2014-06-031-8/+25
* [SPARK-1468] Modify the partition function used by partitionBy.Erik Selin2014-06-031-1/+4
* SPARK-1839: PySpark RDD#take() shouldn't always read from driverAaron Davidson2014-05-311-21/+38
* Documentation: Encourage use of reduceByKey instead of groupByKey.Patrick Wendell2014-05-141-0/+4
* [SPARK-1690] Tolerating empty elements when saving Python RDD to text filesKan Zhang2014-05-101-0/+8
* [SPARK-1674] fix interrupted system call error in pyspark's RDD.pipeXiangrui Meng2014-04-291-3/+3
* SPARK-1242 Add aggregate to python rddHolden Karau2014-04-241-2/+29
* SPARK-1438 RDD.sample() make seed param optionalArun Ramakrishnan2014-04-241-7/+6
* Spark 1271: Co-Group and Group-By should pass Iterable[X]Holden Karau2014-04-081-5/+5
* SPARK-1305: Support persisting RDD's directly to TachyonHaoyuan Li2014-04-041-1/+2
* Spark 1162 Implemented takeOrdered in pyspark.Prashant Sharma2014-04-031-5/+102
* SPARK-1322, top in pyspark should sort result in descending order.Prashant Sharma2014-03-261-3/+3
* Added doctest for map function in rdd.pyJyotiska NK2014-03-191-0/+4
* Spark 1246 add min max to stat counterDan McClary2014-03-181-0/+19
* SPARK-1240: handle the case of empty RDD when takeSampleCodingCat2014-03-161-0/+4
* SPARK-1162 Added top in python.Prashant Sharma2014-03-121-0/+25
* Spark-1163, Added missing Python RDD functionsprabinb2014-03-111-0/+42
* SPARK-1168, Added foldByKey to pyspark.Prashant Sharma2014-03-101-0/+14
* [SPARK-972] Added detailed callsite info for ValueError in context.py (resubm...jyotiska2014-03-101-7/+14
* SPARK-977 Added Python RDD.zip functionPrabin Banka2014-03-101-1/+19
* Spark 1165 rdd.intersection in python and javaPrashant Sharma2014-03-071-0/+17
* SPARK-1187, Added missing Python APIsPrabin Banka2014-03-061-0/+7
* SPARK-1109 wrong API docs for pyspark map functionPrashant Sharma2014-03-041-1/+1
* doctest updated for mapValues, flatMapValues in rdd.pyjyotiska2014-02-221-0/+10
* Merge pull request #498 from ScrapCodes/python-api. Closes #498.Prashant Sharma2014-02-061-0/+60
* Deprecate mapPartitionsWithSplit in PySpark.Josh Rosen2014-01-231-4/+21
* Make Python function/line appear in the UI.Tor Myklebust2013-12-281-11/+55
* Merge pull request #276 from shivaram/collectPartitionReynold Xin2013-12-191-1/+6
|\
| * Make collectPartitions take an array of partitionsShivaram Venkataraman2013-12-191-1/+6
| * Add collectPartition to JavaRDD interface.Shivaram Venkataraman2013-12-181-1/+1
* | Add toString to Java RDD, and __repr__ to Python RDDNick Pentreath2013-12-191-0/+3
|/
* Merge branch 'master' into akka-bug-fixPrashant Sharma2013-12-111-1/+4
|\