Mirror of Apache Spark
path: root/python
| Commit message | Author | Date | Files | Lines (-/+) |
|---|---|---|---|---|
| Move some classes to more appropriate packages: | Matei Zaharia | 2013-09-01 | 1 | -2/+2 |
| Add banner to PySpark and make wordcount output nicer | Matei Zaharia | 2013-09-01 | 2 | -1/+14 |
| Initial work to rename package to org.apache.spark | Matei Zaharia | 2013-09-01 | 3 | -5/+5 |
| Merge pull request #861 from AndreSchumacher/pyspark_sampling_function | Matei Zaharia | 2013-08-31 | 2 | -7/+167 |
| RDD sample() and takeSample() prototypes for PySpark | Andre Schumacher | 2013-08-28 | 2 | -7/+167 |
| Merge pull request #870 from JoshRosen/spark-885 | Matei Zaharia | 2013-08-31 | 1 | -1/+5 |
| Don't send SIGINT to Py4J gateway subprocess. | Josh Rosen | 2013-08-28 | 1 | -1/+5 |
| Merge pull request #869 from AndreSchumacher/subtract | Matei Zaharia | 2013-08-30 | 1 | -0/+37 |
| PySpark: implementing subtractByKey(), subtract() and keyBy() | Andre Schumacher | 2013-08-28 | 1 | -0/+37 |
| Fix PySpark for assembly run and include it in dist | Matei Zaharia | 2013-08-29 | 1 | -0/+0 |
| Change build and run instructions to use assemblies | Matei Zaharia | 2013-08-29 | 1 | -1/+1 |
| Implementing SPARK-838: Add DoubleRDDFunctions methods to PySpark | Andre Schumacher | 2013-08-21 | 2 | -1/+168 |
| Implementing SPARK-878 for PySpark: adding zip and egg files to context and p... | Andre Schumacher | 2013-08-16 | 5 | -5/+37 |
| Fix PySpark unit tests on Python 2.6. | Josh Rosen | 2013-08-14 | 2 | -19/+20 |
| Merge pull request #802 from stayhf/SPARK-760-Python | Matei Zaharia | 2013-08-12 | 1 | -0/+70 |
| Code update for Matei's suggestions | stayhf | 2013-08-11 | 1 | -7/+9 |
| Simple PageRank algorithm implementation in Python for SPARK-760 | stayhf | 2013-08-10 | 1 | -0/+68 |
| Merge pull request #813 from AndreSchumacher/add_files_pyspark | Matei Zaharia | 2013-08-12 | 1 | -1/+6 |
| Implementing SPARK-865: Add the equivalent of ADD_JARS to PySpark | Andre Schumacher | 2013-08-12 | 1 | -1/+6 |
| Merge pull request #747 from mateiz/improved-lr | Matei Zaharia | 2013-08-06 | 1 | -27/+26 |
| Fix string parsing and style in LR | Matei Zaharia | 2013-07-31 | 1 | -1/+1 |
| Update the Python logistic regression example to read from a file and | Matei Zaharia | 2013-07-29 | 1 | -27/+26 |
| Do not inherit master's PYTHONPATH on workers. | Josh Rosen | 2013-07-29 | 1 | -3/+2 |
| Merge branch 'master' of github.com:mesos/spark | Matei Zaharia | 2013-07-29 | 6 | -15/+9 |
| Some fixes to Python examples (style and package name for LR) | Matei Zaharia | 2013-07-27 | 6 | -15/+9 |
| SPARK-815. Python parallelize() should split lists before batching | Matei Zaharia | 2013-07-29 | 1 | -2/+9 |
| Use None instead of empty string as it's slightly smaller/faster | Matei Zaharia | 2013-07-29 | 1 | -1/+1 |
| Allow python/run-tests to run from any directory | Matei Zaharia | 2013-07-29 | 1 | -0/+3 |
| Optimize Python foreach() to not return as many objects | Matei Zaharia | 2013-07-29 | 1 | -1/+5 |
| Optimize Python take() to not compute entire first partition | Matei Zaharia | 2013-07-29 | 1 | -6/+9 |
| Add Apache license headers and LICENSE and NOTICE files | Matei Zaharia | 2013-07-16 | 19 | -1/+325 |
| Fixed PySpark perf regression by not using socket.makefile(), and improved | root | 2013-07-01 | 1 | -18/+24 |
| Fix reporting of PySpark exceptions | Jey Kottalam | 2013-06-21 | 2 | -5/+19 |
| PySpark daemon: fix deadlock, improve error handling | Jey Kottalam | 2013-06-21 | 1 | -17/+50 |
| Add tests and fixes for Python daemon shutdown | Jey Kottalam | 2013-06-21 | 3 | -22/+69 |
| Prefork Python worker processes | Jey Kottalam | 2013-06-21 | 2 | -32/+138 |
| Add Python timing instrumentation | Jey Kottalam | 2013-06-21 | 2 | -1/+19 |
| Fix Python saveAsTextFile doctest to not expect order to be preserved | Jey Kottalam | 2013-04-02 | 1 | -1/+1 |
| Fix argv handling in Python transitive closure example | Jey Kottalam | 2013-04-02 | 1 | -1/+1 |
| Change numSplits to numPartitions in PySpark. | Josh Rosen | 2013-02-24 | 2 | -38/+38 |
| Add commutative requirement for 'reduce' to Python docstring. | Mark Hamstra | 2013-02-09 | 1 | -2/+2 |
| Remove unnecessary doctest __main__ methods. | Josh Rosen | 2013-02-03 | 2 | -18/+0 |
| Fetch fewer objects in PySpark's take() method. | Josh Rosen | 2013-02-03 | 1 | -0/+4 |
| Fix reporting of PySpark doctest failures. | Josh Rosen | 2013-02-03 | 2 | -2/+6 |
| Use spark.local.dir for PySpark temp files (SPARK-580). | Josh Rosen | 2013-02-01 | 2 | -10/+9 |
| Do not launch JavaGateways on workers (SPARK-674). | Josh Rosen | 2013-02-01 | 4 | -18/+25 |
| Fix stdout redirection in PySpark. | Josh Rosen | 2013-02-01 | 2 | -2/+12 |
| SPARK-673: Capture and re-throw Python exceptions | Patrick Wendell | 2013-01-31 | 1 | -2/+8 |
| Merge pull request #430 from pwendell/pyspark-guide | Matei Zaharia | 2013-01-30 | 1 | -0/+1 |
| Make module help available in python shell. | Patrick Wendell | 2013-01-30 | 1 | -0/+1 |