path: root/docs/python-programming-guide.md
* [SPARK-1439, SPARK-1440] Generate unified Scaladoc across projects and Javadocs (Matei Zaharia, 2014-04-21; 1 file changed, -2/+2)

  I used the sbt-unidoc plugin (https://github.com/sbt/sbt-unidoc) to create a unified Scaladoc of our public packages, and to generate Javadocs as well. One limitation is that I haven't found an easy way to exclude packages from the Javadoc; there is an SBT task that identifies Java sources to run javadoc on, but it's been very difficult to modify it from outside to change what is set in the unidoc package. Some SBT-savvy people should help with this. The Javadoc site also lacks package-level descriptions and things like that, so we may want to look into that. We may decide not to post these right now if it's too limited compared to the Scala one.

  Example of the built doc site: http://people.csail.mit.edu/matei/spark-unified-docs/

  Author: Matei Zaharia <matei@databricks.com>

  This patch had conflicts when merged, resolved by
  Committer: Patrick Wendell <pwendell@gmail.com>

  Closes #457 from mateiz/better-docs and squashes the following commits:
  a63d4a3 [Matei Zaharia] Skip Java/Scala API docs for Python package
  5ea1f43 [Matei Zaharia] Fix links to Java classes in Java guide, fix some JS for scrolling to anchors on page load
  f05abc0 [Matei Zaharia] Don't include java.lang package names
  995e992 [Matei Zaharia] Skip internal packages and class names with $ in JavaDoc
  a14a93c [Matei Zaharia] typo
  76ce64d [Matei Zaharia] Add groups to Javadoc index page, and a first package-info.java
  ed6f994 [Matei Zaharia] Generate JavaDoc as well, add titles, update doc site to use unified docs
  acb993d [Matei Zaharia] Add Unidoc plugin for the projects we want Unidoced
* SPARK-1426: Make MLlib work with NumPy versions older than 1.7 (Sandeep, 2014-04-15; 1 file changed, -3/+3)

  Currently it requires NumPy 1.7, due to using the copyto method (http://docs.scipy.org/doc/numpy/reference/generated/numpy.copyto.html) for extracting data out of an array. Replace it with a fallback.

  Author: Sandeep <sandeep@techaddict.me>

  Closes #391 from techaddict/1426 and squashes the following commits:
  d365962 [Sandeep] SPARK-1426: Make MLlib work with NumPy versions older than 1.7
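  The patch itself isn't reproduced in this log, but a fallback of the kind it describes might look like the following sketch (the helper name `copy_array` is illustrative, not from the commit):

  ```python
  import numpy as np

  def copy_array(dst, src):
      """Copy src into dst on any NumPy version."""
      if hasattr(np, "copyto"):
          np.copyto(dst, src)   # np.copyto was added in NumPy 1.7
      else:
          dst[...] = src        # slice assignment works on older releases

  dst = np.zeros(4)
  copy_array(dst, np.array([1.0, 2.0, 3.0, 4.0]))
  print(dst)  # [ 1.  2.  3.  4.]
  ```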
* SPARK-1099: Introduce local[*] mode to infer number of cores (Aaron Davidson, 2014-04-07; 1 file changed, -3/+4)

  This is the default mode for running spark-shell and pyspark, intended to let users running Spark for the first time see the performance benefits of using multiple cores, while not breaking backwards compatibility for users who use "local" mode and expect exactly one core.

  Author: Aaron Davidson <aaron@databricks.com>

  Closes #182 from aarondav/110 and squashes the following commits:
  a88294c [Aaron Davidson] Rebased changes for new spark-shell
  a9f393e [Aaron Davidson] SPARK-1099: Introduce local[*] mode to infer number of cores
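  For illustration (the app name is arbitrary), requesting this mode explicitly from PySpark looks like:

  ```python
  from pyspark import SparkContext

  # "local[*]" uses as many worker threads as the machine has cores;
  # plain "local" still means exactly one.
  sc = SparkContext("local[*]", "CoreInferenceDemo")
  print(sc.defaultParallelism)  # typically equals the core count
  sc.stop()
  ```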
* SPARK-1421. Make MLlib work on Python 2.6 (Matei Zaharia, 2014-04-05; 1 file changed, -1/+1)

  The reason it wasn't working was that we passed a bytearray to stream.write(), which is not supported in Python 2.6 but is in 2.7. (This array came from NumPy when we converted data to send it over to Java.) Now we just convert those bytearrays to strings of bytes, which preserves nonprintable characters as well.

  Author: Matei Zaharia <matei@databricks.com>

  Closes #335 from mateiz/mllib-python-2.6 and squashes the following commits:
  f26c59f [Matei Zaharia] Update docs to no longer say we need Python 2.7
  a84d6af [Matei Zaharia] SPARK-1421. Make MLlib work on Python 2.6
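  A minimal Python 2 sketch of the workaround described (assumed shape, not the actual patch; the file path is made up):

  ```python
  # Python 2 only: file.write() in 2.6 rejects bytearray, so convert first.
  data = bytearray(b"\x00\x01\xfe\xff")   # e.g. raw bytes handed over by NumPy
  with open("/tmp/payload.bin", "wb") as stream:
      stream.write(str(data))  # on Python 2, str(bytearray) is the same raw bytes
  ```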
* SPARK-1183. Don't use "worker" to mean executor (Sandy Ryza, 2014-03-13; 1 file changed, -3/+3)

  Author: Sandy Ryza <sandy@cloudera.com>

  Closes #120 from sryza/sandy-spark-1183 and squashes the following commits:
  5066a4a [Sandy Ryza] Remove "worker" in a couple comments
  0bd1e46 [Sandy Ryza] Remove --am-class from usage
  bfc8fe0 [Sandy Ryza] Remove am-class from doc and fix yarn-alpha
  607539f [Sandy Ryza] Address review comments
  74d087a [Sandy Ryza] SPARK-1183. Don't use "worker" to mean executor
* Updated link for pyspark examples in docs (Jyotiska NK, 2014-02-26; 1 file changed, -1/+1)

  Author: Jyotiska NK <jyotiska123@gmail.com>

  Closes #22 from jyotiska/pyspark_docs and squashes the following commits:
  426136c [Jyotiska NK] Updated link for pyspark examples
* Clarify that Python 2.7 is only needed for MLlib (Matei Zaharia, 2014-01-15; 1 file changed, -2/+2)
* Update Python required version to 2.7, and mention MLlib support (Matei Zaharia, 2014-01-12; 1 file changed, -1/+7)
* Simplify and fix pyspark script. (Patrick Wendell, 2014-01-07; 1 file changed, -2/+3)

  This patch removes compatibility for IPython < 1.0 but fixes the launch script and makes it much simpler. I tested this using the three commands on the PySpark documentation page:

  1. IPYTHON=1 ./pyspark
  2. IPYTHON_OPTS="notebook" ./pyspark
  3. IPYTHON_OPTS="notebook --pylab inline" ./pyspark

  There are two changes:
  - We rely on the PYTHONSTARTUP env var to start PySpark.
  - Removed the quotes around $IPYTHON_OPTS; having quotes gloms them together into a single argument passed to `exec`, which seemed to cause IPython to fail (it instead expects them as multiple arguments).
* Code review feedback (Holden Karau, 2014-01-05; 1 file changed, -1/+1)
* Merge remote-tracking branch 'apache-github/master' into remove-binaries (Patrick Wendell, 2014-01-03; 1 file changed, -14/+14)

  Conflicts:
  core/src/test/scala/org/apache/spark/DriverSuite.scala
  docs/python-programming-guide.md
* pyspark -> bin/pyspark (Prashant Sharma, 2014-01-02; 1 file changed, -14/+14)
* Merge branch 'master' into spark-1002-remove-jars (Prashant Sharma, 2014-01-03; 1 file changed, -7/+8)
* Updated docs for SparkConf and handled review comments (Matei Zaharia, 2013-12-30; 1 file changed, -7/+8)
* Removed sbt folder and changed docs accordingly (Prashant Sharma, 2014-01-02; 1 file changed, -1/+1)
* Add notes to python documentation about using SparkContext.setSystemProperty. (Ewen Cheslack-Postava, 2013-10-22; 1 file changed, -0/+11)
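  The added note isn't reproduced in this log, but SparkContext.setSystemProperty is the PySpark classmethod for setting Java system properties before a context exists; a small sketch (the property value and app name are arbitrary):

  ```python
  from pyspark import SparkContext

  # System properties must be set before the SparkContext is created.
  SparkContext.setSystemProperty("spark.executor.memory", "2g")

  sc = SparkContext("local", "SetSystemPropertyDemo")
  # When running against a cluster, executors for this application
  # would pick up the property set above.
  sc.stop()
  ```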
* Fix PySpark docs and an overly long line of code after fdbae41e (Matei Zaharia, 2013-10-09; 1 file changed, -1/+1)
* Update Python API features (Matei Zaharia, 2013-09-10; 1 file changed, -1/+1)
* More fair scheduler docs and property names. (Matei Zaharia, 2013-09-08; 1 file changed, -6/+6)

  Also changed uses of "job" terminology to "application" when they referred to an entire Spark program, to avoid confusion.
* Doc improvements (Matei Zaharia, 2013-09-01; 1 file changed, -18/+18)
* Fix more URLs in docs (Matei Zaharia, 2013-09-01; 1 file changed, -2/+5)
* More updates, describing changes to recommended use of environment vars and new Python stuff (Matei Zaharia, 2013-08-31; 1 file changed, -2/+2)
* Update some build instructions, because only sbt assembly and mvn package are now needed (Matei Zaharia, 2013-08-29; 1 file changed, -1/+1)
* Add docs about ipython (Matei Zaharia, 2013-07-29; 1 file changed, -3/+31)
* Clarify that PySpark is not supported on Windows (root, 2013-07-01; 1 file changed, -3/+2)
* Simplify Python docs a little to do substring search (Matei Zaharia, 2013-06-26; 1 file changed, -4/+3)
* Some tweaks to docs (Matei Zaharia, 2013-02-25; 1 file changed, -2/+2)
* Added checkpointing and fault-tolerance semantics to the programming guide. Fixed the default checkpoint interval to be a multiple of the slide duration. Fixed visibility of some classes and objects to clean up docs. (Tathagata Das, 2013-02-18; 1 file changed, -1/+1)
* Make module help available in python shell. Also adds a line in the doc explaining how to use it. (Patrick Wendell, 2013-01-30; 1 file changed, -0/+1)
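  As a brief illustration of what this enables (assumed usage, not the doc's exact wording), module help works from the PySpark shell as for any Python module:

  ```python
  # Inside the PySpark shell, the pyspark module is importable directly.
  import pyspark
  help(pyspark)          # module-level help
  help(pyspark.RDD.map)  # help on an individual method
  ```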
* Include packaging and launching pyspark in guide. It's nicer if all the commands you need are made explicit. (Patrick Wendell, 2013-01-30; 1 file changed, -2/+8)
* Fix Python guide to say accumulators are available (Matei Zaharia, 2013-01-20; 1 file changed, -1/+0)
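  PySpark accumulators work roughly as in the sketch below (a hedged example, not taken from the guide itself):

  ```python
  from pyspark import SparkContext

  sc = SparkContext("local", "AccumulatorDemo")
  total = sc.accumulator(0)  # a shared, add-only counter

  # Tasks add to the accumulator; only the driver can read its value.
  sc.parallelize([1, 2, 3, 4]).foreach(lambda x: total.add(x))
  print(total.value)  # 10
  sc.stop()
  ```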
* Add mapPartitionsWithSplit() to PySpark. (Josh Rosen, 2013-01-08; 1 file changed, -1/+0)
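  mapPartitionsWithSplit() passes each partition's index (its "split") along with an iterator over its elements; it was later deprecated in favor of mapPartitionsWithIndex(). A sketch of its use (element values and partition count chosen for illustration):

  ```python
  from pyspark import SparkContext

  sc = SparkContext("local[2]", "MapPartitionsWithSplitDemo")

  def tag_with_split(split_index, iterator):
      # Pair every element with the index of the partition it lives in.
      for x in iterator:
          yield (split_index, x)

  rdd = sc.parallelize(range(4), 2)
  print(rdd.mapPartitionsWithSplit(tag_with_split).collect())
  # e.g. [(0, 0), (0, 1), (1, 2), (1, 3)]
  sc.stop()
  ```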
* Add `pyspark` script to replace the other scripts. Expand the PySpark programming guide. (Josh Rosen, 2013-01-01; 1 file changed, -5/+44)
* Minor documentation and style fixes for PySpark. (Josh Rosen, 2013-01-01; 1 file changed, -2/+1)
* Add documentation for Python API. (Josh Rosen, 2012-12-28; 1 file changed, -0/+74)