Current schema inference for local Python collections halts as soon as there are no NullTypes. This differs from specifying a sampling ratio of 1.0 on a distributed collection, and it can result in incomplete schema information.
Author: Holden Karau <holden@us.ibm.com>
Closes #10275 from holdenk/SPARK-12300-fix-schmea-inferance-on-local-collections.
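A minimal sketch of the behavior this fixes (assumes an existing `sqlContext`; the data is illustrative):

```python
# The second field is None until the last row; before the fix, inference on a
# local collection could stop early and leave that field typed as null.
rows = [("a", None), ("b", None), ("c", 3)]
df = sqlContext.createDataFrame(rows)
df.printSchema()  # the second field should now be inferred as long, not null
```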
The semantics of countByValue in the Python API differ from the Scala API; it behaves more like countDistinctValue. This change makes it consistent with the Scala/Java API.
Author: jerryshao <sshao@hortonworks.com>
Closes #10350 from jerryshao/SPARK-12353.
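A sketch of the now-consistent behavior (assumes an existing `lines` DStream; names are illustrative):

```python
# countByValue now emits (value, count) pairs per batch, matching Scala/Java,
# instead of a single count of distinct values.
words = lines.flatMap(lambda line: line.split(" "))
words.countByValue().pprint()  # e.g. ('apple', 3), ('pear', 1)
```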
After reading the JIRA https://issues.apache.org/jira/browse/SPARK-12520, I double-checked the code.
For example, users can do an equi-join like:
```
df.join(df2, 'name', 'outer').select('name', 'height').collect()
```
- There is a bug in 1.4 and 1.5: the code simply ignores the third parameter (the join type) that users pass, so the join performed is always `Inner`, even when the user specifies another type (e.g., `Outer`).
- After PR https://github.com/apache/spark/pull/8600, 1.6 no longer has this issue, but the description had not been updated.
I plan to submit another PR to fix 1.5 and to issue an error message if users specify a non-inner join type when using an equi-join.
Author: gatorsmile <gatorsmile@gmail.com>
Closes #10477 from gatorsmile/pyOuterJoin.
Some methods are missing from the scaler model, such as ways to access the std and mean. This PR brings feature parity to pyspark.mllib.feature.StandardScaler and StandardScalerModel.
Author: Holden Karau <holden@us.ibm.com>
Closes #10298 from holdenk/SPARK-12296-feature-parity-pyspark-mllib-StandardScalerModel.
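A sketch of the newly exposed accessors (assumes an active `sc`; the accessor names follow the Scala model and this PR's description):

```python
from pyspark.mllib.feature import StandardScaler
from pyspark.mllib.linalg import Vectors

data = sc.parallelize([Vectors.dense([1.0, 2.0]), Vectors.dense([3.0, 4.0])])
model = StandardScaler(withMean=True, withStd=True).fit(data)
print(model.mean, model.std)  # accessors added for parity with Scala
```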
Author: pshearer <pshearer@massmutual.com>
Closes #10414 from pshearer/patch-1.
No JIRA was created since this is a trivial change.
davies, please help review it.
Author: Jeff Zhang <zjffdu@apache.org>
Closes #10143 from zjffdu/pyspark_typo.
Added a catch for the Long-to-Int casting exception when PySpark ALS Ratings are serialized. It is easy to accidentally use Long IDs for user/product; previously this failed with a somewhat cryptic "ClassCastException: java.lang.Long cannot be cast to java.lang.Integer." Now a more descriptive error is shown, e.g. "PickleException: Ratings id 1205640308657491975 exceeds max integer value of 2147483647."
Author: Bryan Cutler <bjcutler@us.ibm.com>
Closes #9361 from BryanCutler/als-pyspark-long-id-error-SPARK-10158.
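A sketch that triggers the new error message (assumes an active `sc`; the ID is the one from the example above):

```python
from pyspark.mllib.recommendation import ALS, Rating

# User/product IDs must fit in a 32-bit signed int; a Long like this now fails
# fast with a descriptive PickleException instead of a raw ClassCastException.
ratings = sc.parallelize([Rating(1205640308657491975, 1, 5.0)])
model = ALS.train(ratings, rank=10)
```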
Fix the mistaken doc of the join type for ```dataframe.join```.
Author: Yanbo Liang <ybliang8@gmail.com>
Closes #10378 from yanboliang/leftsemi.
The current default storage level of the Python persist API is MEMORY_ONLY_SER. This differs from the default level MEMORY_ONLY in the official documentation and in the RDD APIs.
davies Is this inconsistency intentional? Thanks!
Updates: since the data is always serialized on the Python side, the Java-specific deserialized storage levels, such as MEMORY_ONLY, are not removed.
Updates, based on the reviewers' feedback: in Python, stored objects will always be serialized with the [Pickle](https://docs.python.org/2/library/pickle.html) library, so it does not matter whether you choose a serialized level. The available storage levels in Python include `MEMORY_ONLY`, `MEMORY_ONLY_2`, `MEMORY_AND_DISK`, `MEMORY_AND_DISK_2`, `DISK_ONLY`, `DISK_ONLY_2` and `OFF_HEAP`.
Author: gatorsmile <gatorsmile@gmail.com>
Closes #10092 from gatorsmile/persistStorageLevel.
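A sketch of the aligned default (assumes an active `sc`):

```python
from pyspark import StorageLevel

rdd = sc.parallelize(range(100))
rdd.persist()  # now defaults to StorageLevel.MEMORY_ONLY, matching the docs
rdd.unpersist()
rdd.persist(StorageLevel.MEMORY_AND_DISK)  # explicit levels still work
```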
Since we renamed the column from ```text``` to ```value``` for DataFrames loaded by ```SQLContext.read.text```, we need to update the doc.
Author: Yanbo Liang <ybliang8@gmail.com>
Closes #10349 from yanboliang/text-value.
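A sketch of the renamed column (assumes an existing `sqlContext`; the path is a placeholder):

```python
df = sqlContext.read.text("data/sample.txt")
df.printSchema()           # root |-- value: string (was: text)
df.select("value").show()
```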
When invFunc is None, `reduceByKeyAndWindow(func, None, winsize, slidesize)` is equivalent to
`reduceByKey(func).window(winsize, slidesize).reduceByKey(func)`
and no checkpoint is necessary. The corresponding Scala code does exactly that, but the Python code always created a windowed stream with obligatory checkpointing. This patch fixes that.
I do not know how to unit-test this.
Author: David Tolpin <david.tolpin@gmail.com>
Closes #9888 from dtolpin/master.
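A sketch of the case in question (assumes a `pairs` DStream of key-value tuples):

```python
# With invFunc=None, this no longer forces checkpointing on the stream
counts = pairs.reduceByKeyAndWindow(lambda a, b: a + b, None, 30, 10)
counts.pprint()
```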
MLlib should use SQLContext.getOrCreate() instead of creating a new SQLContext.
Author: Davies Liu <davies@databricks.com>
Closes #10338 from davies/create_context.
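The pattern in question, as a sketch in Python (assumes an active `sc`):

```python
from pyspark.sql import SQLContext

# Reuses the active SQLContext if one exists instead of constructing a new one
sqlContext = SQLContext.getOrCreate(sc)
```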
Extend CrossValidator with HasSeed in PySpark.
This PR replaces [https://github.com/apache/spark/pull/7997]
CC yanboliang, thunterdb, mmenestret: would one of you mind taking a look? Thanks!
Author: Joseph K. Bradley <joseph@databricks.com>
Author: Martin MENESTRET <mmenestret@ippon.fr>
Closes #10268 from jkbradley/pyspark-cv-seed.
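A sketch of the new parameter (the estimator, grid, and evaluator are placeholders):

```python
from pyspark.ml.tuning import CrossValidator

# CrossValidator now mixes in HasSeed, so fold splitting is reproducible
cv = CrossValidator(estimator=estimator, estimatorParamMaps=grid,
                    evaluator=evaluator, numFolds=3, seed=42)
```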
Although this patch still doesn't solve the issue of why the return code is 0 (see the JIRA description), it resolves the Python version mismatch in the tests.
Author: Jeff Zhang <zjffdu@apache.org>
Closes #10322 from zjffdu/SPARK-12361.
JIRA: https://issues.apache.org/jira/browse/SPARK-12016
We should not directly use Word2VecModel in pyspark. We need to wrap it in a Word2VecModelWrapper when loading it in pyspark.
Author: Liang-Chi Hsieh <viirya@appier.com>
Closes #10100 from viirya/fix-load-py-wordvecmodel.
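A sketch of the load path this fixes (assumes an active `sc` and a model previously saved at `path`):

```python
from pyspark.mllib.feature import Word2VecModel

# Loading now goes through Word2VecModelWrapper, so the returned model is
# usable from Python, e.g. for synonym queries.
model = Word2VecModel.load(sc, path)
model.findSynonyms("spark", 5)
```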
Adds the ability to define an initial state RDD for use with updateStateByKey in PySpark. Added a unit test and changed the stateful_network_wordcount example to use an initial RDD.
Author: Bryan Cutler <bjcutler@us.ibm.com>
Closes #10082 from BryanCutler/initial-rdd-updateStateByKey-SPARK-11713.
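A sketch of the new parameter (assumes a `pairs` DStream; the update function is illustrative):

```python
# Seed the state with known (key, state) pairs before the stream starts
initial = sc.parallelize([("hello", 1), ("world", 1)])

def update(new_values, last_state):
    return sum(new_values) + (last_state or 0)

counts = pairs.updateStateByKey(update, initialRDD=initial)
```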
This PR adds a `private[sql]` method `metadata` to `SparkPlan`, which can be used to describe detailed information about a physical plan when visualizing a SQL query plan. Specifically, this PR uses this method to provide details of `PhysicalRDD`s translated from a data source relation. For example, a `ParquetRelation` converted from Hive metastore table `default.psrc` is now shown as the following screenshot:
![image](https://cloud.githubusercontent.com/assets/230655/11526657/e10cb7e6-9916-11e5-9afa-f108932ec890.png)
And here is the screenshot for a regular `ParquetRelation` (not converted from Hive metastore table) loaded from a really long path:
![output](https://cloud.githubusercontent.com/assets/230655/11680582/37c66460-9e94-11e5-8f50-842db5309d5a.png)
Author: Cheng Lian <lian@databricks.com>
Closes #10004 from liancheng/spark-12012.physical-rdd-metadata.
In SPARK-11946 the pivot API was changed a bit and its doc was updated, but the doc changes were not made for the Python API. This PR updates the Python doc to be consistent.
Author: Andrew Ray <ray.andrew@gmail.com>
Closes #10176 from aray/sql-pivot-python-doc.
Currently, the current line is not cleared by Ctrl-C.
After this patch:
```
>>> asdfasdf^C
Traceback (most recent call last):
File "~/spark/python/pyspark/context.py", line 225, in signal_handler
raise KeyboardInterrupt()
KeyboardInterrupt
```
It's still worse than 1.5 (and before).
Author: Davies Liu <davies@databricks.com>
Closes #10134 from davies/fix_cltrc.
Python tests require access to the `KinesisTestUtils` file. When this file lives under src/test, Python can't access it, since it is not available in the assembly jar.
However, if we move KinesisTestUtils to src/main, we need to add the KinesisProducerLibrary as a dependency. To avoid this, I moved KinesisTestUtils to src/main and extended it with ExtendedKinesisTestUtils under src/test, which adds support for the KPL.
cc zsxwing tdas
Author: Burak Yavuz <brkyvz@gmail.com>
Closes #10050 from brkyvz/kinesis-py.
Use ```coefficients``` to replace ```weights```; I hope these are the last two.
mengxr
Author: Yanbo Liang <ybliang8@gmail.com>
Closes #10065 from yanboliang/coefficients.
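A sketch of the renamed property (assumes a fitted spark.ml model; `lr` and `train_df` are placeholders):

```python
lr_model = lr.fit(train_df)
print(lr_model.coefficients)  # formerly lr_model.weights
print(lr_model.intercept)
```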
Author: Davies Liu <davies@databricks.com>
Closes #10090 from davies/fix_coalesce.
Fixed a minor race condition (a recovery issue) in #10017.
Closes #10017
Author: jerryshao <sshao@hortonworks.com>
Author: Shixiong Zhu <shixiong@databricks.com>
Closes #10074 from zsxwing/review-pr10017.
KinesisStreamTests in test.py is broken because of #9403. See https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/46896/testReport/(root)/KinesisStreamTests/test_kinesis_stream/
Because Python streaming didn’t work when https://github.com/apache/spark/pull/9403 was merged, the PR build didn’t actually report the Python test failure.
This PR just disables the test to unblock #10039.
Author: Shixiong Zhu <shixiong@databricks.com>
Closes #10047 from zsxwing/disable-python-kinesis-test.
Author: Jeff Zhang <zjffdu@apache.org>
Closes #9903 from zjffdu/SPARK-11917.
Added Python test cases for the functions `isnan`, `isnull`, `nanvl` and `json_tuple`.
Fixed a bug in the function `json_tuple`.
rxin, could you help me review my changes? Please let me know if anything is missing.
Thank you! Have a good Thanksgiving day!
Author: gatorsmile <gatorsmile@gmail.com>
Closes #9977 from gatorsmile/json_tuple.
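A sketch exercising the fixed function (assumes an existing `sqlContext`):

```python
from pyspark.sql import Row
from pyspark.sql.functions import json_tuple

df = sqlContext.createDataFrame([Row(jstring='{"f1": "value1", "f2": "value2"}')])
df.select(json_tuple(df.jstring, "f1", "f2")).collect()
# [Row(c0=u'value1', c1=u'value2')]
```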
The Python exception traceback in TransformFunction and TransformFunctionSerializer is not sent back to Java; Py4J just throws a very general exception, which is hard to debug.
This PR adds a `getFailure` method to get the failure message on the Java side.
Author: Shixiong Zhu <shixiong@databricks.com>
Closes #9922 from zsxwing/SPARK-11935.
Currently we do not have visualization for SQL queries issued from Python; this PR fixes that.
cc zsxwing
Author: Davies Liu <davies@databricks.com>
Closes #9949 from davies/pyspark_sql_ui.
Author: felixcheung <felixcheung_m@hotmail.com>
Closes #9967 from felixcheung/pypivotdoc.
…for registerFunction [Python]
A straightforward change to the Python doc.
Author: Jeff Zhang <zjffdu@apache.org>
Closes #9901 from zjffdu/SPARK-11860.
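For context, a sketch of the function being documented (assumes an existing `sqlContext`):

```python
from pyspark.sql.types import IntegerType

sqlContext.registerFunction("strlen", lambda s: len(s), IntegerType())
sqlContext.sql("SELECT strlen('spark')").collect()  # the result contains 5
```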
Author: Reynold Xin <rxin@databricks.com>
Closes #9948 from rxin/SPARK-10621.
This patch makes the use of varargs consistent across all DataFrameReader methods, including Parquet, JSON, text, and the generic load function.
Also added a few more API tests for the Java API.
Author: Reynold Xin <rxin@databricks.com>
Closes #9945 from rxin/SPARK-11967.
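A sketch of the varargs form from Python (the paths are placeholders):

```python
# Multiple paths can be passed directly as varargs
df = sqlContext.read.parquet("data/day1", "data/day2")
```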
Currently, pivot's signature looks like:
```scala
@scala.annotation.varargs
def pivot(pivotColumn: Column, values: Column*): GroupedData

@scala.annotation.varargs
def pivot(pivotColumn: String, values: Any*): GroupedData
```
I think we can remove the one that takes "Column" types, since callers should always be passing in literals. It would also be clearer if the values were not varargs but rather a Seq or java.util.List.
I also made similar changes for Python.
Author: Reynold Xin <rxin@databricks.com>
Closes #9929 from rxin/SPARK-11946.
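A sketch of the resulting Python API, where values is an optional list rather than varargs:

```python
# An explicit values list avoids a pass over the data to find distinct values
df.groupBy("year").pivot("course", ["dotNET", "Java"]).sum("earnings")
```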
This brings the Python API documentation of StreamingLogisticRegressionWithSGD and StreamingLinearRegressionWithSGD in line with the Scala versions.
- Fixed the algorithm descriptions
- Added default values to parameter descriptions
- Changed StreamingLogisticRegressionWithSGD regParam to default to 0, as in the Scala version
Author: Bryan Cutler <bjcutler@us.ibm.com>
Closes #9141 from BryanCutler/StreamingLogisticRegressionWithSGD-python-api-sync.
They should use the existing SQLContext.
Author: Davies Liu <davies@databricks.com>
Closes #9914 from davies/create_udf.
TransformFunction and TransformFunctionSerializer don't rethrow exceptions, so when any exception happens, they just return None. This causes weird NPEs and confuses people.
Author: Shixiong Zhu <shixiong@databricks.com>
Closes #9847 from zsxwing/pyspark-streaming-exception.
* Update the doc for PySpark ```HasCheckpointInterval``` so that users can understand how to disable checkpointing.
* Update the doc for PySpark ```cacheNodeIds``` of ```DecisionTreeParams``` to note the relationship between ```cacheNodeIds``` and ```checkpointInterval```.
Author: Yanbo Liang <ybliang8@gmail.com>
Closes #9856 from yanboliang/spark-11875.
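A sketch of the documented interaction (parameter values are illustrative):

```python
from pyspark.ml.classification import DecisionTreeClassifier

# checkpointInterval only matters when cacheNodeIds is True and a checkpoint
# directory is set on the SparkContext; -1 disables checkpointing.
dt = DecisionTreeClassifier(cacheNodeIds=True, checkpointInterval=10)
```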
In reduceByKeyAndWindow, invFunc is optional and can be None. Instead of invFunc (the parameter), invReduceFunc (a local function) was checked for truthiness (that is, not None, in this context). A local function is never None, so the case of invFunc=None (a common one when inverse reduction is not defined) was handled incorrectly, resulting in loss of data.
In addition, the docstring used wrong parameter names; these are also fixed.
Author: David Tolpin <david.tolpin@gmail.com>
Closes #9775 from dtolpin/master.
[SPARK-7685](https://issues.apache.org/jira/browse/SPARK-7685) and [SPARK-9642](https://issues.apache.org/jira/browse/SPARK-9642) have already added support for setting a weight column for ```LogisticRegression``` and ```LinearRegression```. It's a very important feature that PySpark should also support. mengxr
Author: Yanbo Liang <ybliang8@gmail.com>
Closes #9811 from yanboliang/spark-11820.
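A sketch of the new parameter (assumes a training DataFrame with a `weight` column; `train_df` is a placeholder):

```python
from pyspark.ml.classification import LogisticRegression

# Instance weights can now be set from Python
lr = LogisticRegression(maxIter=10, weightCol="weight")
model = lr.fit(train_df)
```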
Return Double.NaN for mean/average when count == 0 for all numeric types that are converted to Double; the Decimal type continues to return null.
Author: JihongMa <linlin200605@gmail.com>
Closes #9705 from JihongMA/SPARK-11720.
…ion in PySpark
Author: Jeff Zhang <zjffdu@apache.org>
Closes #9791 from zjffdu/SPARK-11804.
Fixed the merge conflicts in #7410
Closes #7410
Author: Shixiong Zhu <shixiong@databricks.com>
Author: jerryshao <saisai.shao@intel.com>
Author: jerryshao <sshao@hortonworks.com>
Closes #9742 from zsxwing/pr7410.
We checkpoint both when generating a batch and when completing a batch. When the processing time of a batch is greater than the batch interval, checkpointing for completing an old batch may run after checkpointing for generating a new batch. If this happens, the checkpoint of the old batch actually has the latest information, so we want to recover from it. This PR uses the latest checkpoint time as the file name, so that we can always recover from the latest checkpoint file.
Author: Shixiong Zhu <shixiong@databricks.com>
Closes #9707 from zsxwing/fix-checkpoint.
Author: Daniel Jalova <djalova@us.ibm.com>
Closes #9186 from djalova/SPARK-6328.
This patch adds the following options to the JSON data source, for dealing with non-standard JSON files:
* `allowComments` (default `false`): ignores Java/C++ style comments in JSON records
* `allowUnquotedFieldNames` (default `false`): allows unquoted JSON field names
* `allowSingleQuotes` (default `true`): allows single quotes in addition to double quotes
* `allowNumericLeadingZeros` (default `false`): allows leading zeros in numbers (e.g. 00012)
To avoid passing a lot of options throughout the json package, I introduced a new JSONOptions case class to define all JSON config options.
Also updated documentation to explain these options.
Scala
![screen shot 2015-11-15 at 6 12 12 pm](https://cloud.githubusercontent.com/assets/323388/11172965/e3ace6ec-8bc4-11e5-805e-2d78f80d0ed6.png)
Python
![screen shot 2015-11-15 at 6 11 28 pm](https://cloud.githubusercontent.com/assets/323388/11172964/e23ed6ee-8bc4-11e5-8216-312f5983acd5.png)
Author: Reynold Xin <rxin@databricks.com>
Closes #9724 from rxin/SPARK-11745.
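A sketch of enabling these options from Python (assumes an existing `sqlContext`; the path is a placeholder):

```python
df = (sqlContext.read
      .option("allowComments", "true")
      .option("allowUnquotedFieldNames", "true")
      .option("allowNumericLeadingZeros", "true")
      .json("data/nonstandard.json"))
```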
This PR adds pivot to the Python API of GroupedData, with the same syntax as Scala/Java.
Author: Andrew Ray <ray.andrew@gmail.com>
Closes #9653 from aray/sql-pivot-python.
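A sketch of the added method (assumes a DataFrame with year, course, and earnings columns):

```python
# Without an explicit values list, Spark computes the distinct pivot values
df.groupBy("year").pivot("course").sum("earnings").show()
```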
Make the streaming Python tests report failures: this PR checks the test results and returns 1 if a test fails, so that `run-tests.py` can mark the run as failed.
Author: Shixiong Zhu <shixiong@databricks.com>
Closes #9669 from zsxwing/streaming-python-tests.
Author: Chris Snow <chsnow123@gmail.com>
Closes #9640 from snowch/patch-3.
The example for sqlContext.createDataFrame from pandas.DataFrame has a typo.
Author: Chris Snow <chsnow123@gmail.com>
Closes #9639 from snowch/patch-2.
Switched stddev support from DeclarativeAggregate to ImperativeAggregate.
Author: JihongMa <linlin200605@gmail.com>
Closes #9380 from JihongMA/SPARK-11420.