path: root/CHANGES.txt
author Patrick Wendell <patrick@databricks.com> 2015-03-03 02:19:19 -0800
committer Patrick Wendell <patrick@databricks.com> 2015-03-03 02:19:19 -0800
commit ce7158cf70c1003c1011d9a755813b31feae91e4
tree fc096acd5e4507b29d7321dfe4fc11d2db1883cb /CHANGES.txt
parent ae60eb9984de56c36c7f63220f9281fbaac10931
Adding CHANGES.txt for Spark 1.3
Diffstat (limited to 'CHANGES.txt')
-rw-r--r-- CHANGES.txt 6520
1 files changed, 6520 insertions, 0 deletions
diff --git a/CHANGES.txt b/CHANGES.txt
new file mode 100644
index 0000000000..46d612a7f4
--- /dev/null
+++ b/CHANGES.txt
@@ -0,0 +1,6520 @@
+Spark Change Log
+----------------
+
+Release 1.3.0
+
+ BUILD: Minor tweaks to internal build scripts
+ Patrick Wendell <patrick@databricks.com>
+ 2015-03-03 00:38:12 -0800
+ Commit: ae60eb9
+
+ HOTFIX: Bump HBase version in MapR profiles.
+ Patrick Wendell <patrick@databricks.com>
+ 2015-03-03 01:38:07 -0800
+ Commit: 1aa8461
+
+ [SPARK-5537][MLlib][Docs] Add user guide for multinomial logistic regression
+ DB Tsai <dbtsai@alpinenow.com>
+ 2015-03-02 22:37:12 -0800
+ Commit: 841d2a2, github.com/apache/spark/pull/4866
+
+ [SPARK-6120] [mllib] Warnings about memory in tree, ensemble model save
+ Joseph K. Bradley <joseph@databricks.com>
+ 2015-03-02 22:33:51 -0800
+ Commit: 81648a7, github.com/apache/spark/pull/4864
+
+ [SPARK-6097][MLLIB] Support tree model save/load in PySpark/MLlib
+ Xiangrui Meng <meng@databricks.com>
+ 2015-03-02 22:27:01 -0800
+ Commit: 62c53be, github.com/apache/spark/pull/4854
+
+ [SPARK-5310][SQL] Fixes to Docs and Datasources API
+ Reynold Xin <rxin@databricks.com>, Michael Armbrust <michael@databricks.com>
+ 2015-03-02 22:14:08 -0800
+ Commit: 4e6e008, github.com/apache/spark/pull/4868
+
+ [SPARK-5950][SQL]Insert array into a metastore table saved as parquet should work when using datasource api
+ Yin Huai <yhuai@databricks.com>
+ 2015-03-02 19:31:55 -0800
+ Commit: 1b490e9, github.com/apache/spark/pull/4826
+
+ [SPARK-6127][Streaming][Docs] Add Kafka to Python api docs
+ Tathagata Das <tathagata.das1565@gmail.com>
+ 2015-03-02 18:40:46 -0800
+ Commit: ffd0591, github.com/apache/spark/pull/4860
+
+ [SPARK-5537] Add user guide for multinomial logistic regression
+ Xiangrui Meng <meng@databricks.com>, DB Tsai <dbtsai@alpinenow.com>
+ 2015-03-02 18:10:50 -0800
+ Commit: 11389f0, github.com/apache/spark/pull/4801
+
+ [SPARK-6121][SQL][MLLIB] simpleString for UDT
+ Xiangrui Meng <meng@databricks.com>
+ 2015-03-02 17:14:34 -0800
+ Commit: 1b8ab57, github.com/apache/spark/pull/4858
+
+ [SPARK-6048] SparkConf should not translate deprecated configs on set
+ Andrew Or <andrew@databricks.com>
+ 2015-03-02 16:36:42 -0800
+ Commit: ea69cf2, github.com/apache/spark/pull/4799
+
+ [SPARK-6066] Make event log format easier to parse
+ Andrew Or <andrew@databricks.com>
+ 2015-03-02 16:34:32 -0800
+ Commit: 8100b79, github.com/apache/spark/pull/4821
+
+ [SPARK-6082] [SQL] Provides better error message for malformed rows when caching tables
+ Cheng Lian <lian@databricks.com>
+ 2015-03-02 16:18:00 -0800
+ Commit: 866f281, github.com/apache/spark/pull/4842
+
+ [SPARK-6114][SQL] Avoid metastore conversions before plan is resolved
+ Michael Armbrust <michael@databricks.com>
+ 2015-03-02 16:10:54 -0800
+ Commit: 3899c7c, github.com/apache/spark/pull/4855
+
+ [SPARK-6050] [yarn] Relax matching of vcore count in received containers.
+ Marcelo Vanzin <vanzin@cloudera.com>
+ 2015-03-02 16:41:43 -0600
+ Commit: 650d1e7, github.com/apache/spark/pull/4818
+
+ [SPARK-6040][SQL] Fix the percent bug in tablesample
+ q00251598 <qiyadong@huawei.com>
+ 2015-03-02 13:16:29 -0800
+ Commit: a83b9bb, github.com/apache/spark/pull/4789
+
+ [Minor] Fix doc typo for describing primitiveTerm effectiveness condition
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-03-02 13:11:17 -0800
+ Commit: f92876a, github.com/apache/spark/pull/4762
+
+ SPARK-5390 [DOCS] Encourage users to post on Stack Overflow in Community Docs
+ Sean Owen <sowen@cloudera.com>
+ 2015-03-02 21:10:08 +0000
+ Commit: 58e7198, github.com/apache/spark/pull/4843
+
+ [DOCS] Refactored Dataframe join comment to use correct parameter ordering
+ Paul Power <paul.power@peerside.com>
+ 2015-03-02 13:08:47 -0800
+ Commit: 54ac243, github.com/apache/spark/pull/4847
+
+ [SPARK-6080] [PySpark] correct LogisticRegressionWithLBFGS regType parameter for pyspark
+ Yanbo Liang <ybliang8@gmail.com>
+ 2015-03-02 10:17:24 -0800
+ Commit: 4ffaf85, github.com/apache/spark/pull/4831
+
+ [SPARK-5741][SQL] Support the path contains comma in HiveContext
+ q00251598 <qiyadong@huawei.com>
+ 2015-03-02 10:13:11 -0800
+ Commit: f476108, github.com/apache/spark/pull/4532
+
+ [SPARK-6111] Fixed usage string in documentation.
+ Kenneth Myers <myerske@us.ibm.com>
+ 2015-03-02 17:25:24 +0000
+ Commit: b2b7f01, github.com/apache/spark/pull/4852
+
+ [SPARK-6052][SQL]In JSON schema inference, we should always set containsNull of an ArrayType to true
+ Yin Huai <yhuai@databricks.com>
+ 2015-03-02 23:18:07 +0800
+ Commit: a3fef2c, github.com/apache/spark/pull/4806
+
+ [SPARK-6073][SQL] Need to refresh metastore cache after append data in CreateMetastoreDataSourceAsSelect
+ Yin Huai <yhuai@databricks.com>
+ 2015-03-02 22:42:18 +0800
+ Commit: c59871c, github.com/apache/spark/pull/4824
+
+ [Streaming][Minor]Fix some error docs in streaming examples
+ Saisai Shao <saisai.shao@intel.com>
+ 2015-03-02 08:49:19 +0000
+ Commit: 1fe677a, github.com/apache/spark/pull/4837
+
+ [SPARK-6083] [MLLib] [DOC] Make Python API example consistent in NaiveBayes
+ MechCoder <manojkumarsivaraj334@gmail.com>
+ 2015-03-01 16:28:15 -0800
+ Commit: 6a2fc85, github.com/apache/spark/pull/4834
+
+ [SPARK-6053][MLLIB] support save/load in PySpark's ALS
+ Xiangrui Meng <meng@databricks.com>
+ 2015-03-01 16:26:57 -0800
+ Commit: b570d98, github.com/apache/spark/pull/4811
+
+ [SPARK-6074] [sql] Package pyspark sql bindings.
+ Marcelo Vanzin <vanzin@cloudera.com>
+ 2015-03-01 11:05:10 +0000
+ Commit: bb16618, github.com/apache/spark/pull/4822
+
+ SPARK-5984: Fix TimSort bug causes ArrayOutOfBoundsException
+ Evan Yu <ehotou@gmail.com>
+ 2015-02-28 18:55:34 -0800
+ Commit: 317694c, github.com/apache/spark/pull/4804
+
+ [SPARK-5775] [SQL] BugFix: GenericRow cannot be cast to SpecificMutableRow when nested data and partitioned table
+ Cheng Lian <lian@databricks.com>, Cheng Lian <liancheng@users.noreply.github.com>, Yin Huai <yhuai@databricks.com>
+ 2015-02-28 21:15:43 +0800
+ Commit: aa39460, github.com/apache/spark/pull/4792
+
+ [SPARK-5979][SPARK-6032] Smaller safer --packages fix
+ Burak Yavuz <brkyvz@gmail.com>
+ 2015-02-27 22:59:35 -0800
+ Commit: 5a55c96, github.com/apache/spark/pull/4802
+
+ [SPARK-6070] [yarn] Remove unneeded classes from shuffle service jar.
+ Marcelo Vanzin <vanzin@cloudera.com>
+ 2015-02-27 22:44:11 -0800
+ Commit: 1747e0a, github.com/apache/spark/pull/4820
+
+ [SPARK-6055] [PySpark] fix incorrect __eq__ of DataType
+ Davies Liu <davies@databricks.com>
+ 2015-02-27 20:07:17 -0800
+ Commit: 49f2187, github.com/apache/spark/pull/4808
+
+ [SPARK-5751] [SQL] Sets SPARK_HOME as SPARK_PID_DIR when running Thrift server test suites
+ Cheng Lian <lian@databricks.com>
+ 2015-02-28 08:41:49 +0800
+ Commit: 5d19cf0, github.com/apache/spark/pull/4758
+
+ [Streaming][Minor] Remove useless type signature of Java Kafka direct stream API
+ Saisai Shao <saisai.shao@intel.com>
+ 2015-02-27 13:01:42 -0800
+ Commit: ceebe3c, github.com/apache/spark/pull/4817
+
+ [SPARK-4587] [mllib] [docs] Fixed save,load calls in ML guide examples
+ Joseph K. Bradley <joseph@databricks.com>
+ 2015-02-27 13:00:36 -0800
+ Commit: 117e10c, github.com/apache/spark/pull/4816
+
+ [SPARK-6058][Yarn] Log the user class exception in ApplicationMaster
+ zsxwing <zsxwing@gmail.com>
+ 2015-02-27 13:31:46 +0000
+ Commit: bff8088, github.com/apache/spark/pull/4813
+
+ fix spark-6033, clarify the spark.worker.cleanup behavior in standalone mode
+ 许鹏 <peng.xu@fraudmetrix.cn>
+ 2015-02-26 23:05:56 -0800
+ Commit: b8db84c, github.com/apache/spark/pull/4803
+
+ SPARK-2168 [Spark core] Use relative URIs for the app links in the History Server.
+ Lukasz Jastrzebski <lukasz.jastrzebski@gmail.com>
+ 2015-02-26 22:38:06 -0800
+ Commit: 485b919, github.com/apache/spark/pull/4778
+
+ [SPARK-6024][SQL] When a data source table has too many columns, it's schema cannot be stored in metastore.
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-26 20:46:05 -0800
+ Commit: 6200f07, github.com/apache/spark/pull/4795
+
+ [SPARK-6037][SQL] Avoiding duplicate Parquet schema merging
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-02-27 11:06:47 +0800
+ Commit: 25a109e, github.com/apache/spark/pull/4786
+
+ SPARK-4579 [WEBUI] Scheduling Delay appears negative
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-26 17:35:09 -0800
+ Commit: b83a93e, github.com/apache/spark/pull/4796
+
+ [SPARK-5951][YARN] Remove unreachable driver memory properties in yarn client mode
+ mohit.goyal <mohit.goyal@guavus.com>
+ 2015-02-26 14:27:47 -0800
+ Commit: 5b426cb, github.com/apache/spark/pull/4730
+
+ Add a note for context termination for History server on Yarn
+ moussa taifi <moutai10@gmail.com>
+ 2015-02-26 14:19:43 -0800
+ Commit: 297c3ef, github.com/apache/spark/pull/4721
+
+ [SPARK-6018] [YARN] NoSuchMethodError in Spark app is swallowed by YARN AM
+ Cheolsoo Park <cheolsoop@netflix.com>
+ 2015-02-26 13:53:49 -0800
+ Commit: fe79674, github.com/apache/spark/pull/4773
+
+ [SPARK-6027][SPARK-5546] Fixed --jar and --packages not working for KafkaUtils and improved error message
+ Tathagata Das <tathagata.das1565@gmail.com>
+ 2015-02-26 13:46:07 -0800
+ Commit: 731a997, github.com/apache/spark/pull/4779
+
+ Modify default value description for spark.scheduler.minRegisteredResourcesRatio on docs.
+ Li Zhihui <zhihui.li@intel.com>
+ 2015-02-26 13:07:07 -0800
+ Commit: 62652dc, github.com/apache/spark/pull/4781
+
+ [SPARK-5363] Fix bug in PythonRDD: remove() inside iterator is not safe
+ Davies Liu <davies@databricks.com>
+ 2015-02-26 11:54:17 -0800
+ Commit: 5d309ad, github.com/apache/spark/pull/4776
+
+ [SPARK-6015] fix links to source code in Python API docs
+ Davies Liu <davies@databricks.com>
+ 2015-02-26 10:45:29 -0800
+ Commit: dafb3d2, github.com/apache/spark/pull/4772
+
+ [SPARK-6007][SQL] Add numRows param in DataFrame.show()
+ Jacky Li <jacky.likun@huawei.com>
+ 2015-02-26 10:40:58 -0800
+ Commit: 7c779d8, github.com/apache/spark/pull/4767
+
+ [SPARK-6016][SQL] Cannot read the parquet table after overwriting the existing table when spark.sql.parquet.cacheMetadata=true
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-27 01:01:32 +0800
+ Commit: b5c5e93, github.com/apache/spark/pull/4775
+
+ [SPARK-6023][SQL] ParquetConversions fails to replace the destination MetastoreRelation of an InsertIntoTable node to ParquetRelation2
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-26 22:39:49 +0800
+ Commit: e0f5fb0, github.com/apache/spark/pull/4782
+
+ [SPARK-5976][MLLIB] Add partitioner to factors returned by ALS
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-25 23:43:29 -0800
+ Commit: a51d9db, github.com/apache/spark/pull/4748
+
+ [SPARK-1182][Docs] Sort the configuration parameters in configuration.md
+ Brennon York <brennon.york@capitalone.com>
+ 2015-02-25 16:12:56 -0800
+ Commit: 56fa38a, github.com/apache/spark/pull/3863
+
+ [SPARK-5724] fix the misconfiguration in AkkaUtils
+ CodingCat <zhunansjtu@gmail.com>
+ 2015-02-23 11:29:25 +0000
+ Commit: b32a653, github.com/apache/spark/pull/4512
+
+ [SPARK-5974] [SPARK-5980] [mllib] [python] [docs] Update ML guide with save/load, Python GBT
+ Joseph K. Bradley <joseph@databricks.com>
+ 2015-02-25 16:13:17 -0800
+ Commit: a1b4856, github.com/apache/spark/pull/4750
+
+ [SPARK-5926] [SQL] make DataFrame.explain leverage queryExecution.logical
+ Yanbo Liang <ybliang8@gmail.com>
+ 2015-02-25 15:37:13 -0800
+ Commit: 5bd4b49, github.com/apache/spark/pull/4707
+
+ [SPARK-5999][SQL] Remove duplicate Literal matching block
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-02-25 15:22:33 -0800
+ Commit: 6fff9b8, github.com/apache/spark/pull/4760
+
+ [SPARK-6010] [SQL] Merging compatible Parquet schemas before computing splits
+ Cheng Lian <lian@databricks.com>
+ 2015-02-25 15:15:22 -0800
+ Commit: 016f1f8, github.com/apache/spark/pull/4768
+
+ [SPARK-5944] [PySpark] fix version in Python API docs
+ Davies Liu <davies@databricks.com>
+ 2015-02-25 15:13:34 -0800
+ Commit: 9aca3c6, github.com/apache/spark/pull/4731
+
+ [SPARK-5982] Remove incorrect Local Read Time Metric
+ Kay Ousterhout <kayousterhout@gmail.com>
+ 2015-02-25 14:55:24 -0800
+ Commit: 791df93, github.com/apache/spark/pull/4749
+
+ [SPARK-1955][GraphX]: VertexRDD can incorrectly assume index sharing
+ Brennon York <brennon.york@capitalone.com>
+ 2015-02-25 14:11:12 -0800
+ Commit: 8073767, github.com/apache/spark/pull/4705
+
+ SPARK-5930 [DOCS] Documented default of spark.shuffle.io.retryWait is confusing
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-25 12:20:44 -0800
+ Commit: eaffc6e, github.com/apache/spark/pull/4769
+
+ [SPARK-5996][SQL] Fix specialized outbound conversions
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-25 10:13:40 -0800
+ Commit: fada683, github.com/apache/spark/pull/4757
+
+ [SPARK-5994] [SQL] Python DataFrame documentation fixes
+ Davies Liu <davies@databricks.com>
+ 2015-02-24 20:51:55 -0800
+ Commit: 5c421e0, github.com/apache/spark/pull/4756
+
+ [SPARK-5286][SQL] SPARK-5286 followup
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-24 19:51:36 -0800
+ Commit: e7a748e, github.com/apache/spark/pull/4755
+
+ [SPARK-5993][Streaming][Build] Fix assembly jar location of kafka-assembly
+ Tathagata Das <tathagata.das1565@gmail.com>
+ 2015-02-24 19:10:37 -0800
+ Commit: 1e94894, github.com/apache/spark/pull/4753
+
+ [SPARK-5985][SQL] DataFrame sortBy -> orderBy in Python.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-24 18:59:23 -0800
+ Commit: 5e233b2, github.com/apache/spark/pull/4752
+
+ [SPARK-5904][SQL] DataFrame Java API test suites.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-24 18:51:41 -0800
+ Commit: 78a1781, github.com/apache/spark/pull/4751
+
+ [SPARK-5751] [SQL] [WIP] Revamped HiveThriftServer2Suite for robustness
+ Cheng Lian <lian@databricks.com>
+ 2015-02-25 08:34:55 +0800
+ Commit: 17ee246, github.com/apache/spark/pull/4720
+
+ [SPARK-5973] [PySpark] fix zip with two RDDs with AutoBatchedSerializer
+ Davies Liu <davies@databricks.com>
+ 2015-02-24 14:50:00 -0800
+ Commit: 91bf0f8, github.com/apache/spark/pull/4745
+
+ [SPARK-5952][SQL] Lock when using hive metastore client
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-24 13:39:29 -0800
+ Commit: 641423d, github.com/apache/spark/pull/4746
+
+ [MLLIB] Change x_i to y_i in Variance's user guide
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-24 11:38:59 -0800
+ Commit: a4ff445, github.com/apache/spark/pull/4740
+
+ [SPARK-5965] Standalone Worker UI displays {{USER_JAR}}
+ Andrew Or <andrew@databricks.com>
+ 2015-02-24 11:08:07 -0800
+ Commit: eaf7bf9, github.com/apache/spark/pull/4739
+
+ [Spark-5967] [UI] Correctly clean JobProgressListener.stageIdToActiveJobIds
+ Tathagata Das <tathagata.das1565@gmail.com>
+ 2015-02-24 11:02:47 -0800
+ Commit: 28dd53b, github.com/apache/spark/pull/4741
+
+ [SPARK-5532][SQL] Repartition should not use external rdd representation
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-24 10:52:18 -0800
+ Commit: e46096b, github.com/apache/spark/pull/4738
+
+ [SPARK-5910][SQL] Support for as in selectExpr
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-24 10:49:51 -0800
+ Commit: ba5d60d, github.com/apache/spark/pull/4736
+
+ [SPARK-5968] [SQL] Suppresses ParquetOutputCommitter WARN logs
+ Cheng Lian <lian@databricks.com>
+ 2015-02-24 10:45:38 -0800
+ Commit: 2b562b0, github.com/apache/spark/pull/4744
+
+ [SPARK-5958][MLLIB][DOC] update block matrix user guide
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-23 22:08:44 -0800
+ Commit: dd42558, github.com/apache/spark/pull/4737
+
+ [SPARK-5873][SQL] Allow viewing of partially analyzed plans in queryExecution
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-23 17:34:54 -0800
+ Commit: 2d7786e, github.com/apache/spark/pull/4684
+
+ [SPARK-5935][SQL] Accept MapType in the schema provided to a JSON dataset.
+ Yin Huai <yhuai@databricks.com>, Yin Huai <huai@cse.ohio-state.edu>
+ 2015-02-23 17:16:34 -0800
+ Commit: 33ccad2, github.com/apache/spark/pull/4710
+
+ [SPARK-5912] [docs] [mllib] Small fixes to ChiSqSelector docs
+ Joseph K. Bradley <joseph@databricks.com>
+ 2015-02-23 16:15:57 -0800
+ Commit: ae97040, github.com/apache/spark/pull/4732
+
+ [MLLIB] SPARK-5912 Programming guide for feature selection
+ Alexander Ulanov <nashb@yandex.ru>
+ 2015-02-23 12:09:40 -0800
+ Commit: 8355773, github.com/apache/spark/pull/4709
+
+ [SPARK-5939][MLLib] make FPGrowth example app take parameters
+ Jacky Li <jacky.likun@huawei.com>
+ 2015-02-23 08:47:28 -0800
+ Commit: 33b9084, github.com/apache/spark/pull/4714
+
+ [SPARK-5943][Streaming] Update the test to use new API to reduce the warning
+ Saisai Shao <saisai.shao@intel.com>
+ 2015-02-23 11:27:27 +0000
+ Commit: 67b7f79, github.com/apache/spark/pull/4722
+
+ [EXAMPLES] fix typo.
+ Makoto Fukuhara <fukuo33@gmail.com>
+ 2015-02-23 09:24:33 +0000
+ Commit: f172387, github.com/apache/spark/pull/4724
+
+ Revert "[SPARK-4808] Removing minimum number of elements read before spill check"
+ Andrew Or <andrew@databricks.com>
+ 2015-02-22 09:44:52 -0800
+ Commit: 4186dd3
+
+ SPARK-5669 [BUILD] Reverse exclusion of JBLAS libs for 1.3
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-22 09:09:06 +0000
+ Commit: eed7389, github.com/apache/spark/pull/4715
+
+ [DataFrame] [Typo] Fix the typo
+ Cheng Hao <hao.cheng@intel.com>
+ 2015-02-22 08:56:30 +0000
+ Commit: 04d3b32, github.com/apache/spark/pull/4717
+
+ [DOCS] Fix typo in API for custom InputFormats based on the “new” MapReduce API
+ Alexander <abezzubov@nflabs.com>
+ 2015-02-22 08:53:05 +0000
+ Commit: c5a5c6f, github.com/apache/spark/pull/4718
+
+ [SPARK-5937][YARN] Fix ClientSuite to set YARN mode, so that the correct class is used in t...
+ Hari Shreedharan <hshreedharan@apache.org>
+ 2015-02-21 10:01:01 -0800
+ Commit: 76e3e65, github.com/apache/spark/pull/4711
+
+ SPARK-5841 [CORE] [HOTFIX 2] Memory leak in DiskBlockManager
+ Nishkam Ravi <nravi@cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>, nravi <nravi@c1704.halxg.cloudera.com>
+ 2015-02-21 09:59:28 -0800
+ Commit: 932338e, github.com/apache/spark/pull/4690
+
+ [SPARK-5909][SQL] Add a clearCache command to Spark SQL's cache manager
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-20 16:20:02 +0800
+ Commit: b9a6c5c, github.com/apache/spark/pull/4694
+
+ [SPARK-5898] [SPARK-5896] [SQL] [PySpark] create DataFrame from pandas and tuple/list
+ Davies Liu <davies@databricks.com>
+ 2015-02-20 15:35:05 -0800
+ Commit: 913562a, github.com/apache/spark/pull/4679
+
+ [SPARK-5867] [SPARK-5892] [doc] [ml] [mllib] Doc cleanups for 1.3 release
+ Joseph K. Bradley <joseph@databricks.com>
+ 2015-02-20 02:31:32 -0800
+ Commit: 8c12f31, github.com/apache/spark/pull/4675
+
+ [SPARK-4808] Removing minimum number of elements read before spill check
+ mcheah <mcheah@palantir.com>
+ 2015-02-19 18:09:22 -0800
+ Commit: 0382dcc, github.com/apache/spark/pull/4420
+
+ [SPARK-5900][MLLIB] make PIC and FPGrowth Java-friendly
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-19 18:06:16 -0800
+ Commit: ba941ce, github.com/apache/spark/pull/4695
+
+ SPARK-5570: No docs stating that `new SparkConf().set("spark.driver.memory", ...) will not work
+ Ilya Ganelin <ilya.ganelin@capitalone.com>
+ 2015-02-19 15:50:58 -0800
+ Commit: c5f3b9e, github.com/apache/spark/pull/4665
+
+ SPARK-4682 [CORE] Consolidate various 'Clock' classes
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-19 15:35:23 -0800
+ Commit: bd49e8b, github.com/apache/spark/pull/4514
+
+ [Spark-5889] Remove pid file after stopping service.
+ Zhan Zhang <zhazhan@gmail.com>
+ 2015-02-19 23:13:02 +0000
+ Commit: ff8976e, github.com/apache/spark/pull/4676
+
+ [SPARK-5902] [ml] Made PipelineStage.transformSchema public instead of private to ml
+ Joseph K. Bradley <joseph@databricks.com>
+ 2015-02-19 12:46:27 -0800
+ Commit: 0c494cf, github.com/apache/spark/pull/4682
+
+ [SPARK-5904][SQL] DataFrame API fixes.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-19 12:09:44 -0800
+ Commit: 55d91d9, github.com/apache/spark/pull/4686
+
+ [SPARK-5825] [Spark Submit] Remove the double checking instance name when stopping the service
+ Cheng Hao <hao.cheng@intel.com>
+ 2015-02-19 12:07:51 -0800
+ Commit: fe00eb6, github.com/apache/spark/pull/4611
+
+ [SPARK-5423][Core] Cleanup resources in DiskMapIterator.finalize to ensure deleting the temp file
+ zsxwing <zsxwing@gmail.com>
+ 2015-02-19 18:37:31 +0000
+ Commit: 25fae8e, github.com/apache/spark/pull/4219
+
+ [SPARK-5816] Add huge compatibility warning in DriverWrapper
+ Andrew Or <andrew@databricks.com>
+ 2015-02-19 09:56:25 -0800
+ Commit: f93d4d9, github.com/apache/spark/pull/4687
+
+ SPARK-5548: Fix for AkkaUtilsSuite failure - attempt 2
+ Jacek Lewandowski <lewandowski.jacek@gmail.com>
+ 2015-02-19 09:53:36 -0800
+ Commit: fbcb949, github.com/apache/spark/pull/4653
+
+ [SPARK-5846] Correctly set job description and pool for SQL jobs
+ Kay Ousterhout <kayousterhout@gmail.com>
+ 2015-02-19 09:49:34 +0800
+ Commit: 092b45f, github.com/apache/spark/pull/4630
+
+ [SPARK-5879][MLLIB] update PIC user guide and add a Java example
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-18 16:29:32 -0800
+ Commit: a64f374, github.com/apache/spark/pull/4680
+
+ [SPARK-5722] [SQL] [PySpark] infer int as LongType
+ Davies Liu <davies@databricks.com>
+ 2015-02-18 14:17:04 -0800
+ Commit: 470cba8, github.com/apache/spark/pull/4666
+
+ [SPARK-5840][SQL] HiveContext cannot be serialized due to tuple extraction
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-18 14:02:32 -0800
+ Commit: b86e44c, github.com/apache/spark/pull/4628
+
+ [SPARK-5507] Added documentation for BlockMatrix
+ Burak Yavuz <brkyvz@gmail.com>
+ 2015-02-18 10:11:08 -0800
+ Commit: 56f8f29, github.com/apache/spark/pull/4664
+
+ [SPARK-5519][MLLIB] add user guide with example code for fp-growth
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-18 10:09:56 -0800
+ Commit: 661fbd3, github.com/apache/spark/pull/4661
+
+ SPARK-5669 [BUILD] [HOTFIX] Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-18 14:41:44 +0000
+ Commit: 9f256ce, github.com/apache/spark/pull/4673
+
+ SPARK-4610 addendum: [Minor] [MLlib] Minor doc fix in GBT classification example
+ MechCoder <manojkumarsivaraj334@gmail.com>
+ 2015-02-18 10:13:28 +0000
+ Commit: 3997e74, github.com/apache/spark/pull/4672
+
+ [SPARK-5878] fix DataFrame.repartition() in Python
+ Davies Liu <davies@databricks.com>
+ 2015-02-18 01:00:54 -0800
+ Commit: aca7991, github.com/apache/spark/pull/4667
+
+ Avoid deprecation warnings in JDBCSuite.
+ Tor Myklebust <tmyklebu@gmail.com>
+ 2015-02-18 01:00:13 -0800
+ Commit: 9a565b8, github.com/apache/spark/pull/4668
+
+ [Minor] [SQL] Cleans up DataFrame variable names and toDF() calls
+ Cheng Lian <lian@databricks.com>
+ 2015-02-17 23:36:20 -0800
+ Commit: 2bd33ce, github.com/apache/spark/pull/4670
+
+ [SPARK-5731][Streaming][Test] Fix incorrect test in DirectKafkaStreamSuite
+ Tathagata Das <tathagata.das1565@gmail.com>
+ 2015-02-17 22:44:16 -0800
+ Commit: f8f9a64, github.com/apache/spark/pull/4597
+
+ [SPARK-5723][SQL]Change the default file format to Parquet for CTAS statements.
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-17 18:14:33 -0800
+ Commit: 6e82c46, github.com/apache/spark/pull/4639
+
+ Preparing development version 1.3.1-SNAPSHOT
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-18 01:52:06 +0000
+ Commit: 2ab0ba0
+
+ Preparing Spark release v1.3.0-rc1
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-18 01:52:06 +0000
+ Commit: f97b0d4
+
+ [SPARK-5875][SQL]logical.Project should not be resolved if it contains aggregates or generators
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-17 17:50:39 -0800
+ Commit: e8284b2, github.com/apache/spark/pull/4663
+
+ Revert "Preparing Spark release v1.3.0-snapshot1"
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-17 17:48:47 -0800
+ Commit: 7320605
+
+ Revert "Preparing development version 1.3.1-SNAPSHOT"
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-17 17:48:43 -0800
+ Commit: 932ae4d
+
+ [SPARK-4454] Revert getOrElse() cleanup in DAGScheduler.getCacheLocs()
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-02-17 17:45:16 -0800
+ Commit: 7e5e4d8
+
+ [SPARK-4454] Properly synchronize accesses to DAGScheduler cacheLocs map
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-02-17 17:39:58 -0800
+ Commit: 07a401a, github.com/apache/spark/pull/4660
+
+ [SPARK-5811] Added documentation for maven coordinates and added Spark Packages support
+ Burak Yavuz <brkyvz@gmail.com>, Davies Liu <davies@databricks.com>
+ 2015-02-17 17:15:43 -0800
+ Commit: cb90584, github.com/apache/spark/pull/4662
+
+ [SPARK-5785] [PySpark] narrow dependency for cogroup/join in PySpark
+ Davies Liu <davies@databricks.com>
+ 2015-02-17 16:54:57 -0800
+ Commit: 8120235, github.com/apache/spark/pull/4629
+
+ [SPARK-5852][SQL]Fail to convert a newly created empty metastore parquet table to a data source parquet table.
+ Yin Huai <yhuai@databricks.com>, Cheng Hao <hao.cheng@intel.com>
+ 2015-02-17 15:47:59 -0800
+ Commit: 07d8ef9, github.com/apache/spark/pull/4655
+
+ [SPARK-5872] [SQL] create a sqlCtx in pyspark shell
+ Davies Liu <davies@databricks.com>
+ 2015-02-17 15:44:37 -0800
+ Commit: 0dba382, github.com/apache/spark/pull/4659
+
+ [SPARK-5871] output explain in Python
+ Davies Liu <davies@databricks.com>
+ 2015-02-17 13:48:38 -0800
+ Commit: cb06160, github.com/apache/spark/pull/4658
+
+ [SPARK-4172] [PySpark] Progress API in Python
+ Davies Liu <davies@databricks.com>
+ 2015-02-17 13:36:43 -0800
+ Commit: 35e23ff, github.com/apache/spark/pull/3027
+
+ [SPARK-5868][SQL] Fix python UDFs in HiveContext and checks in SQLContext
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-17 13:23:45 -0800
+ Commit: e65dc1f, github.com/apache/spark/pull/4657
+
+ [SQL] [Minor] Update the HiveContext Unittest
+ Cheng Hao <hao.cheng@intel.com>
+ 2015-02-17 12:25:35 -0800
+ Commit: 0135651, github.com/apache/spark/pull/4584
+
+ [Minor][SQL] Use same function to check path parameter in JSONRelation
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-02-17 12:24:13 -0800
+ Commit: d74d5e8, github.com/apache/spark/pull/4649
+
+ [SPARK-5862][SQL] Only transformUp the given plan once in HiveMetastoreCatalog
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-02-17 12:23:18 -0800
+ Commit: 62063b7, github.com/apache/spark/pull/4651
+
+ [Minor] fix typo in SQL document
+ CodingCat <zhunansjtu@gmail.com>
+ 2015-02-17 12:16:52 -0800
+ Commit: 5636c4a, github.com/apache/spark/pull/4656
+
+ [SPARK-5864] [PySpark] support .jar as python package
+ Davies Liu <davies@databricks.com>
+ 2015-02-17 12:05:06 -0800
+ Commit: 71cf6e2, github.com/apache/spark/pull/4652
+
+ SPARK-5841 [CORE] [HOTFIX] Memory leak in DiskBlockManager
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-17 19:40:06 +0000
+ Commit: e64afcd, github.com/apache/spark/pull/4648
+
+ [SPARK-5661]function hasShutdownDeleteTachyonDir should use shutdownDeleteTachyonPaths to determine whether contains file
+ xukun 00228947 <xukun.xu@huawei.com>, viper-kun <xukun.xu@huawei.com>
+ 2015-02-17 18:59:41 +0000
+ Commit: 420bc9b, github.com/apache/spark/pull/4418
+
+ [SPARK-5778] throw if nonexistent metrics config file provided
+ Ryan Williams <ryan.blake.williams@gmail.com>
+ 2015-02-17 10:57:16 -0800
+ Commit: 2bf2b56, github.com/apache/spark/pull/4571
+
+ [SPARK-5859] [PySpark] [SQL] fix DataFrame Python API
+ Davies Liu <davies@databricks.com>
+ 2015-02-17 10:22:48 -0800
+ Commit: 4a581aa, github.com/apache/spark/pull/4645
+
+ [SPARK-5166][SPARK-5247][SPARK-5258][SQL] API Cleanup / Documentation
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-17 10:21:17 -0800
+ Commit: cd3d415, github.com/apache/spark/pull/4642
+
+ [SPARK-5858][MLLIB] Remove unnecessary first() call in GLM
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-17 10:17:45 -0800
+ Commit: 97cb568, github.com/apache/spark/pull/4647
+
+ SPARK-5856: In Maven build script, launch Zinc with more memory
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-17 10:10:01 -0800
+ Commit: 8240629, github.com/apache/spark/pull/4643
+
+ Revert "[SPARK-5363] [PySpark] check ending mark in non-block way"
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-02-17 07:48:27 -0800
+ Commit: aeb85cd
+
+ [SPARK-5826][Streaming] Fix Configuration not serializable problem
+ jerryshao <saisai.shao@intel.com>
+ 2015-02-17 10:45:18 +0000
+ Commit: b8da5c3, github.com/apache/spark/pull/4612
+
+ HOTFIX: Style issue causing build break
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-16 22:10:39 -0800
+ Commit: e9241fa
+
+ [SPARK-5802][MLLIB] cache transformed data in glm
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-16 22:09:04 -0800
+ Commit: dfe0fa0, github.com/apache/spark/pull/4593
+
+ [SPARK-5853][SQL] Schema support in Row.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-16 20:42:57 -0800
+ Commit: d0701d9, github.com/apache/spark/pull/4640
+
+ SPARK-5850: Remove experimental label for Scala 2.11 and FlumePollingStream
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-16 20:33:33 -0800
+ Commit: c6a7069, github.com/apache/spark/pull/4638
+
+ [SPARK-5363] [PySpark] check ending mark in non-block way
+ Davies Liu <davies@databricks.com>
+ 2015-02-16 20:32:03 -0800
+ Commit: baad6b3, github.com/apache/spark/pull/4601
+
+ [SQL] Various DataFrame doc changes.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-16 19:00:30 -0800
+ Commit: e355b54, github.com/apache/spark/pull/4636
+
+ [SPARK-5849] Handle more types of invalid JSON requests in SubmitRestProtocolMessage.parseAction
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-02-16 18:08:02 -0800
+ Commit: 385a339, github.com/apache/spark/pull/4637
+
+ [SPARK-3340] Deprecate ADD_JARS and ADD_FILES
+ azagrebin <azagrebin@gmail.com>
+ 2015-02-16 18:06:19 -0800
+ Commit: d8c70fb, github.com/apache/spark/pull/4616
+
+ [SPARK-5788] [PySpark] capture the exception in python write thread
+ Davies Liu <davies@databricks.com>
+ 2015-02-16 17:57:14 -0800
+ Commit: c2a9a61, github.com/apache/spark/pull/4577
+
+ SPARK-5848: tear down the ConsoleProgressBar timer
+ Matt Whelan <mwhelan@perka.com>
+ 2015-02-17 00:59:49 +0000
+ Commit: 52994d8, github.com/apache/spark/pull/4635
+
+ [SPARK-4865][SQL]Include temporary tables in SHOW TABLES
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-16 15:59:23 -0800
+ Commit: 8a94bf7, github.com/apache/spark/pull/4618
+
+ [SQL] Optimize arithmetic and predicate operators
+ kai <kaizeng@eecs.berkeley.edu>
+ 2015-02-16 15:58:05 -0800
+ Commit: 639a3c2, github.com/apache/spark/pull/4472
+
+ [SPARK-5839][SQL]HiveMetastoreCatalog does not recognize table names and aliases of data source tables.
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-16 15:54:01 -0800
+ Commit: a15a0a0, github.com/apache/spark/pull/4626
+
+ [SPARK-5746][SQL] Check invalid cases for the write path of data source API
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-16 15:51:59 -0800
+ Commit: 4198654, github.com/apache/spark/pull/4617
+
+ HOTFIX: Break in Jekyll build from #4589
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-16 15:43:56 -0800
+ Commit: ad8fd4f
+
+ [SPARK-2313] Use socket to communicate GatewayServer port back to Python driver
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-02-16 15:25:11 -0800
+ Commit: b70b8ba, github.com/apache/spark/pull/3424.
+
+ SPARK-5357: Update commons-codec version to 1.10 (current)
+ Matt Whelan <mwhelan@perka.com>
+ 2015-02-16 23:05:34 +0000
+ Commit: 8c45619, github.com/apache/spark/pull/4153
+
+ SPARK-5841: remove DiskBlockManager shutdown hook on stop
+ Matt Whelan <mwhelan@perka.com>
+ 2015-02-16 22:54:32 +0000
+ Commit: dd977df, github.com/apache/spark/pull/4627
+
+ [SPARK-5833] [SQL] Adds REFRESH TABLE command
+ Cheng Lian <lian@databricks.com>
+ 2015-02-16 12:52:05 -0800
+ Commit: 864d77e, github.com/apache/spark/pull/4624
+
+ [SPARK-5296] [SQL] Add more filter types for data sources API
+ Cheng Lian <lian@databricks.com>
+ 2015-02-16 12:48:55 -0800
+ Commit: 363a9a7, github.com/apache/spark/pull/4623
+
+ [SQL] Add fetched row count in SparkSQLCLIDriver
+ OopsOutOfMemory <victorshengli@126.com>
+ 2015-02-16 12:34:09 -0800
+ Commit: 0368494, github.com/apache/spark/pull/4604
+
+ [SQL] Initial support for reporting location of error in sql string
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-16 12:32:56 -0800
+ Commit: 63fa123, github.com/apache/spark/pull/4587
+
+ [SPARK-5824] [SQL] add null format in ctas and set default col comment to null
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2015-02-16 12:31:36 -0800
+ Commit: c2eaaea, github.com/apache/spark/pull/4609
+
+ [SQL] [Minor] Update the SpecificMutableRow.copy
+ Cheng Hao <hao.cheng@intel.com>
+ 2015-02-16 12:21:08 -0800
+ Commit: 1a88955, github.com/apache/spark/pull/4619
+
+ SPARK-5795 [STREAMING] api.java.JavaPairDStream.saveAsNewAPIHadoopFiles may not be friendly to Java
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-16 19:32:31 +0000
+ Commit: fef2267, github.com/apache/spark/pull/4608
+
+ [SPARK-5799][SQL] Compute aggregation function on specified numeric columns
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-02-16 10:06:11 -0800
+ Commit: 0165e9d, github.com/apache/spark/pull/4592
+
+ [SPARK-4553] [SPARK-5767] [SQL] Wires Parquet data source with the newly introduced write support for data source API
+ Cheng Lian <lian@databricks.com>
+ 2015-02-16 01:38:31 -0800
+ Commit: 78f7edb, github.com/apache/spark/pull/4563
+
+ [Minor] [SQL] Renames stringRddToDataFrame to stringRddToDataFrameHolder for consistency
+ Cheng Lian <lian@databricks.com>
+ 2015-02-16 01:33:37 -0800
+ Commit: 066301c, github.com/apache/spark/pull/4613
+
+ [Ml] SPARK-5804 Explicitly manage cache in Crossvalidator k-fold loop
+ Peter Rudenko <petro.rudenko@gmail.com>
+ 2015-02-16 00:07:23 -0800
+ Commit: 0d93205, github.com/apache/spark/pull/4595
+
+ [Ml] SPARK-5796 Don't transform data on a last estimator in Pipeline
+ Peter Rudenko <petro.rudenko@gmail.com>
+ 2015-02-15 20:51:32 -0800
+ Commit: 9cf7d70, github.com/apache/spark/pull/4590
+
+ SPARK-5815 [MLLIB] Deprecate SVDPlusPlus APIs that expose DoubleMatrix from JBLAS
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-15 20:41:27 -0800
+ Commit: db3c539, github.com/apache/spark/pull/4614
+
+ [SPARK-5769] Set params in constructors and in setParams in Python ML pipelines
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-15 20:29:26 -0800
+ Commit: d710991, github.com/apache/spark/pull/4564
+
+ SPARK-5669 [BUILD] Spark assembly includes incompatibly licensed libgfortran, libgcc code via JBLAS
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-15 09:15:48 -0800
+ Commit: 4e099d7, github.com/apache/spark/pull/4453
+
+ [MLLIB][SPARK-5502] User guide for isotonic regression
+ martinzapletal <zapletal-martin@email.cz>
+ 2015-02-15 09:10:03 -0800
+ Commit: d96e188, github.com/apache/spark/pull/4536
+
+ [HOTFIX] Ignore DirectKafkaStreamSuite.
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-13 12:43:53 -0800
+ Commit: 70ebad4
+
+ [SPARK-5827][SQL] Add missing import in the example of SqlContext
+ Takeshi Yamamuro <linguin.m.s@gmail.com>
+ 2015-02-15 14:42:20 +0000
+ Commit: 9c1c70d, github.com/apache/spark/pull/4615
+
+ SPARK-5822 [BUILD] cannot import src/main/scala & src/test/scala into eclipse as source folder
+ gli <gli@redhat.com>
+ 2015-02-14 20:43:27 +0000
+ Commit: f87f3b7, github.com/apache/spark/pull/4531
+
+ Revise formatting of previous commit f80e2629bb74bc62960c61ff313f7e7802d61319
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-14 20:12:29 +0000
+ Commit: 1945fcf
+
+ [SPARK-5800] Streaming Docs. Change linked files according to the selected language
+ gasparms <gmunoz@stratio.com>
+ 2015-02-14 20:10:29 +0000
+ Commit: e99e170, github.com/apache/spark/pull/4589
+
+ [SPARK-5752][SQL] Don't implicitly convert RDDs directly to DataFrames
+ Reynold Xin <rxin@databricks.com>, Davies Liu <davies@databricks.com>
+ 2015-02-13 23:03:22 -0800
+ Commit: ba91bf5, github.com/apache/spark/pull/4556
+
+ SPARK-3290 [GRAPHX] No unpersist calls in SVDPlusPlus
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-13 20:12:52 -0800
+ Commit: db57479, github.com/apache/spark/pull/4234
+
+ [SPARK-5227] [SPARK-5679] Disable FileSystem cache in WholeTextFileRecordReaderSuite
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-02-13 17:45:31 -0800
+ Commit: 152147f, github.com/apache/spark/pull/4599
+
+ [SPARK-5730][ML] add doc groups to spark.ml components
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-13 16:45:59 -0800
+ Commit: fccd38d, github.com/apache/spark/pull/4600
+
+ [SPARK-5803][MLLIB] use ArrayBuilder to build primitive arrays
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-13 16:43:49 -0800
+ Commit: 356b798, github.com/apache/spark/pull/4594
+
+ [SPARK-5806] re-organize sections in mllib-clustering.md
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-13 15:09:27 -0800
+ Commit: 9658763, github.com/apache/spark/pull/4598
+
+ [SPARK-5789][SQL]Throw a better error message if JsonRDD.parseJson encounters unrecoverable parsing errors.
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-13 13:51:06 -0800
+ Commit: d9d0250, github.com/apache/spark/pull/4582
+
+ [SPARK-5642] [SQL] Apply column pruning on unused aggregation fields
+ Daoyuan Wang <daoyuan.wang@intel.com>, Michael Armbrust <michael@databricks.com>
+ 2015-02-13 13:46:50 -0800
+ Commit: efffc2e, github.com/apache/spark/pull/4415
+
+ [HOTFIX] Fix build break in MesosSchedulerBackendSuite
+ Andrew Or <andrew@databricks.com>
+ 2015-02-13 13:10:29 -0800
+ Commit: 4160371
+
+ SPARK-5805 Fixed the type error in documentation.
+ Emre Sevinç <emre.sevinc@gmail.com>
+ 2015-02-13 12:31:27 -0800
+ Commit: ad73189, github.com/apache/spark/pull/4596
+
+ [SPARK-5735] Replace uses of EasyMock with Mockito
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-02-13 09:53:57 -0800
+ Commit: cc9eec1, github.com/apache/spark/pull/4578
+
+ [SPARK-5783] Better eventlog-parsing error messages
+ Ryan Williams <ryan.blake.williams@gmail.com>
+ 2015-02-13 09:47:26 -0800
+ Commit: e5690a5, github.com/apache/spark/pull/4573
+
+ [SPARK-5503][MLLIB] Example code for Power Iteration Clustering
+ sboeschhuawei <stephen.boesch@huawei.com>
+ 2015-02-13 09:45:57 -0800
+ Commit: 5e63942, github.com/apache/spark/pull/4495
+
+ [SPARK-5732][CORE]:Add an option to print the spark version in spark script.
+ uncleGen <hustyugm@gmail.com>, genmao.ygm <genmao.ygm@alibaba-inc.com>
+ 2015-02-13 09:43:10 -0800
+ Commit: 5c883df, github.com/apache/spark/pull/4522
+
+ [SPARK-4832][Deploy]some other processes might take the daemon pid
+ WangTaoTheTonic <barneystinson@aliyun.com>, WangTaoTheTonic <wangtao111@huawei.com>
+ 2015-02-13 10:27:23 +0000
+ Commit: 1255e83, github.com/apache/spark/pull/3683
+
+ [SQL] Fix docs of SQLContext.tables
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-12 20:37:55 -0800
+ Commit: a8f560c, github.com/apache/spark/pull/4579
+
+ [SPARK-3365][SQL]Wrong schema generated for List type
+ tianyi <tianyi.asiainfo@gmail.com>
+ 2015-02-12 22:18:39 -0800
+ Commit: b9f332a, github.com/apache/spark/pull/4581
+
+ [SPARK-3299][SQL]Public API in SQLContext to list tables
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-12 18:08:01 -0800
+ Commit: edbac17, github.com/apache/spark/pull/4547
+
+ [SQL] Move SaveMode to SQL package.
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-12 15:32:17 -0800
+ Commit: 925fd84, github.com/apache/spark/pull/4542
+
+ [SPARK-5335] Fix deletion of security groups within a VPC
+ Vladimir Grigor <vladimir@kiosked.com>, Vladimir Grigor <vladimir@voukka.com>
+ 2015-02-12 23:26:24 +0000
+ Commit: 5c9db4e, github.com/apache/spark/pull/4122
+
+ [SPARK-5755] [SQL] remove unnecessary Add
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2015-02-12 15:22:07 -0800
+ Commit: f7103b3, github.com/apache/spark/pull/4551
+
+ [SPARK-5573][SQL] Add explode to dataframes
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-12 15:19:19 -0800
+ Commit: c7eb9ee, github.com/apache/spark/pull/4546
+
+ [SPARK-5758][SQL] Use LongType as the default type for integers in JSON schema inference.
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-12 15:17:25 -0800
+ Commit: b0c79da, github.com/apache/spark/pull/4544
+
+ [SPARK-5780] [PySpark] Mute the logging during unit tests
+ Davies Liu <davies@databricks.com>
+ 2015-02-12 14:54:38 -0800
+ Commit: bf0d15c, github.com/apache/spark/pull/4572
+
+ SPARK-5747: Fix wordsplitting bugs in make-distribution.sh
+ David Y. Ross <dyross@gmail.com>
+ 2015-02-12 14:52:38 -0800
+ Commit: 11a0d5b, github.com/apache/spark/pull/4540
+
+ [SPARK-5759][Yarn]ExecutorRunnable should catch YarnException while NMClient start contain...
+ lianhuiwang <lianhuiwang09@gmail.com>
+ 2015-02-12 14:50:16 -0800
+ Commit: 02d5b32, github.com/apache/spark/pull/4554
+
+ [SPARK-5760][SPARK-5761] Fix standalone rest protocol corner cases + revamp tests
+ Andrew Or <andrew@databricks.com>
+ 2015-02-12 14:47:52 -0800
+ Commit: 11d1080, github.com/apache/spark/pull/4557
+
+ [SPARK-5762] Fix shuffle write time for sort-based shuffle
+ Kay Ousterhout <kayousterhout@gmail.com>
+ 2015-02-12 14:46:37 -0800
+ Commit: 0040fc5, github.com/apache/spark/pull/4559
+
+ [SPARK-5765][Examples]Fixed word split problem in run-example and compute-classpath
+ Venkata Ramana G <ramana.gollamudihuawei.com>, Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
+ 2015-02-12 14:44:21 -0800
+ Commit: 9a1de4b, github.com/apache/spark/pull/4561
+
+ [SPARK-5645] Added local read bytes/time to task metrics
+ Kay Ousterhout <kayousterhout@gmail.com>
+ 2015-02-12 14:35:44 -0800
+ Commit: 74f34bb, github.com/apache/spark/pull/4510
+
+ [SQL] Improve error messages
+ Michael Armbrust <michael@databricks.com>, wangfei <wangfei1@huawei.com>
+ 2015-02-12 13:11:28 -0800
+ Commit: e3a975d, github.com/apache/spark/pull/4558
+
+ [SQL][DOCS] Update sql documentation
+ Antonio Navarro Perez <ajnavarro@users.noreply.github.com>
+ 2015-02-12 12:46:17 -0800
+ Commit: cbd659e, github.com/apache/spark/pull/4560
+
+ [SPARK-5757][MLLIB] replace SQL JSON usage in model import/export by json4s
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-12 10:48:13 -0800
+ Commit: e26c149, github.com/apache/spark/pull/4555
+
+ [SPARK-5655] Don't chmod700 application files if running in YARN
+ Andrew Rowson <github@growse.com>
+ 2015-02-12 18:41:39 +0000
+ Commit: e23c8f5, github.com/apache/spark/pull/4509
+
+ [SQL] Make dataframe more tolerant of being serialized
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-11 19:05:49 -0800
+ Commit: 3c1b9bf, github.com/apache/spark/pull/4545
+
+ [SQL] Two DataFrame fixes.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-11 18:32:48 -0800
+ Commit: bcb1382, github.com/apache/spark/pull/4543
+
+ [SPARK-3688][SQL] More inline comments for LogicalPlan.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-11 15:26:31 -0800
+ Commit: 08ab3d2, github.com/apache/spark/pull/4539
+
+ [SPARK-3688][SQL]LogicalPlan can't resolve column correctly
+ tianyi <tianyi.asiainfo@gmail.com>
+ 2015-02-11 12:50:17 -0800
+ Commit: e136f47, github.com/apache/spark/pull/4524
+
+ [SPARK-5454] More robust handling of self joins
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-11 12:31:56 -0800
+ Commit: 1bb3631, github.com/apache/spark/pull/4520
+
+ Remove outdated remark about take(n).
+ Daniel Darabos <darabos.daniel@gmail.com>
+ 2015-02-11 20:24:17 +0000
+ Commit: 72adfc5, github.com/apache/spark/pull/4533
+
+ [SPARK-5677] [SPARK-5734] [SQL] [PySpark] Python DataFrame API remaining tasks
+ Davies Liu <davies@databricks.com>
+ 2015-02-11 12:13:16 -0800
+ Commit: d66aae2, github.com/apache/spark/pull/4528
+
+ [SPARK-5733] Error Link in Pagination of HistoryPage when showing Incomplete Applications
+ guliangliang <guliangliang@qiyi.com>
+ 2015-02-11 15:55:49 +0000
+ Commit: 864dccd, github.com/apache/spark/pull/4523
+
+ SPARK-5727 [BUILD] Deprecate Debian packaging
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-11 08:30:16 +0000
+ Commit: 057ec4f, github.com/apache/spark/pull/4516
+
+ SPARK-5728 [STREAMING] MQTTStreamSuite leaves behind ActiveMQ database files
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-11 08:13:51 +0000
+ Commit: 476b6d7, github.com/apache/spark/pull/4517
+
+ [SPARK-4964] [Streaming] refactor createRDD to take leaders via map instead of array
+ cody koeninger <cody@koeninger.org>
+ 2015-02-11 00:13:27 -0800
+ Commit: 811d179, github.com/apache/spark/pull/4511
+
+ Preparing development version 1.3.1-SNAPSHOT
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-11 07:47:03 +0000
+ Commit: e57c81b
+
+ Preparing Spark release v1.3.0-snapshot1
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-11 07:47:02 +0000
+ Commit: d97bfc6
+
+ Revert "Preparing Spark release v1.3.0-snapshot1"
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-10 23:46:04 -0800
+ Commit: 6a91d59
+
+ Revert "Preparing development version 1.3.1-SNAPSHOT"
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-10 23:46:02 -0800
+ Commit: 3a50383
+
+ HOTFIX: Adding Junit to Hive tests for Maven build
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-10 23:39:21 -0800
+ Commit: 0386fc4
+
+ Preparing development version 1.3.1-SNAPSHOT
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-11 06:45:03 +0000
+ Commit: ba12b79
+
+ Preparing Spark release v1.3.0-snapshot1
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-11 06:45:03 +0000
+ Commit: 53068f5
+
+ HOTFIX: Java 6 compilation error in Spark SQL
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-10 22:43:32 -0800
+ Commit: 15180bc
+
+ Revert "Preparing Spark release v1.3.0-snapshot1"
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-10 22:44:10 -0800
+ Commit: 536dae9
+
+ Revert "Preparing development version 1.3.1-SNAPSHOT"
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-10 22:44:07 -0800
+ Commit: 01b562e
+
+ Preparing development version 1.3.1-SNAPSHOT
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-11 06:15:29 +0000
+ Commit: db80d0f
+
+ Preparing Spark release v1.3.0-snapshot1
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-11 06:15:29 +0000
+ Commit: c2e4001
+
+ Updating versions for Spark 1.3
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-10 21:54:55 -0800
+ Commit: 2f52489
+
+ [SPARK-5714][Mllib] Refactor initial step of LDA to remove redundant operations
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-02-10 21:51:15 -0800
+ Commit: ba3aa8f, github.com/apache/spark/pull/4501
+
+ [SPARK-5702][SQL] Allow short names for built-in data sources.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-10 20:40:21 -0800
+ Commit: 63af90c, github.com/apache/spark/pull/4489
+
+ [SPARK-5729] Potential NPE in standalone REST API
+ Andrew Or <andrew@databricks.com>
+ 2015-02-10 20:19:14 -0800
+ Commit: 1bc75b0, github.com/apache/spark/pull/4518
+
+ [SPARK-4879] Use driver to coordinate Hadoop output committing for speculative tasks
+ mcheah <mcheah@palantir.com>, Josh Rosen <joshrosen@databricks.com>
+ 2015-02-10 20:12:18 -0800
+ Commit: 79cd59c, github.com/apache/spark/pull/4155
+
+ [SQL][DataFrame] Fix column computability bug.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-10 19:50:44 -0800
+ Commit: e477e91, github.com/apache/spark/pull/4519
+
+ [SPARK-5709] [SQL] Add EXPLAIN support in DataFrame API for debugging purpose
+ Cheng Hao <hao.cheng@intel.com>
+ 2015-02-10 19:40:51 -0800
+ Commit: 7fa0d5f, github.com/apache/spark/pull/4496
+
+ [SPARK-5704] [SQL] [PySpark] createDataFrame from RDD with columns
+ Davies Liu <davies@databricks.com>
+ 2015-02-10 19:40:12 -0800
+ Commit: 1056c5b, github.com/apache/spark/pull/4498
+
+ [SPARK-5683] [SQL] Avoid creating multiple JSON generators
+ Cheng Hao <hao.cheng@intel.com>
+ 2015-02-10 18:19:56 -0800
+ Commit: fc0446f, github.com/apache/spark/pull/4468
+
+ [SQL] Add an exception for analysis errors.
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-10 17:32:42 -0800
+ Commit: 748cdc1, github.com/apache/spark/pull/4439
+
+ [SPARK-5658][SQL] Finalize DDL and write support APIs
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-10 17:29:52 -0800
+ Commit: a21090e, github.com/apache/spark/pull/4446
+
+ [SPARK-5493] [core] Add option to impersonate user.
+ Marcelo Vanzin <vanzin@cloudera.com>
+ 2015-02-10 17:19:10 -0800
+ Commit: 8e75b0e, github.com/apache/spark/pull/4405
+
+ [SQL] Make Options in the data source API CREATE TABLE statements optional.
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-10 17:06:12 -0800
+ Commit: 445dbc7, github.com/apache/spark/pull/4515
+
+ [SPARK-5725] [SQL] Fixes ParquetRelation2.equals
+ Cheng Lian <lian@databricks.com>
+ 2015-02-10 17:02:44 -0800
+ Commit: f43bc3d, github.com/apache/spark/pull/4513
+
+ [SPARK-5343][GraphX]: ShortestPaths traverses backwards
+ Brennon York <brennon.york@capitalone.com>
+ 2015-02-10 14:57:00 -0800
+ Commit: 5be8902, github.com/apache/spark/pull/4478
+
+ [SPARK-5021] [MLlib] Gaussian Mixture now supports Sparse Input
+ MechCoder <manojkumarsivaraj334@gmail.com>
+ 2015-02-10 14:05:55 -0800
+ Commit: bba0953, github.com/apache/spark/pull/4459
+
+ [HOTFIX][SPARK-4136] Fix compilation and tests
+ Andrew Or <andrew@databricks.com>
+ 2015-02-10 11:18:01 -0800
+ Commit: 4e3aa68
+
+ [SPARK-5686][SQL] Add show current roles command in HiveQl
+ OopsOutOfMemory <victorshengli@126.com>
+ 2015-02-10 13:20:15 -0800
+ Commit: 8b7587a, github.com/apache/spark/pull/4471
+
+ [SQL] Add toString to DataFrame/Column
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-10 13:14:01 -0800
+ Commit: ef739d9, github.com/apache/spark/pull/4436
+
+ SPARK-5613: Catch the ApplicationNotFoundException exception to avoid thread from getting killed on yarn restart.
+ Kashish Jain <kashish.jain@guavus.com>
+ 2015-02-06 13:47:23 -0800
+ Commit: c294216, github.com/apache/spark/pull/4392
+
+ [SPARK-5592][SQL] java.net.URISyntaxException when insert data to a partitioned table
+ wangfei <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
+ 2015-02-10 11:54:30 -0800
+ Commit: dbfce30, github.com/apache/spark/pull/4368
+
+ SPARK-4136. Under dynamic allocation, cancel outstanding executor requests when no longer needed
+ Sandy Ryza <sandy@cloudera.com>
+ 2015-02-10 11:07:25 -0800
+ Commit: e53da21, github.com/apache/spark/pull/4168
+
+ [SPARK-5716] [SQL] Support TOK_CHARSETLITERAL in HiveQl
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2015-02-10 11:08:21 -0800
+ Commit: e508237, github.com/apache/spark/pull/4502
+
+ [SPARK-5717] [MLlib] add stop and reorganize import
+ JqueryFan <firing@126.com>, Yuhao Yang <hhbyyh@gmail.com>
+ 2015-02-10 17:37:32 +0000
+ Commit: b32f553, github.com/apache/spark/pull/4503
+
+ [SPARK-5700] [SQL] [Build] Bumps jets3t to 0.9.3 for hadoop-2.3 and hadoop-2.4 profiles
+ Cheng Lian <lian@databricks.com>
+ 2015-02-10 02:28:47 -0800
+ Commit: d6f31e0, github.com/apache/spark/pull/4499
+
+ SPARK-5239 [CORE] JdbcRDD throws "java.lang.AbstractMethodError: oracle.jdbc.driver.xxxxxx.isClosed()Z"
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-10 09:19:01 +0000
+ Commit: 4cfc025, github.com/apache/spark/pull/4470
+
+ [SPARK-4964][Streaming][Kafka] More updates to Exactly-once Kafka stream
+ Tathagata Das <tathagata.das1565@gmail.com>
+ 2015-02-09 22:45:48 -0800
+ Commit: 281614d, github.com/apache/spark/pull/4384
+
+ [SPARK-5597][MLLIB] save/load for decision trees and ensembles
+ Joseph K. Bradley <joseph@databricks.com>, Xiangrui Meng <meng@databricks.com>
+ 2015-02-09 22:09:07 -0800
+ Commit: 01905c4, github.com/apache/spark/pull/4444
+
+ [SQL] Remove the duplicated code
+ Cheng Hao <hao.cheng@intel.com>
+ 2015-02-09 21:33:34 -0800
+ Commit: 663d34e, github.com/apache/spark/pull/4494
+
+ [SPARK-5701] Only set ShuffleReadMetrics when task has shuffle deps
+ Kay Ousterhout <kayousterhout@gmail.com>
+ 2015-02-09 21:22:09 -0800
+ Commit: 6ddbca4, github.com/apache/spark/pull/4488
+
+ [SPARK-5703] AllJobsPage throws empty.max exception
+ Andrew Or <andrew@databricks.com>
+ 2015-02-09 21:18:48 -0800
+ Commit: 8326255, github.com/apache/spark/pull/4490
+
+ [SPARK-2996] Implement userClassPathFirst for driver, yarn.
+ Marcelo Vanzin <vanzin@cloudera.com>
+ 2015-02-09 21:17:06 -0800
+ Commit: 6a1e0f9, github.com/apache/spark/pull/3233
+
+ SPARK-4900 [MLLIB] MLlib SingularValueDecomposition ARPACK IllegalStateException
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-09 21:13:58 -0800
+ Commit: ebf1df0, github.com/apache/spark/pull/4485
+
+ Add a config option to print DAG.
+ KaiXinXiaoLei <huleilei1@huawei.com>
+ 2015-02-09 20:58:58 -0800
+ Commit: dad05e0, github.com/apache/spark/pull/4257
+
+ [SPARK-5469] restructure pyspark.sql into multiple files
+ Davies Liu <davies@databricks.com>
+ 2015-02-09 20:49:22 -0800
+ Commit: f0562b4, github.com/apache/spark/pull/4479
+
+ [SPARK-5698] Do not let user request negative # of executors
+ Andrew Or <andrew@databricks.com>
+ 2015-02-09 17:33:29 -0800
+ Commit: 62b1e1f, github.com/apache/spark/pull/4483
+
+ [SPARK-5699] [SQL] [Tests] Runs hive-thriftserver tests whenever SQL code is modified
+ Cheng Lian <lian@databricks.com>
+ 2015-02-09 16:52:05 -0800
+ Commit: 71f0f51, github.com/apache/spark/pull/4486
+
+ [SPARK-5648][SQL] support "alter ... unset tblproperties("key")"
+ DoingDone9 <799203320@qq.com>
+ 2015-02-09 16:40:26 -0800
+ Commit: e2bf59a, github.com/apache/spark/pull/4424
+
+ [SPARK-2096][SQL] support dot notation on array of struct
+ Wenchen Fan <cloud0fan@outlook.com>
+ 2015-02-09 16:39:34 -0800
+ Commit: 15f557f, github.com/apache/spark/pull/2405
+
+ [SPARK-5614][SQL] Predicate pushdown through Generate.
+ Lu Yan <luyan02@baidu.com>
+ 2015-02-09 16:25:38 -0800
+ Commit: ce2c89c, github.com/apache/spark/pull/4394
+
+ [SPARK-5696] [SQL] [HOTFIX] Asks HiveThriftServer2 to re-initialize log4j using Hive configurations
+ Cheng Lian <lian@databricks.com>
+ 2015-02-09 16:23:12 -0800
+ Commit: 379233c, github.com/apache/spark/pull/4484
+
+ [SQL] Code cleanup.
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-09 16:20:42 -0800
+ Commit: e241601, github.com/apache/spark/pull/4482
+
+ [SQL] Add some missing DataFrame functions.
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-09 16:02:56 -0800
+ Commit: a70dca0, github.com/apache/spark/pull/4437
+
+ [SPARK-5675][SQL] XyzType companion object should subclass XyzType
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-09 14:51:46 -0800
+ Commit: 1e2fab2, github.com/apache/spark/pull/4463
+
+ [SPARK-4905][STREAMING] FlumeStreamSuite fix.
+ Hari Shreedharan <hshreedharan@apache.org>
+ 2015-02-09 14:17:14 -0800
+ Commit: 18c5a99, github.com/apache/spark/pull/4371
+
+ [SPARK-5691] Fixing wrong data structure lookup for dupe app registratio...
+ mcheah <mcheah@palantir.com>
+ 2015-02-09 13:20:14 -0800
+ Commit: 6a0144c, github.com/apache/spark/pull/4477
+
+ [SPARK-5678] Convert DataFrame to pandas.DataFrame and Series
+ Davies Liu <davies@databricks.com>
+ 2015-02-09 11:42:52 -0800
+ Commit: 43972b5, github.com/apache/spark/pull/4476
+
+ [SPARK-5664][BUILD] Restore stty settings when exiting from SBT's spark-shell
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-02-09 11:45:12 -0800
+ Commit: fa67877, github.com/apache/spark/pull/4451
+
+ SPARK-4267 [YARN] Failing to launch jobs on Spark on YARN with Hadoop 2.5.0 or later
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-09 10:33:57 -0800
+ Commit: c88d4ab, github.com/apache/spark/pull/4452
+
+ [SPARK-5473] [EC2] Expose SSH failures after status checks pass
+ Nicholas Chammas <nicholas.chammas@gmail.com>
+ 2015-02-09 09:44:53 +0000
+ Commit: f2aa7b7, github.com/apache/spark/pull/4262
+
+ [SPARK-5539][MLLIB] LDA guide
+ Xiangrui Meng <meng@databricks.com>, Joseph K. Bradley <joseph@databricks.com>
+ 2015-02-08 23:40:36 -0800
+ Commit: 5782ee2, github.com/apache/spark/pull/4465
+
+ [SPARK-5472][SQL] Fix Scala code style
+ Hung Lin <hung@zoomdata.com>
+ 2015-02-08 22:36:42 -0800
+ Commit: 955f286, github.com/apache/spark/pull/4464
+
+ SPARK-4405 [MLLIB] Matrices.* construction methods should check for rows x cols overflow
+ Sean Owen <sowen@cloudera.com>
+ 2015-02-08 21:08:50 -0800
+ Commit: fa8ea48, github.com/apache/spark/pull/4461
+
+ [SPARK-5660][MLLIB] Make Matrix apply public
+ Joseph K. Bradley <joseph@databricks.com>, Xiangrui Meng <meng@databricks.com>
+ 2015-02-08 21:07:36 -0800
+ Commit: df9b105, github.com/apache/spark/pull/4447
+
+ [SPARK-5643][SQL] Add a show method to print the content of a DataFrame in tabular format.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-08 18:56:51 -0800
+ Commit: e1996aa, github.com/apache/spark/pull/4416
+
+ SPARK-5665 [DOCS] Update netlib-java documentation
+ Sam Halliday <sam.halliday@Gmail.com>, Sam Halliday <sam.halliday@gmail.com>
+ 2015-02-08 16:34:26 -0800
+ Commit: c515634, github.com/apache/spark/pull/4448
+
+ [SPARK-5598][MLLIB] model save/load for ALS
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-08 16:26:20 -0800
+ Commit: 9e4d58f, github.com/apache/spark/pull/4422
+
+ [SQL] Set sessionState in QueryExecution.
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-08 14:55:07 -0800
+ Commit: 42c56b6, github.com/apache/spark/pull/4445
+
+ [SPARK-3039] [BUILD] Spark assembly for new hadoop API (hadoop 2) contai...
+ medale <medale94@yahoo.com>
+ 2015-02-08 10:35:29 +0000
+ Commit: bc55e20, github.com/apache/spark/pull/4315
+
+ [SPARK-5672][Web UI] Don't return `ERROR 500` when have missing args
+ Kirill A. Korinskiy <catap@catap.ru>
+ 2015-02-08 10:31:46 +0000
+ Commit: 96010fa, github.com/apache/spark/pull/4239
+
+ [SPARK-5671] Upgrade jets3t to 0.9.2 in hadoop-2.3 and 2.4 profiles
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-02-07 17:19:08 -0800
+ Commit: 0f9d765, github.com/apache/spark/pull/4454
+
+ [SPARK-5108][BUILD] Jackson dependency management for Hadoop-2.6.0 support
+ Zhan Zhang <zhazhan@gmail.com>
+ 2015-02-07 19:41:30 +0000
+ Commit: 51fbca4, github.com/apache/spark/pull/3938
+
+ [BUILD] Add the ability to launch spark-shell from SBT.
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-07 00:14:38 -0800
+ Commit: 6bda169, github.com/apache/spark/pull/4438
+
+ [SPARK-5388] Provide a stable application submission gateway for standalone cluster mode
+ Andrew Or <andrew@databricks.com>
+ 2015-02-06 15:57:06 -0800
+ Commit: 6ec0cdc, github.com/apache/spark/pull/4216
+
+ SPARK-5403: Ignore UserKnownHostsFile in SSH calls
+ Grzegorz Dubicki <grzegorz.dubicki@gmail.com>
+ 2015-02-06 15:43:58 -0800
+ Commit: 3d99741, github.com/apache/spark/pull/4196
+
+ [SPARK-5601][MLLIB] make streaming linear algorithms Java-friendly
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-06 15:42:59 -0800
+ Commit: 11b28b9, github.com/apache/spark/pull/4432
+
+ [SQL] [Minor] HiveParquetSuite was disabled by mistake, re-enable them
+ Cheng Lian <lian@databricks.com>
+ 2015-02-06 15:23:42 -0800
+ Commit: 4005802, github.com/apache/spark/pull/4440
+
+ [SQL] Use TestSQLContext in Java tests
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-06 15:11:02 -0800
+ Commit: c950058, github.com/apache/spark/pull/4441
+
+ [SPARK-4994][network]Cleanup removed executors' ShuffleInfo in yarn shuffle service
+ lianhuiwang <lianhuiwang09@gmail.com>
+ 2015-02-06 14:47:52 -0800
+ Commit: af6ddf8, github.com/apache/spark/pull/3828
+
+ [SPARK-5444][Network]Add a retry to deal with the conflict port in netty server.
+ huangzhaowei <carlmartinmax@gmail.com>
+ 2015-02-06 14:35:29 -0800
+ Commit: caca15a, github.com/apache/spark/pull/4240
+
+ [SPARK-4874] [CORE] Collect record count metrics
+ Kostas Sakellis <kostas@cloudera.com>
+ 2015-02-06 14:31:20 -0800
+ Commit: 9fa29a6, github.com/apache/spark/pull/4067
+
+ [HOTFIX] Fix the maven build after adding sqlContext to spark-shell
+ Michael Armbrust <michael@databricks.com>
+ 2015-02-06 14:27:06 -0800
+ Commit: 11dbf71, github.com/apache/spark/pull/4443
+
+ [SPARK-5600] [core] Clean up FsHistoryProvider test, fix app sort order.
+ Marcelo Vanzin <vanzin@cloudera.com>
+ 2015-02-06 14:23:09 -0800
+ Commit: 09feecc, github.com/apache/spark/pull/4370
+
+ SPARK-5633 pyspark saveAsTextFile support for compression codec
+ Vladimir Vladimirov <vladimir.vladimirov@magnetic.com>
+ 2015-02-06 13:55:02 -0800
+ Commit: 1d32341, github.com/apache/spark/pull/4403
+
+ [HOTFIX][MLLIB] fix a compilation error with java 6
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-06 13:52:35 -0800
+ Commit: 87e0f0d, github.com/apache/spark/pull/4442
+
+ [SPARK-4983] Insert waiting time before tagging EC2 instances
+ GenTang <gen.tang86@gmail.com>, Gen TANG <gen.tang86@gmail.com>
+ 2015-02-06 13:27:34 -0800
+ Commit: 2872d83, github.com/apache/spark/pull/3986
+
+ [SPARK-5586][Spark Shell][SQL] Make `sqlContext` available in spark shell
+ OopsOutOfMemory <victorshengli@126.com>
+ 2015-02-06 13:20:10 -0800
+ Commit: 2ef9853, github.com/apache/spark/pull/4387
+
+ [SPARK-5278][SQL] Introduce UnresolvedGetField and complete the check of ambiguous reference to fields
+ Wenchen Fan <cloud0fan@outlook.com>
+ 2015-02-06 13:08:09 -0800
+ Commit: 1b148ad, github.com/apache/spark/pull/4068
+
+ [SQL][Minor] Remove cache keyword in SqlParser
+ wangfei <wangfei1@huawei.com>
+ 2015-02-06 12:42:23 -0800
+ Commit: d822606, github.com/apache/spark/pull/4393
+
+ [SQL][HiveConsole][DOC] HiveConsole `correct hiveconsole imports`
+ OopsOutOfMemory <victorshengli@126.com>
+ 2015-02-06 12:41:28 -0800
+ Commit: 2abaa6e, github.com/apache/spark/pull/4389
+
+ [SPARK-5595][SPARK-5603][SQL] Add a rule to do PreInsert type casting and field renaming and invalidating in memory cache after INSERT
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-06 12:38:07 -0800
+ Commit: 3c34d62, github.com/apache/spark/pull/4373
+
+ [SPARK-5324][SQL] Results of describe can't be queried
+ OopsOutOfMemory <victorshengli@126.com>, Sheng, Li <OopsOutOfMemory@users.noreply.github.com>
+ 2015-02-06 12:33:20 -0800
+ Commit: 0fc35da, github.com/apache/spark/pull/4249
+
+ [SPARK-5619][SQL] Support 'show roles' in HiveContext
+ q00251598 <qiyadong@huawei.com>
+ 2015-02-06 12:29:26 -0800
+ Commit: cc66a3c, github.com/apache/spark/pull/4397
+
+ [SPARK-5640] Synchronize ScalaReflection where necessary
+ Tobias Schlatter <tobias@meisch.ch>
+ 2015-02-06 12:15:02 -0800
+ Commit: 779e28b, github.com/apache/spark/pull/4431
+
+ [SPARK-5650][SQL] Support optional 'FROM' clause
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-02-06 12:13:44 -0800
+ Commit: 921121d, github.com/apache/spark/pull/4426
+
+ [SPARK-5628] Add version option to spark-ec2
+ Nicholas Chammas <nicholas.chammas@gmail.com>
+ 2015-02-06 12:08:22 -0800
+ Commit: ab0ffde, github.com/apache/spark/pull/4414
+
+ [SPARK-2945][YARN][Doc]add doc for spark.executor.instances
+ WangTaoTheTonic <wangtao111@huawei.com>
+ 2015-02-06 11:57:02 -0800
+ Commit: 540f474, github.com/apache/spark/pull/4350
+
+ [SPARK-4361][Doc] Add more docs for Hadoop Configuration
+ zsxwing <zsxwing@gmail.com>
+ 2015-02-06 11:50:20 -0800
+ Commit: 528dd34, github.com/apache/spark/pull/3225
+
+ [HOTFIX] Fix test build break in ExecutorAllocationManagerSuite.
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-02-06 11:47:32 -0800
+ Commit: 9e828f4
+
+ [SPARK-5652][Mllib] Use broadcasted weights in LogisticRegressionModel
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-02-06 11:22:11 -0800
+ Commit: 6fda4c1, github.com/apache/spark/pull/4429
+
+ [SPARK-5555] Enable UISeleniumSuite tests
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-02-06 11:14:58 -0800
+ Commit: 93fee7b, github.com/apache/spark/pull/4334
+
+ SPARK-2450 Adds executor log links to Web UI
+ Kostas Sakellis <kostas@cloudera.com>, Josh Rosen <joshrosen@databricks.com>
+ 2015-02-06 11:13:00 -0800
+ Commit: e74dd04, github.com/apache/spark/pull/3486
+
+ [SPARK-5618][Spark Core][Minor] Optimise utility code.
+ Makoto Fukuhara <fukuo33@gmail.com>
+ 2015-02-06 11:11:38 -0800
+ Commit: 3feb798, github.com/apache/spark/pull/4396
+
+ [SPARK-5593][Core]Replace BlockManagerListener with ExecutorListener in ExecutorAllocationListener
+ lianhuiwang <lianhuiwang09@gmail.com>
+ 2015-02-06 11:09:37 -0800
+ Commit: 9387dc1, github.com/apache/spark/pull/4369
+
+ [SPARK-4877] Allow user first classes to extend classes in the parent.
+ Stephen Haberman <stephen@exigencecorp.com>
+ 2015-02-06 11:03:56 -0800
+ Commit: 52386cf, github.com/apache/spark/pull/3725
+
+ [SPARK-5396] Syntax error in spark scripts on windows.
+ Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
+ 2015-02-06 10:58:26 -0800
+ Commit: 2dc94cd, github.com/apache/spark/pull/4428
+
+ [SPARK-5636] Ramp up faster in dynamic allocation
+ Andrew Or <andrew@databricks.com>
+ 2015-02-06 10:54:23 -0800
+ Commit: 0a90305, github.com/apache/spark/pull/4409
+
+ SPARK-4337. [YARN] Add ability to cancel pending requests
+ Sandy Ryza <sandy@cloudera.com>
+ 2015-02-06 10:53:16 -0800
+ Commit: 1568391, github.com/apache/spark/pull/4141
+
+ [SPARK-5416] init Executor.threadPool before ExecutorSource
+ Ryan Williams <ryan.blake.williams@gmail.com>
+ 2015-02-06 12:22:25 +0000
+ Commit: f9bc4ef, github.com/apache/spark/pull/4212
+
+ [Build] Set all Debian package permissions to 755
+ Nicholas Chammas <nicholas.chammas@gmail.com>
+ 2015-02-06 11:38:39 +0000
+ Commit: 3638216, github.com/apache/spark/pull/4277
+
+ Update ec2-scripts.md
+ Miguel Peralvo <miguel.peralvo@gmail.com>
+ 2015-02-06 11:04:48 +0000
+ Commit: f6613fc, github.com/apache/spark/pull/4300
+
+ [SPARK-5470][Core]use defaultClassLoader to load classes in KryoSerializer
+ lianhuiwang <lianhuiwang09@gmail.com>
+ 2015-02-06 11:00:35 +0000
+ Commit: 8007a4f, github.com/apache/spark/pull/4258
+
+ [SPARK-5653][YARN] In ApplicationMaster rename isDriver to isClusterMode
+ lianhuiwang <lianhuiwang09@gmail.com>
+ 2015-02-06 10:48:31 -0800
+ Commit: 4ff8855, github.com/apache/spark/pull/4430
+
+ [SPARK-5582] [history] Ignore empty log directories.
+ Marcelo Vanzin <vanzin@cloudera.com>
+ 2015-02-06 10:07:20 +0000
+ Commit: faccdcb, github.com/apache/spark/pull/4352
+
+ [SPARK-5157][YARN] Configure more JVM options properly when we use ConcMarkSweepGC for AM.
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2015-02-06 09:39:12 +0000
+ Commit: 25d8044, github.com/apache/spark/pull/3956
+
+ [Minor] Remove permission for execution from spark-shell.cmd
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2015-02-06 09:33:36 +0000
+ Commit: 7c54681, github.com/apache/spark/pull/3983
+
+ [SPARK-5380][GraphX] Solve an ArrayIndexOutOfBoundsException when build graph with a file format error
+ Leolh <leosandylh@gmail.com>
+ 2015-02-06 09:01:53 +0000
+ Commit: ffdb2e9, github.com/apache/spark/pull/4176
+
+ [SPARK-5013] [MLlib] Added documentation and sample data file for GaussianMixture
+ Travis Galoppo <tjg2107@columbia.edu>
+ 2015-02-06 10:26:51 -0800
+ Commit: f408db6, github.com/apache/spark/pull/4401
+
+ [SPARK-4789] [SPARK-4942] [SPARK-5031] [mllib] Standardize ML Prediction APIs
+ Joseph K. Bradley <joseph@databricks.com>
+ 2015-02-05 23:43:47 -0800
+ Commit: 45b95e7, github.com/apache/spark/pull/3637
+
+ [SPARK-5604][MLLIB] remove checkpointDir from trees
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-05 23:32:09 -0800
+ Commit: c35a11e, github.com/apache/spark/pull/4407
+
+ [SPARK-5639][SQL] Support DataFrame.renameColumn.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-05 23:02:40 -0800
+ Commit: 0639d3e, github.com/apache/spark/pull/4410
+
+ Revert "SPARK-5607: Update to Kryo 2.24.0 to avoid including objenesis 1.2."
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-05 18:37:55 -0800
+ Commit: 6d31531
+
+ SPARK-5557: Explicitly include servlet API in dependencies.
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-05 18:14:54 -0800
+ Commit: 34131fd, github.com/apache/spark/pull/4411
+
+ [HOTFIX] [SQL] Disables Metastore Parquet table conversion for "SQLQuerySuite.CTAS with serde"
+ Cheng Lian <lian@databricks.com>
+ 2015-02-05 18:09:18 -0800
+ Commit: ce6d8bb, github.com/apache/spark/pull/4413
+
+ [SPARK-5638][SQL] Add a config flag to disable eager analysis of DataFrames
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-05 18:07:10 -0800
+ Commit: 4fd67e4, github.com/apache/spark/pull/4408
+
+ [SPARK-5620][DOC] group methods in generated unidoc
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-05 16:26:51 -0800
+ Commit: e2be79d, github.com/apache/spark/pull/4404
+
+ [SPARK-5182] [SPARK-5528] [SPARK-5509] [SPARK-3575] [SQL] Parquet data source improvements
+ Cheng Lian <lian@databricks.com>
+ 2015-02-05 15:29:56 -0800
+ Commit: 50c48eb, github.com/apache/spark/pull/4308
+
+ [SPARK-5604][MLLIB] remove checkpointDir from LDA
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-05 15:07:33 -0800
+ Commit: 59798cb, github.com/apache/spark/pull/4390
+
+ [SPARK-5460][MLlib] Wrapped `Try` around `deleteAllCheckpoints` - RandomForest.
+ x1- <viva008@gmail.com>
+ 2015-02-05 15:02:04 -0800
+ Commit: 44768f5, github.com/apache/spark/pull/4347
+
+ [SPARK-5135][SQL] Add support for describe table to DDL in SQLContext
+ OopsOutOfMemory <victorshengli@126.com>
+ 2015-02-05 13:07:48 -0800
+ Commit: 55cebcf, github.com/apache/spark/pull/4227
+
+ [SPARK-5617][SQL] fix test failure of SQLQuerySuite
+ wangfei <wangfei1@huawei.com>
+ 2015-02-05 12:44:12 -0800
+ Commit: 785a2e3, github.com/apache/spark/pull/4395
+
+ [Branch-1.3] [DOC] doc fix for date
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2015-02-05 12:42:27 -0800
+ Commit: 17ef7f9, github.com/apache/spark/pull/4400
+
+ [SPARK-5474][Build]curl should support URL redirection in build/mvn
+ GuoQiang Li <witgo@qq.com>
+ 2015-02-05 12:03:13 -0800
+ Commit: d1066e9, github.com/apache/spark/pull/4263
+
+ [HOTFIX] MLlib build break.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-05 00:42:50 -0800
+ Commit: c83d118
+
+ SPARK-5548: Fixed a race condition in AkkaUtilsSuite
+ Jacek Lewandowski <lewandowski.jacek@gmail.com>
+ 2015-02-05 12:00:04 -0800
+ Commit: fba2dc6, github.com/apache/spark/pull/4343
+
+ [SPARK-5608] Improve SEO of Spark documentation pages
+ Matei Zaharia <matei@databricks.com>
+ 2015-02-05 11:12:50 -0800
+ Commit: de112a2, github.com/apache/spark/pull/4381
+
+ SPARK-4687. Add a recursive option to the addFile API
+ Sandy Ryza <sandy@cloudera.com>
+ 2015-02-05 10:15:55 -0800
+ Commit: c22ccc0, github.com/apache/spark/pull/3670
+
+ [MLlib] Minor: UDF style update.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-04 23:57:53 -0800
+ Commit: 4074674, github.com/apache/spark/pull/4388
+
+ [SPARK-5612][SQL] Move DataFrame implicit functions into SQLContext.implicits.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-04 23:44:34 -0800
+ Commit: 0040b61, github.com/apache/spark/pull/4386
+
+ [SPARK-5606][SQL] Support plus sign in HiveContext
+ q00251598 <qiyadong@huawei.com>
+ 2015-02-04 23:16:01 -0800
+ Commit: bf43781, github.com/apache/spark/pull/4378
+
+ [SPARK-5599] Check MLlib public APIs for 1.3
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-04 23:03:47 -0800
+ Commit: abc184e, github.com/apache/spark/pull/4377
+
+ [SPARK-5596] [mllib] ML model import/export for GLMs, NaiveBayes
+ Joseph K. Bradley <joseph@databricks.com>
+ 2015-02-04 22:46:48 -0800
+ Commit: 885bcbb, github.com/apache/spark/pull/4233
+
+ SPARK-5607: Update to Kryo 2.24.0 to avoid including objenesis 1.2.
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-04 22:39:44 -0800
+ Commit: 59fb5c7, github.com/apache/spark/pull/4383
+
+ [SPARK-5602][SQL] Better support for creating DataFrame from local data collection
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-04 19:53:57 -0800
+ Commit: b8f9c00, github.com/apache/spark/pull/4372
+
+ [SPARK-5538][SQL] Fix flaky CachedTableSuite
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-04 19:52:41 -0800
+ Commit: 1901b19, github.com/apache/spark/pull/4379
+
+ [SQL][DataFrame] Minor cleanup.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-04 19:51:48 -0800
+ Commit: f05bfa6, github.com/apache/spark/pull/4374
+
+ [SPARK-4520] [SQL] This pr fixes the ArrayIndexOutOfBoundsException as r...
+ Sadhan Sood <sadhan@tellapart.com>
+ 2015-02-04 19:18:06 -0800
+ Commit: aa6f4ca, github.com/apache/spark/pull/4148
+
+ [SPARK-5605][SQL][DF] Allow using String to specify column name in DSL aggregate functions
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-04 18:35:51 -0800
+ Commit: 478ee3f, github.com/apache/spark/pull/4376
+
+ [SPARK-5411] Allow SparkListeners to be specified in SparkConf and loaded when creating SparkContext
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-02-04 17:18:03 -0800
+ Commit: 47e4d57, github.com/apache/spark/pull/4111
+
+ [SPARK-5577] Python udf for DataFrame
+ Davies Liu <davies@databricks.com>
+ 2015-02-04 15:55:09 -0800
+ Commit: dc9ead9, github.com/apache/spark/pull/4351
+
+ [SPARK-5118][SQL] Fix: create table test stored as parquet as select ..
+ guowei2 <guowei2@asiainfo.com>
+ 2015-02-04 15:26:10 -0800
+ Commit: 06da868, github.com/apache/spark/pull/3921
+
+ [SQL] Use HiveContext's sessionState in HiveMetastoreCatalog.hiveDefaultTableFilePath
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-04 15:22:40 -0800
+ Commit: cb4c3e5, github.com/apache/spark/pull/4355
+
+ [SQL] Correct the default size of TimestampType and expose NumericType
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-04 15:14:49 -0800
+ Commit: 513bb2c, github.com/apache/spark/pull/4314
+
+ [SQL][Hiveconsole] Bring hive console code up to date and update README.md
+ OopsOutOfMemory <victorshengli@126.com>, Sheng, Li <OopsOutOfMemory@users.noreply.github.com>
+ 2015-02-04 15:13:54 -0800
+ Commit: 2cdcfe3, github.com/apache/spark/pull/4330
+
+ [SPARK-5367][SQL] Support star expression in udfs
+ wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
+ 2015-02-04 15:12:07 -0800
+ Commit: 8b803f6, github.com/apache/spark/pull/4353
+
+ [SPARK-5426][SQL] Add SparkSQL Java API helper methods.
+ kul <kuldeep.bora@gmail.com>
+ 2015-02-04 15:08:37 -0800
+ Commit: 38ab92e, github.com/apache/spark/pull/4243
+
+ [SPARK-5587][SQL] Support change database owner
+ wangfei <wangfei1@huawei.com>
+ 2015-02-04 14:35:12 -0800
+ Commit: 7920791, github.com/apache/spark/pull/4357
+
+ [SPARK-5591][SQL] Fix NoSuchObjectException for CTAS
+ wangfei <wangfei1@huawei.com>
+ 2015-02-04 14:33:07 -0800
+ Commit: c79dd1e, github.com/apache/spark/pull/4365
+
+ [SPARK-4939] move to next locality when no pending tasks
+ Davies Liu <davies@databricks.com>
+ 2015-02-04 14:22:07 -0800
+ Commit: f9bb3cb, github.com/apache/spark/pull/3779
+
+ [SPARK-4707][STREAMING] Reliable Kafka Receiver can lose data if the blo...
+ Hari Shreedharan <hshreedharan@apache.org>
+ 2015-02-04 14:20:44 -0800
+ Commit: 14c9f32, github.com/apache/spark/pull/3655
+
+ [SPARK-4964] [Streaming] Exactly-once semantics for Kafka
+ cody koeninger <cody@koeninger.org>
+ 2015-02-04 12:06:34 -0800
+ Commit: a119cae, github.com/apache/spark/pull/3798
+
+ [SPARK-5588] [SQL] support select/filter by SQL expression
+ Davies Liu <davies@databricks.com>
+ 2015-02-04 11:34:46 -0800
+ Commit: 950a0d3, github.com/apache/spark/pull/4359
+
+ [SPARK-5585] Flaky test in MLlib python
+ Davies Liu <davies@databricks.com>
+ 2015-02-04 08:54:20 -0800
+ Commit: 84c6273, github.com/apache/spark/pull/4358
+
+ [SPARK-5574] use given name prefix in dir
+ Imran Rashid <irashid@cloudera.com>
+ 2015-02-04 01:02:20 -0800
+ Commit: 5d9278a, github.com/apache/spark/pull/4344
+
+ [Minor] Fix incorrect warning log
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-02-04 00:52:41 -0800
+ Commit: 316a4bb, github.com/apache/spark/pull/4360
+
+ [SPARK-5379][Streaming] Add awaitTerminationOrTimeout
+ zsxwing <zsxwing@gmail.com>
+ 2015-02-04 00:40:28 -0800
+ Commit: 4d3dbfd, github.com/apache/spark/pull/4171
+
+ [SPARK-5341] Use maven coordinates as dependencies in spark-shell and spark-submit
+ Burak Yavuz <brkyvz@gmail.com>
+ 2015-02-03 22:39:17 -0800
+ Commit: 3b7acd2, github.com/apache/spark/pull/4215
+
+ [SPARK-4939] revive offers periodically in LocalBackend
+ Davies Liu <davies@databricks.com>
+ 2015-02-03 22:30:23 -0800
+ Commit: e196da8, github.com/apache/spark/pull/4147
+
+ [SPARK-4969][STREAMING][PYTHON] Add binaryRecords to streaming
+ freeman <the.freeman.lab@gmail.com>
+ 2015-02-03 22:24:30 -0800
+ Commit: 9a33f89, github.com/apache/spark/pull/3803
+
+ [SPARK-5579][SQL][DataFrame] Support for project/filter using SQL expressions
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-03 22:15:35 -0800
+ Commit: cb7f783, github.com/apache/spark/pull/4348
+
+ [FIX][MLLIB] fix seed handling in Python GMM
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-03 20:39:11 -0800
+ Commit: 679228b, github.com/apache/spark/pull/4349
+
+ [SPARK-4795][Core] Redesign the "primitive type => Writable" implicit APIs to make them be activated automatically
+ zsxwing <zsxwing@gmail.com>
+ 2015-02-03 20:17:12 -0800
+ Commit: 5c63e05, github.com/apache/spark/pull/3642
+
+ [SPARK-5578][SQL][DataFrame] Provide a convenient way for Scala users to use UDFs
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-03 20:07:46 -0800
+ Commit: b22d5b5, github.com/apache/spark/pull/4345
+
+ [SPARK-5520][MLlib] Make FP-Growth implementation take generic item types (WIP)
+ Jacky Li <jacky.likun@huawei.com>, Jacky Li <jackylk@users.noreply.github.com>, Xiangrui Meng <meng@databricks.com>
+ 2015-02-03 17:02:42 -0800
+ Commit: 298ef5b, github.com/apache/spark/pull/4340
+
+ [SPARK-5554] [SQL] [PySpark] add more tests for DataFrame Python API
+ Davies Liu <davies@databricks.com>
+ 2015-02-03 16:01:56 -0800
+ Commit: 4640623, github.com/apache/spark/pull/4331
+
+ [STREAMING] SPARK-4986 Wait for receivers to deregister and receiver job to terminate
+ Jesper Lundgren <jesper.lundgren@vpon.com>
+ 2015-02-03 14:53:39 -0800
+ Commit: 092d4ba, github.com/apache/spark/pull/4338
+
+ [SPARK-5153][Streaming][Test] Increased timeout to deal with flaky KafkaStreamSuite
+ Tathagata Das <tathagata.das1565@gmail.com>
+ 2015-02-03 13:46:02 -0800
+ Commit: d644bd9, github.com/apache/spark/pull/4342
+
+ [SPARK-4508] [SQL] build native date type to conform behavior to Hive
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2015-02-03 12:21:45 -0800
+ Commit: 6e244cf, github.com/apache/spark/pull/4325
+
+ [SPARK-5383][SQL] Support alias for udtfs
+ wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
+ 2015-02-03 12:16:31 -0800
+ Commit: 5dbeb21, github.com/apache/spark/pull/4186
+
+ [SPARK-5550] [SQL] Support the case insensitive for UDF
+ Cheng Hao <hao.cheng@intel.com>
+ 2015-02-03 12:12:26 -0800
+ Commit: 654c992, github.com/apache/spark/pull/4326
+
+ [SPARK-4987] [SQL] parquet timestamp type support
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2015-02-03 12:06:06 -0800
+ Commit: 67d5220, github.com/apache/spark/pull/3820
+
+ [SQL] DataFrame API update
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-03 10:34:56 -0800
+ Commit: 4204a12, github.com/apache/spark/pull/4332
+
+ Minor: Fix TaskContext deprecated annotations.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-03 10:34:16 -0800
+ Commit: f7948f3, github.com/apache/spark/pull/4333
+
+ [SPARK-5549] Define TaskContext interface in Scala.
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-03 00:46:04 -0800
+ Commit: bebf4c4, github.com/apache/spark/pull/4324
+
+ [SPARK-5551][SQL] Create type alias for SchemaRDD for source backward compatibility
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-03 00:29:23 -0800
+ Commit: 523a935, github.com/apache/spark/pull/4327
+
+ [SQL][DataFrame] Remove DataFrameApi, ExpressionApi, and GroupedDataFrameApi
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-03 00:29:04 -0800
+ Commit: 37df330, github.com/apache/spark/pull/4328
+
+ [minor] update streaming linear algorithms
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-03 00:14:43 -0800
+ Commit: 659329f, github.com/apache/spark/pull/4329
+
+ [SPARK-1405] [mllib] Latent Dirichlet Allocation (LDA) using EM
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-02 23:57:35 -0800
+ Commit: 980764f, github.com/apache/spark/pull/2388
+
+ [SPARK-5536] replace old ALS implementation by the new one
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-02 23:49:09 -0800
+ Commit: 0cc7b88, github.com/apache/spark/pull/4321
+
+ [SPARK-5414] Add SparkFirehoseListener class for consuming all SparkListener events
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-02-02 23:35:07 -0800
+ Commit: b8ebebe, github.com/apache/spark/pull/4210
+
+ [SPARK-5501][SPARK-5420][SQL] Write support for the data source API
+ Yin Huai <yhuai@databricks.com>
+ 2015-02-02 23:30:44 -0800
+ Commit: 13531dd, github.com/apache/spark/pull/4294
+
+ [SPARK-5012][MLLib][PySpark]Python API for Gaussian Mixture Model
+ FlytxtRnD <meethu.mathew@flytxt.com>
+ 2015-02-02 23:04:55 -0800
+ Commit: 50a1a87, github.com/apache/spark/pull/4059
+
+ [SPARK-3778] newAPIHadoopRDD doesn't properly pass credentials for secure hdfs
+ Thomas Graves <tgraves@apache.org>
+ 2015-02-02 22:45:55 -0800
+ Commit: c31c36c, github.com/apache/spark/pull/4292
+
+ [SPARK-4979][MLLIB] Streaming logistic regression
+ freeman <the.freeman.lab@gmail.com>
+ 2015-02-02 22:42:15 -0800
+ Commit: eb0da6c, github.com/apache/spark/pull/4306
+
+ [SPARK-5219][Core] Add locks to avoid scheduling race conditions
+ zsxwing <zsxwing@gmail.com>
+ 2015-02-02 21:42:18 -0800
+ Commit: c306555, github.com/apache/spark/pull/4019
+
+ [Doc] Minor: Fixes several formatting issues
+ Cheng Lian <lian@databricks.com>
+ 2015-02-02 21:14:21 -0800
+ Commit: 60f67e7, github.com/apache/spark/pull/4316
+
+ SPARK-3996: Add jetty servlet and continuations.
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-02 21:01:36 -0800
+ Commit: 7930d2b, github.com/apache/spark/pull/4323
+
+ SPARK-5542: Decouple publishing, packaging, and tagging in release script
+ Patrick Wendell <patrick@databricks.com>, Patrick Wendell <pwendell@gmail.com>
+ 2015-02-02 21:00:30 -0800
+ Commit: 0ef38f5, github.com/apache/spark/pull/4319
+
+ [SPARK-5543][WebUI] Remove unused import JsonUtil from JsonProtocol
+ nemccarthy <nathan@nemccarthy.me>
+ 2015-02-02 20:03:13 -0800
+ Commit: cb39f12, github.com/apache/spark/pull/4320
+
+ [SPARK-5472][SQL] A JDBC data source for Spark SQL.
+ Tor Myklebust <tmyklebu@gmail.com>
+ 2015-02-02 19:50:14 -0800
+ Commit: 8f471a6, github.com/apache/spark/pull/4261
+
+ [SPARK-5512][Mllib] Run the PIC algorithm with initial vector suggested by the PIC paper
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-02-02 19:34:25 -0800
+ Commit: 1bcd465, github.com/apache/spark/pull/4301
+
+ [SPARK-5154] [PySpark] [Streaming] Kafka streaming support in Python
+ Davies Liu <davies@databricks.com>, Tathagata Das <tdas@databricks.com>
+ 2015-02-02 19:16:27 -0800
+ Commit: 0561c45, github.com/apache/spark/pull/3715
+
+ [SQL] Improve DataFrame API error reporting
+ Reynold Xin <rxin@databricks.com>, Davies Liu <davies@databricks.com>
+ 2015-02-02 19:01:47 -0800
+ Commit: 554403f, github.com/apache/spark/pull/4296
+
+ Revert "[SPARK-4508] [SQL] build native date type to conform behavior to Hive"
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-02 17:52:17 -0800
+ Commit: eccb9fb
+
+ Spark 3883: SSL support for HttpServer and Akka
+ Jacek Lewandowski <lewandowski.jacek@gmail.com>, Jacek Lewandowski <jacek.lewandowski@datastax.com>
+ 2015-02-02 17:18:54 -0800
+ Commit: cfea300, github.com/apache/spark/pull/3571
+
+ [SPARK-5540] hide ALS.solveLeastSquares
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-02 17:10:01 -0800
+ Commit: ef65cf0, github.com/apache/spark/pull/4318
+
+ [SPARK-5534] [graphx] Graph getStorageLevel fix
+ Joseph K. Bradley <joseph@databricks.com>
+ 2015-02-02 17:02:29 -0800
+ Commit: f133dec, github.com/apache/spark/pull/4317
+
+ [SPARK-5514] DataFrame.collect should call executeCollect
+ Reynold Xin <rxin@databricks.com>
+ 2015-02-02 16:55:36 -0800
+ Commit: 8aa3cff, github.com/apache/spark/pull/4313
+
+ [SPARK-5195][sql]Update HiveMetastoreCatalog.scala(override the MetastoreRelation's sameresult method only compare databasename and table name)
+ seayi <405078363@qq.com>, Michael Armbrust <michael@databricks.com>
+ 2015-02-02 16:06:52 -0800
+ Commit: dca6faa, github.com/apache/spark/pull/3898
+
+ [SPARK-2309][MLlib] Multinomial Logistic Regression
+ DB Tsai <dbtsai@alpinenow.com>
+ 2015-02-02 15:59:15 -0800
+ Commit: b1aa8fe, github.com/apache/spark/pull/3833
+
+ [SPARK-5513][MLLIB] Add nonnegative option to ml's ALS
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-02 15:55:44 -0800
+ Commit: 46d50f1, github.com/apache/spark/pull/4302
+
+ [SPARK-4508] [SQL] build native date type to conform behavior to Hive
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2015-02-02 15:49:22 -0800
+ Commit: 1646f89, github.com/apache/spark/pull/3732
+
+ SPARK-5500. Document that feeding hadoopFile into a shuffle operation wi...
+ Sandy Ryza <sandy@cloudera.com>
+ 2015-02-02 14:52:46 -0800
+ Commit: 8309349, github.com/apache/spark/pull/4293
+
+ [SPARK-5461] [graphx] Add isCheckpointed, getCheckpointedFiles methods to Graph
+ Joseph K. Bradley <joseph@databricks.com>
+ 2015-02-02 14:34:48 -0800
+ Commit: 842d000, github.com/apache/spark/pull/4253
+
+ SPARK-5425: Use synchronised methods in system properties to create SparkConf
+ Jacek Lewandowski <lewandowski.jacek@gmail.com>
+ 2015-02-02 14:07:19 -0800
+ Commit: 5a55261, github.com/apache/spark/pull/4222
+
+ Disabling Utils.chmod700 for Windows
+ Martin Weindel <martin.weindel@gmail.com>, mweindel <m.weindel@usu-software.de>
+ 2015-02-02 13:46:18 -0800
+ Commit: bff65b5, github.com/apache/spark/pull/4299
+
+ Make sure only owner can read / write to directories created for the job.
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-01-21 14:38:14 -0800
+ Commit: 52f5754
+
+ [HOTFIX] Add jetty references to build for YARN module.
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-02 14:00:14 -0800
+ Commit: 2321dd1
+
+ [SPARK-4631][streaming][FIX] Wait for a receiver to start before publishing test data.
+ Iulian Dragos <jaguarul@gmail.com>
+ 2015-02-02 14:00:33 -0800
+ Commit: e908322, github.com/apache/spark/pull/4270
+
+ [SPARK-5212][SQL] Add support of schema-less, custom field delimiter and SerDe for HiveQL transform
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-02-02 13:53:55 -0800
+ Commit: 683e938, github.com/apache/spark/pull/4014
+
+ [SPARK-5530] Add executor container to executorIdToContainer
+ Xutingjun <1039320815@qq.com>
+ 2015-02-02 12:37:51 -0800
+ Commit: 62a93a1, github.com/apache/spark/pull/4309
+
+ [Docs] Fix Building Spark link text
+ Nicholas Chammas <nicholas.chammas@gmail.com>
+ 2015-02-02 12:33:49 -0800
+ Commit: 3f941b6, github.com/apache/spark/pull/4312
+
+ [SPARK-5173]support python application running on yarn cluster mode
+ lianhuiwang <lianhuiwang09@gmail.com>, Wang Lianhui <lianhuiwang09@gmail.com>
+ 2015-02-02 12:32:28 -0800
+ Commit: f5e6375, github.com/apache/spark/pull/3976
+
+ SPARK-4585. Spark dynamic executor allocation should use minExecutors as...
+ Sandy Ryza <sandy@cloudera.com>
+ 2015-02-02 12:27:08 -0800
+ Commit: b2047b5, github.com/apache/spark/pull/4051
+
+ [MLLIB] SPARK-5491 (ex SPARK-1473): Chi-square feature selection
+ Alexander Ulanov <nashb@yandex.ru>
+ 2015-02-02 12:13:05 -0800
+ Commit: c081b21, github.com/apache/spark/pull/1484
+
+ SPARK-5492. Thread statistics can break with older Hadoop versions
+ Sandy Ryza <sandy@cloudera.com>
+ 2015-02-02 00:54:06 -0800
+ Commit: 6f34131, github.com/apache/spark/pull/4305
+
+ [SPARK-5478][UI][Minor] Add missing right parentheses
+ jerryshao <saisai.shao@intel.com>
+ 2015-02-01 23:56:13 -0800
+ Commit: 63dfe21, github.com/apache/spark/pull/4267
+
+ [SPARK-5353] Log failures in REPL class loading
+ Tobias Schlatter <tobias@meisch.ch>
+ 2015-02-01 21:43:49 -0800
+ Commit: 9f0a6e1, github.com/apache/spark/pull/4130
+
+ [SPARK-3996]: Shade Jetty in Spark deliverables
+ Patrick Wendell <patrick@databricks.com>
+ 2015-02-01 21:13:57 -0800
+ Commit: a15f6e3, github.com/apache/spark/pull/4285
+
+ [SPARK-4001][MLlib] adding parallel FP-Growth algorithm for frequent pattern mining in MLlib
+ Jacky Li <jacky.likun@huawei.com>, Jacky Li <jackylk@users.noreply.github.com>, Xiangrui Meng <meng@databricks.com>
+ 2015-02-01 20:07:25 -0800
+ Commit: 859f724, github.com/apache/spark/pull/2847
+
+ [Spark-5406][MLlib] LocalLAPACK mode in RowMatrix.computeSVD should have much smaller upper bound
+ Yuhao Yang <hhbyyh@gmail.com>
+ 2015-02-01 19:40:26 -0800
+ Commit: d85cd4e, github.com/apache/spark/pull/4200
+
+ [SPARK-5465] [SQL] Fixes filter push-down for Parquet data source
+ Cheng Lian <lian@databricks.com>
+ 2015-02-01 18:52:39 -0800
+ Commit: ec10032, github.com/apache/spark/pull/4255
+
+ [SPARK-5262] [SPARK-5244] [SQL] add coalesce in SQLParser and widen types for parameters of coalesce
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2015-02-01 18:51:38 -0800
+ Commit: 8cf4a1f, github.com/apache/spark/pull/4057
+
+ [SPARK-5196][SQL] Support `comment` in Create Table Field DDL
+ OopsOutOfMemory <victorshengli@126.com>
+ 2015-02-01 18:41:49 -0800
+ Commit: 1b56f1d, github.com/apache/spark/pull/3999
+
+ [SPARK-1825] Make Windows Spark client work fine with Linux YARN cluster
+ Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
+ 2015-02-01 18:26:28 -0800
+ Commit: 7712ed5, github.com/apache/spark/pull/3943
+
+ [SPARK-5176] The thrift server does not support cluster mode
+ Tom Panning <tom.panning@nextcentury.com>
+ 2015-02-01 17:57:31 -0800
+ Commit: 1ca0a10, github.com/apache/spark/pull/4137
+
+ [SPARK-5155] Build fails with spark-ganglia-lgpl profile
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2015-02-01 17:53:56 -0800
+ Commit: c80194b, github.com/apache/spark/pull/4303
+
+ [Minor][SQL] Little refactor DataFrame related codes
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-02-01 17:52:18 -0800
+ Commit: ef89b82, github.com/apache/spark/pull/4298
+
+ [SPARK-4859][Core][Streaming] Refactor LiveListenerBus and StreamingListenerBus
+ zsxwing <zsxwing@gmail.com>
+ 2015-02-01 17:47:51 -0800
+ Commit: 883bc88, github.com/apache/spark/pull/4006
+
+ [SPARK-5424][MLLIB] make the new ALS impl take generic ID types
+ Xiangrui Meng <meng@databricks.com>
+ 2015-02-01 14:13:31 -0800
+ Commit: 4a17122, github.com/apache/spark/pull/4281
+
+ [SPARK-5207] [MLLIB] StandardScalerModel mean and variance re-use
+ Octavian Geagla <ogeagla@gmail.com>
+ 2015-02-01 09:21:14 -0800
+ Commit: bdb0680, github.com/apache/spark/pull/4140
+
+ [SPARK-5422] Add support for sending Graphite metrics via UDP
+ Ryan Williams <ryan.blake.williams@gmail.com>
+ 2015-01-31 23:41:05 -0800
+ Commit: 80bd715, github.com/apache/spark/pull/4218
+
+ SPARK-3359 [CORE] [DOCS] `sbt/sbt unidoc` doesn't work with Java 8
+ Sean Owen <sowen@cloudera.com>
+ 2015-01-31 10:40:42 -0800
+ Commit: c84d5a1, github.com/apache/spark/pull/4193
+
+ [SPARK-3975] Added support for BlockMatrix addition and multiplication
+ Burak Yavuz <brkyvz@gmail.com>, Burak Yavuz <brkyvz@dn51t42l.sunet>, Burak Yavuz <brkyvz@dn51t4rd.sunet>, Burak Yavuz <brkyvz@dn0a221430.sunet>, Burak Yavuz <brkyvz@dn0a22b17d.sunet>
+ 2015-01-31 00:47:30 -0800
+ Commit: ef8974b, github.com/apache/spark/pull/4274
+
+ [MLLIB][SPARK-3278] Monotone (Isotonic) regression using parallel pool adjacent violators algorithm
+ martinzapletal <zapletal-martin@email.cz>, Xiangrui Meng <meng@databricks.com>, Martin Zapletal <zapletal-martin@email.cz>
+ 2015-01-31 00:46:02 -0800
+ Commit: 34250a6, github.com/apache/spark/pull/3519
+
+ [SPARK-5307] Add a config option for SerializationDebugger.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-31 00:06:36 -0800
+ Commit: 6364083, github.com/apache/spark/pull/4297
+
+ [SQL] remove redundant field "childOutput" from execution.Aggregate, use child.output instead
+ kai <kaizeng@eecs.berkeley.edu>
+ 2015-01-30 23:19:10 -0800
+ Commit: f54c9f6, github.com/apache/spark/pull/4291
+
+ [SPARK-5307] SerializationDebugger
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-30 22:34:10 -0800
+ Commit: 740a568, github.com/apache/spark/pull/4098
+
+ [SPARK-5504] [sql] convertToCatalyst should support nested arrays
+ Joseph K. Bradley <joseph@databricks.com>
+ 2015-01-30 15:40:14 -0800
+ Commit: e643de4, github.com/apache/spark/pull/4295
+
+ SPARK-5400 [MLlib] Changed name of GaussianMixtureEM to GaussianMixture
+ Travis Galoppo <tjg2107@columbia.edu>
+ 2015-01-30 15:32:25 -0800
+ Commit: 9869773, github.com/apache/spark/pull/4290
+
+ [SPARK-4259][MLlib]: Add Power Iteration Clustering Algorithm with Gaussian Similarity Function
+ sboeschhuawei <stephen.boesch@huawei.com>, Fan Jiang <fanjiang.sc@huawei.com>, Jiang Fan <fjiang6@gmail.com>, Stephen Boesch <stephen.boesch@huawei.com>, Xiangrui Meng <meng@databricks.com>
+ 2015-01-30 14:09:49 -0800
+ Commit: f377431, github.com/apache/spark/pull/4254
+
+ [SPARK-5486] Added validate method to BlockMatrix
+ Burak Yavuz <brkyvz@gmail.com>
+ 2015-01-30 13:59:10 -0800
+ Commit: 6ee8338, github.com/apache/spark/pull/4279
+
+ [SPARK-5496][MLLIB] Allow both classification and Classification in Algo for trees.
+ Xiangrui Meng <meng@databricks.com>
+ 2015-01-30 10:08:07 -0800
+ Commit: 0a95085, github.com/apache/spark/pull/4287
+
+ [MLLIB] SPARK-4846: throw a RuntimeException and give users hints to increase the minCount
+ Joseph J.C. Tang <jinntrance@gmail.com>
+ 2015-01-30 10:07:26 -0800
+ Commit: 54d9575, github.com/apache/spark/pull/4247
+
+ SPARK-5393. Flood of util.RackResolver log messages after SPARK-1714
+ Sandy Ryza <sandy@cloudera.com>
+ 2015-01-30 11:31:54 -0600
+ Commit: 254eaa4, github.com/apache/spark/pull/4192
+
+ [SPARK-5457][SQL] Add missing DSL for ApproxCountDistinct.
+ Takuya UESHIN <ueshin@happy-camper.st>
+ 2015-01-30 01:21:35 -0800
+ Commit: 6f21dce, github.com/apache/spark/pull/4250
+
+ [SPARK-5094][MLlib] Add Python API for Gradient Boosted Trees
+ Kazuki Taniguchi <kazuki.t.1018@gmail.com>
+ 2015-01-30 00:39:44 -0800
+ Commit: bc1fc9b, github.com/apache/spark/pull/3951
+
+ [SPARK-5322] Added transpose functionality to BlockMatrix
+ Burak Yavuz <brkyvz@gmail.com>
+ 2015-01-29 21:26:29 -0800
+ Commit: dd4d84c, github.com/apache/spark/pull/4275
+
+ [SQL] Support df("*") to select all columns in a data frame.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-29 19:09:08 -0800
+ Commit: 80def9d, github.com/apache/spark/pull/4283
+
+ [SPARK-5462] [SQL] Use analyzed query plan in DataFrame.apply()
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-01-29 18:23:05 -0800
+ Commit: 22271f9, github.com/apache/spark/pull/4282
+
+ [SPARK-5395] [PySpark] fix python process leak while coalesce()
+ Davies Liu <davies@databricks.com>
+ 2015-01-29 17:28:37 -0800
+ Commit: 5c746ee, github.com/apache/spark/pull/4238
+
+ [SQL] DataFrame API improvements
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-29 17:24:00 -0800
+ Commit: ce9c43b, github.com/apache/spark/pull/4280
+
+ Revert "[WIP] [SPARK-3996]: Shade Jetty in Spark deliverables"
+ Patrick Wendell <patrick@databricks.com>
+ 2015-01-29 17:14:27 -0800
+ Commit: d2071e8
+
+ remove 'return'
+ Yoshihiro Shimizu <shimizu@amoad.com>
+ 2015-01-29 16:55:00 -0800
+ Commit: 5338772, github.com/apache/spark/pull/4268
+
+ [WIP] [SPARK-3996]: Shade Jetty in Spark deliverables
+ Patrick Wendell <patrick@databricks.com>
+ 2015-01-29 16:31:19 -0800
+ Commit: f240fe3, github.com/apache/spark/pull/4252
+
+ [SPARK-5464] Fix help() for Python DataFrame instances
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-01-29 16:23:20 -0800
+ Commit: 0bb15f2, github.com/apache/spark/pull/4278
+
+ [SPARK-4296][SQL] Trims aliases when resolving and checking aggregate expressions
+ Yin Huai <yhuai@databricks.com>, Cheng Lian <lian@databricks.com>
+ 2015-01-29 15:49:34 -0800
+ Commit: c00d517, github.com/apache/spark/pull/4010
+
+ [SPARK-5373][SQL] Literal in agg grouping expressions leads to incorrect result
+ wangfei <wangfei1@huawei.com>
+ 2015-01-29 15:47:13 -0800
+ Commit: c1b3eeb, github.com/apache/spark/pull/4169
+
+ [SPARK-5367][SQL] Support star expression in udf
+ wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
+ 2015-01-29 15:44:53 -0800
+ Commit: fbaf9e0, github.com/apache/spark/pull/4163
+
+ [SPARK-4786][SQL]: Parquet filter pushdown for castable types
+ Yash Datta <Yash.Datta@guavus.com>
+ 2015-01-29 15:42:23 -0800
+ Commit: de221ea, github.com/apache/spark/pull/4156
+
+ [SPARK-5309][SQL] Add support for dictionaries in PrimitiveConverter for Strin...
+ Michael Davies <Michael.BellDavies@gmail.com>
+ 2015-01-29 15:40:59 -0800
+ Commit: 940f375, github.com/apache/spark/pull/4187
+
+ [SPARK-5429][SQL] Use javaXML plan serialization for Hive golden answers on Hive 0.13.1
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-01-29 15:28:22 -0800
+ Commit: bce0ba1, github.com/apache/spark/pull/4223
+
+ [SPARK-5445][SQL] Consolidate Java and Scala DSL static methods.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-29 15:13:09 -0800
+ Commit: 7156322, github.com/apache/spark/pull/4276
+
+ [SPARK-5466] Add explicit guava dependencies where needed.
+ Marcelo Vanzin <vanzin@cloudera.com>
+ 2015-01-29 13:00:45 -0800
+ Commit: f9e5694, github.com/apache/spark/pull/4272
+
+ [SPARK-5477] refactor stat.py
+ Xiangrui Meng <meng@databricks.com>
+ 2015-01-29 10:11:44 -0800
+ Commit: a3dc618, github.com/apache/spark/pull/4266
+
+ [SQL] Various DataFrame DSL update.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-29 00:01:10 -0800
+ Commit: 5ad78f6, github.com/apache/spark/pull/4260
+
+ [SPARK-3977] Conversion methods for BlockMatrix to other Distributed Matrices
+ Burak Yavuz <brkyvz@gmail.com>
+ 2015-01-28 23:42:07 -0800
+ Commit: a63be1a, github.com/apache/spark/pull/4256
+
+ [SPARK-5445][SQL] Made DataFrame dsl usable in Java
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-28 19:10:32 -0800
+ Commit: 5b9760d, github.com/apache/spark/pull/4241
+
+ [SPARK-5430] move treeReduce and treeAggregate from mllib to core
+ Xiangrui Meng <meng@databricks.com>
+ 2015-01-28 17:26:03 -0800
+ Commit: 4ee79c7, github.com/apache/spark/pull/4228
+
+ [SPARK-4586][MLLIB] Python API for ML pipeline and parameters
+ Xiangrui Meng <meng@databricks.com>, Davies Liu <davies@databricks.com>
+ 2015-01-28 17:14:23 -0800
+ Commit: e80dc1c, github.com/apache/spark/pull/4151
+
+ [SPARK-5441][pyspark] Make SerDeUtil PairRDD to Python conversions more robust
+ Michael Nazario <mnazario@palantir.com>
+ 2015-01-28 13:55:01 -0800
+ Commit: e023112, github.com/apache/spark/pull/4236
+
+ [SPARK-4387][PySpark] Refactoring python profiling code to make it extensible
+ Yandu Oppacher <yandu.oppacher@jadedpixel.com>, Davies Liu <davies@databricks.com>
+ 2015-01-28 13:48:06 -0800
+ Commit: 3bead67, github.com/apache/spark/pull/3255
+
+ [SPARK-5417] Remove redundant executor-id set() call
+ Ryan Williams <ryan.blake.williams@gmail.com>
+ 2015-01-28 13:04:52 -0800
+ Commit: a731314, github.com/apache/spark/pull/4213
+
+ [SPARK-5434] [EC2] Preserve spaces in EC2 path
+ Nicholas Chammas <nicholas.chammas@gmail.com>
+ 2015-01-28 12:56:03 -0800
+ Commit: d44ee43, github.com/apache/spark/pull/4224
+
+ [SPARK-5437] Fix DriverSuite and SparkSubmitSuite timeout issues
+ Andrew Or <andrew@databricks.com>
+ 2015-01-28 12:52:31 -0800
+ Commit: 84b6ecd, github.com/apache/spark/pull/4230
+
+ [SPARK-4955] With executor dynamic scaling enabled, executors should be added or killed in yarn-cluster mode.
+ lianhuiwang <lianhuiwang09@gmail.com>
+ 2015-01-28 12:50:57 -0800
+ Commit: 81f8f34, github.com/apache/spark/pull/3962
+
+ [SPARK-5440][pyspark] Add toLocalIterator to pyspark rdd
+ Michael Nazario <mnazario@palantir.com>
+ 2015-01-28 12:47:12 -0800
+ Commit: 456c11f, github.com/apache/spark/pull/4237
+
+ SPARK-1934 [CORE] "this" reference escape to "selectorThread" during construction in ConnectionManager
+ Sean Owen <sowen@cloudera.com>
+ 2015-01-28 12:44:35 -0800
+ Commit: 9b18009, github.com/apache/spark/pull/4225
+
+ [SPARK-5188][BUILD] make-distribution.sh should support curl, not only wget to get Tachyon
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2015-01-28 12:43:22 -0800
+ Commit: e902dc4, github.com/apache/spark/pull/3988
+
+ SPARK-5458. Refer to aggregateByKey instead of combineByKey in docs
+ Sandy Ryza <sandy@cloudera.com>
+ 2015-01-28 12:41:23 -0800
+ Commit: 406f6d3, github.com/apache/spark/pull/4251
+
+ [SPARK-5447][SQL] Replaced reference to SchemaRDD with DataFrame.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-28 12:10:01 -0800
+ Commit: c8e934e, github.com/apache/spark/pull/4242
+
+ [SPARK-5361]Multiple Java RDD <-> Python RDD conversions not working correctly
+ Winston Chen <wchen@quid.com>
+ 2015-01-28 11:08:44 -0800
+ Commit: 453d799, github.com/apache/spark/pull/4146
+
+ [SPARK-5291][CORE] Add timestamp and reason why an executor is removed to SparkListenerExecutorAdded and SparkListenerExecutorRemoved
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2015-01-28 11:02:51 -0800
+ Commit: 0b35fcd, github.com/apache/spark/pull/4082
+
+ [SPARK-3974][MLlib] Distributed Block Matrix Abstractions
+ Burak Yavuz <brkyvz@gmail.com>, Xiangrui Meng <meng@databricks.com>, Burak Yavuz <brkyvz@dn51t42l.sunet>, Burak Yavuz <brkyvz@dn51t4rd.sunet>, Burak Yavuz <brkyvz@dn0a221430.sunet>
+ 2015-01-28 10:06:37 -0800
+ Commit: eeb53bf, github.com/apache/spark/pull/3200
+
+ MAINTENANCE: Automated closing of pull requests.
+ Patrick Wendell <patrick@databricks.com>
+ 2015-01-28 02:15:14 -0800
+ Commit: 622ff09, github.com/apache/spark/pull/1480
+
+ [SPARK-5415] bump sbt version to 0.13.7
+ Ryan Williams <ryan.blake.williams@gmail.com>
+ 2015-01-28 02:13:06 -0800
+ Commit: 661d3f9, github.com/apache/spark/pull/4211
+
+ [SPARK-4809] Rework Guava library shading.
+ Marcelo Vanzin <vanzin@cloudera.com>
+ 2015-01-28 00:29:29 -0800
+ Commit: 37a5e27, github.com/apache/spark/pull/3658
+
+ [SPARK-5097][SQL] Test cases for DataFrame expressions.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-27 18:10:49 -0800
+ Commit: d743732, github.com/apache/spark/pull/4235
+
+ [SPARK-5097][SQL] DataFrame
+ Reynold Xin <rxin@databricks.com>, Davies Liu <davies@databricks.com>
+ 2015-01-27 16:08:24 -0800
+ Commit: 119f45d, github.com/apache/spark/pull/4173
+
+ SPARK-5199. FS read metrics should support CombineFileSplits and track bytes from all FSs
+ Sandy Ryza <sandy@cloudera.com>
+ 2015-01-27 15:42:55 -0800
+ Commit: b1b35ca, github.com/apache/spark/pull/4050
+
+ [MLlib] fix python example of ALS in guide
+ Davies Liu <davies@databricks.com>
+ 2015-01-27 15:33:01 -0800
+ Commit: fdaad4e, github.com/apache/spark/pull/4226
+
+ SPARK-5308 [BUILD] MD5 / SHA1 hash format doesn't match standard Maven output
+ Sean Owen <sowen@cloudera.com>
+ 2015-01-27 10:22:50 -0800
+ Commit: ff356e2, github.com/apache/spark/pull/4161
+
+ [SPARK-5321] Support for transposing local matrices
+ Burak Yavuz <brkyvz@gmail.com>
+ 2015-01-27 01:46:17 -0800
+ Commit: 9142674, github.com/apache/spark/pull/4109
+
+ [SPARK-5419][Mllib] Fix the logic in Vectors.sqdist
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-01-27 01:29:14 -0800
+ Commit: 7b0ed79, github.com/apache/spark/pull/4217
+
+ [SPARK-3726] [MLlib] Allow sampling_rate not equal to 1.0 in RandomForests
+ MechCoder <manojkumarsivaraj334@gmail.com>
+ 2015-01-26 19:46:17 -0800
+ Commit: d6894b1, github.com/apache/spark/pull/4073
+
+ [SPARK-5119] java.lang.ArrayIndexOutOfBoundsException on trying to train...
+ lewuathe <lewuathe@me.com>
+ 2015-01-26 18:03:21 -0800
+ Commit: f2ba5c6, github.com/apache/spark/pull/3975
+
+ [SPARK-5052] Add common/base classes to fix guava methods signatures.
+ Elmer Garduno <elmerg@google.com>
+ 2015-01-26 17:40:48 -0800
+ Commit: 661e0fc, github.com/apache/spark/pull/3874
+
+ SPARK-960 [CORE] [TEST] JobCancellationSuite "two jobs sharing the same stage" is broken
+ Sean Owen <sowen@cloudera.com>
+ 2015-01-26 14:32:27 -0800
+ Commit: 0497ea5, github.com/apache/spark/pull/4180
+
+ Fix command spaces issue in make-distribution.sh
+ David Y. Ross <dyross@gmail.com>
+ 2015-01-26 14:26:10 -0800
+ Commit: b38034e, github.com/apache/spark/pull/4126
+
+ SPARK-4147 [CORE] Reduce log4j dependency
+ Sean Owen <sowen@cloudera.com>
+ 2015-01-26 14:23:42 -0800
+ Commit: 54e7b45, github.com/apache/spark/pull/4190
+
+ [SPARK-5339][BUILD] build/mvn doesn't work because of invalid URL for maven's tgz.
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2015-01-26 13:07:49 -0800
+ Commit: c094c73, github.com/apache/spark/pull/4124
+
+ [SPARK-5355] use j.u.c.ConcurrentHashMap instead of TrieMap
+ Davies Liu <davies@databricks.com>
+ 2015-01-26 12:51:32 -0800
+ Commit: 1420931, github.com/apache/spark/pull/4208
+
+ [SPARK-5384][mllib] Vectors.sqdist returns inconsistent results for sparse/dense vectors when the vectors have different lengths
+ Yuhao Yang <hhbyyh@gmail.com>
+ 2015-01-25 22:18:09 -0800
+ Commit: 8125168, github.com/apache/spark/pull/4183
+
+ [SPARK-5268] don't stop CoarseGrainedExecutorBackend for irrelevant DisassociatedEvent
+ CodingCat <zhunansjtu@gmail.com>
+ 2015-01-25 19:28:53 -0800
+ Commit: 8df9435, github.com/apache/spark/pull/4063
+
+ SPARK-4430 [STREAMING] [TEST] Apache RAT Checks fail spuriously on test files
+ Sean Owen <sowen@cloudera.com>
+ 2015-01-25 19:16:44 -0800
+ Commit: 0528b85, github.com/apache/spark/pull/4189
+
+ [SPARK-5326] Show fetch wait time as optional metric in the UI
+ Kay Ousterhout <kayousterhout@gmail.com>
+ 2015-01-25 16:48:26 -0800
+ Commit: fc2168f, github.com/apache/spark/pull/4110
+
+ [SPARK-5344][WebUI] HistoryServer cannot recognize that inprogress file was renamed to completed file
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2015-01-25 15:34:20 -0800
+ Commit: 8f5c827, github.com/apache/spark/pull/4132
+
+ SPARK-4506 [DOCS] Addendum: Update more docs to reflect that standalone works in cluster mode
+ Sean Owen <sowen@cloudera.com>
+ 2015-01-25 15:25:05 -0800
+ Commit: 9f64357, github.com/apache/spark/pull/4160
+
+ SPARK-5382: Use SPARK_CONF_DIR in spark-class if it is defined
+ Jacek Lewandowski <lewandowski.jacek@gmail.com>
+ 2015-01-25 15:15:09 -0800
+ Commit: 1c30afd, github.com/apache/spark/pull/4179
+
+ SPARK-3782 [CORE] Direct use of log4j in AkkaUtils interferes with certain logging configurations
+ Sean Owen <sowen@cloudera.com>
+ 2015-01-25 15:11:57 -0800
+ Commit: 383425a, github.com/apache/spark/pull/4184
+
+ SPARK-3852 [DOCS] Document spark.driver.extra* configs
+ Sean Owen <sowen@cloudera.com>
+ 2015-01-25 15:08:05 -0800
+ Commit: c586b45, github.com/apache/spark/pull/4185
+
+ [SPARK-5402] log executor ID at executor-construction time
+ Ryan Williams <ryan.blake.williams@gmail.com>
+ 2015-01-25 14:20:02 -0800
+ Commit: aea2548, github.com/apache/spark/pull/4195
+
+ [SPARK-5401] set executor ID before creating MetricsSystem
+ Ryan Williams <ryan.blake.williams@gmail.com>
+ 2015-01-25 14:17:59 -0800
+ Commit: 2d9887b, github.com/apache/spark/pull/4194
+
+ Add comment about defaultMinPartitions
+ Idan Zalzberg <idanzalz@gmail.com>
+ 2015-01-25 11:28:05 -0800
+ Commit: 412a58e, github.com/apache/spark/pull/4102
+
+ Closes #4157
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-25 00:24:59 -0800
+ Commit: d22ca1e
+
+ [SPARK-5214][Test] Add a test to demonstrate EventLoop can be stopped in the event thread
+ zsxwing <zsxwing@gmail.com>
+ 2015-01-24 11:00:35 -0800
+ Commit: 0d1e67e, github.com/apache/spark/pull/4174
+
+ [SPARK-5058] Part 2. Typos and broken URL
+ Jongyoul Lee <jongyoul@gmail.com>
+ 2015-01-23 23:34:11 -0800
+ Commit: 09e09c5, github.com/apache/spark/pull/4172
+
+ [SPARK-5351][GraphX] Do not use Partitioner.defaultPartitioner as a partitioner of EdgeRDDImp...
+ Takeshi Yamamuro <linguin.m.s@gmail.com>
+ 2015-01-23 19:25:15 -0800
+ Commit: e224dbb, github.com/apache/spark/pull/4136
+
+ [SPARK-5063] More helpful error messages for several invalid operations
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-01-23 17:53:15 -0800
+ Commit: cef1f09, github.com/apache/spark/pull/3884
+
+ [SPARK-3541][MLLIB] New ALS implementation with improved storage
+ Xiangrui Meng <meng@databricks.com>
+ 2015-01-22 22:09:13 -0800
+ Commit: ea74365, github.com/apache/spark/pull/3720
+
+ [SPARK-5315][Streaming] Fix reduceByWindow Java API not work bug
+ jerryshao <saisai.shao@intel.com>
+ 2015-01-22 22:04:21 -0800
+ Commit: e0f7fb7, github.com/apache/spark/pull/4104
+
+ [SPARK-5233][Streaming] Fix error replaying of WAL introduced bug
+ jerryshao <saisai.shao@intel.com>
+ 2015-01-22 21:58:53 -0800
+ Commit: 3c3fa63, github.com/apache/spark/pull/4032
+
+ SPARK-5370. [YARN] Remove some unnecessary synchronization in YarnAlloca...
+ Sandy Ryza <sandy@cloudera.com>
+ 2015-01-22 13:49:35 -0600
+ Commit: 820ce03, github.com/apache/spark/pull/4164
+
+ [SPARK-5365][MLlib] Refactor KMeans to reduce redundant data
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-01-22 08:16:35 -0800
+ Commit: 246111d, github.com/apache/spark/pull/4159
+
+ [SPARK-5147][Streaming] Delete the received data WAL log periodically
+ Tathagata Das <tathagata.das1565@gmail.com>, jerryshao <saisai.shao@intel.com>
+ 2015-01-21 23:41:44 -0800
+ Commit: 3027f06, github.com/apache/spark/pull/4149
+
+ [SPARK-5317]Set BoostingStrategy.defaultParams With Enumeration Algo.Classification or Algo.Regression
+ Basin <jpsachilles@gmail.com>
+ 2015-01-21 23:06:34 -0800
+ Commit: fcb3e18, github.com/apache/spark/pull/4103
+
+ [SPARK-3424][MLLIB] cache point distances during k-means|| init
+ Xiangrui Meng <meng@databricks.com>
+ 2015-01-21 21:20:31 -0800
+ Commit: ca7910d, github.com/apache/spark/pull/4144
+
+ [SPARK-5202] [SQL] Add hql variable substitution support
+ Cheng Hao <hao.cheng@intel.com>
+ 2015-01-21 17:34:18 -0800
+ Commit: 27bccc5, github.com/apache/spark/pull/4003
+
+ [SPARK-5355] make SparkConf thread-safe
+ Davies Liu <davies@databricks.com>
+ 2015-01-21 16:51:42 -0800
+ Commit: 9bad062, github.com/apache/spark/pull/4143
+
+ [SPARK-4984][CORE][WEBUI] Adding a pop-up containing the full job description when it is very long
+ wangfei <wangfei1@huawei.com>
+ 2015-01-21 15:27:42 -0800
+ Commit: 3be2a88, github.com/apache/spark/pull/3819
+
+ [SQL] [Minor] Remove deprecated parquet tests
+ Cheng Lian <lian@databricks.com>
+ 2015-01-21 14:38:10 -0800
+ Commit: ba19689, github.com/apache/spark/pull/4116
+
+ Revert "[SPARK-5244] [SQL] add coalesce() in sql parser"
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-01-21 14:27:43 -0800
+ Commit: b328ac6
+
+ [SPARK-5009] [SQL] Long keyword support in SQL Parsers
+ Cheng Hao <hao.cheng@intel.com>
+ 2015-01-21 13:05:56 -0800
+ Commit: 8361078, github.com/apache/spark/pull/3926
+
+ [SPARK-5244] [SQL] add coalesce() in sql parser
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2015-01-21 12:59:41 -0800
+ Commit: 812d367, github.com/apache/spark/pull/4040
+
+ [SPARK-5064][GraphX] Add numEdges upperbound validation for R-MAT graph generator to prevent infinite loop
+ Kenji Kikushima <kikushima.kenji@lab.ntt.co.jp>
+ 2015-01-21 12:34:00 -0800
+ Commit: 3ee3ab5, github.com/apache/spark/pull/3950
+
+ [SPARK-4749] [mllib]: Allow initializing KMeans clusters using a seed
+ nate.crosswhite <nate.crosswhite@stresearch.com>, nxwhite-str <nxwhite-str@users.noreply.github.com>, Xiangrui Meng <meng@databricks.com>
+ 2015-01-21 10:32:10 -0800
+ Commit: 7450a99, github.com/apache/spark/pull/3610
+
+ [MLlib] [SPARK-5301] Missing conversions and operations on IndexedRowMatrix and CoordinateMatrix
+ Reza Zadeh <reza@databricks.com>
+ 2015-01-21 09:48:38 -0800
+ Commit: aa1e22b, github.com/apache/spark/pull/4089
+
+ SPARK-1714. Take advantage of AMRMClient APIs to simplify logic in YarnA...
+ Sandy Ryza <sandy@cloudera.com>
+ 2015-01-21 10:31:54 -0600
+ Commit: 2eeada3, github.com/apache/spark/pull/3765
+
+ [SPARK-5336][YARN]spark.executor.cores must not be less than spark.task.cpus
+ WangTao <barneystinson@aliyun.com>, WangTaoTheTonic <barneystinson@aliyun.com>
+ 2015-01-21 09:42:30 -0600
+ Commit: 8c06a5f, github.com/apache/spark/pull/4123
+
+ [SPARK-5297][Streaming] Fix Java file stream type erasure problem
+ jerryshao <saisai.shao@intel.com>
+ 2015-01-20 23:37:47 -0800
+ Commit: 424d8c6, github.com/apache/spark/pull/4101
+
+ [HOTFIX] Update pom.xml to pull MapR's Hadoop version 2.4.1.
+ Kannan Rajah <rkannan82@gmail.com>
+ 2015-01-20 23:34:04 -0800
+ Commit: ec5b0f2, github.com/apache/spark/pull/4108
+
+ [SPARK-5275] [Streaming] include python source code
+ Davies Liu <davies@databricks.com>
+ 2015-01-20 22:44:58 -0800
+ Commit: bad6c57, github.com/apache/spark/pull/4128
+
+ [SPARK-5294][WebUI] Hide tables in AllStagePages for "Active Stages, Completed Stages and Failed Stages" when they are empty
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2015-01-20 16:40:46 -0800
+ Commit: 9a151ce, github.com/apache/spark/pull/4083
+
+ [SPARK-5186] [MLLIB] Vector.equals and Vector.hashCode are very inefficient
+ Yuhao Yang <hhbyyh@gmail.com>, Yuhao Yang <yuhao@yuhaodevbox.sh.intel.com>
+ 2015-01-20 15:20:20 -0800
+ Commit: 2f82c84, github.com/apache/spark/pull/3997
+
+ [SPARK-5323][SQL] Remove Row's Seq inheritance.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-20 15:16:14 -0800
+ Commit: d181c2a, github.com/apache/spark/pull/4115
+
+ [SPARK-5287][SQL] Add defaultSizeOf to every data type.
+ Yin Huai <yhuai@databricks.com>
+ 2015-01-20 13:26:36 -0800
+ Commit: bc20a52, github.com/apache/spark/pull/4081
+
+ SPARK-5019 [MLlib] - GaussianMixtureModel exposes instances of MultivariateGauss...
+ Travis Galoppo <tjg2107@columbia.edu>
+ 2015-01-20 12:58:11 -0800
+ Commit: 23e2554, github.com/apache/spark/pull/4088
+
+ [SPARK-5329][WebUI] UIWorkloadGenerator should stop SparkContext.
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2015-01-20 12:40:55 -0800
+ Commit: 769aced, github.com/apache/spark/pull/4112
+
+ SPARK-4660: Use correct class loader in JavaSerializer (copy of PR #3840...
+ Jacek Lewandowski <lewandowski.jacek@gmail.com>
+ 2015-01-20 12:38:01 -0800
+ Commit: c93a57f, github.com/apache/spark/pull/4113
+
+ [SQL][Minor] Refactors deeply nested FP style code in BooleanSimplification
+ Cheng Lian <lian@databricks.com>
+ 2015-01-20 11:20:14 -0800
+ Commit: 8140802, github.com/apache/spark/pull/4091
+
+ [SPARK-5333][Mesos] MesosTaskLaunchData occurs BufferUnderflowException
+ Jongyoul Lee <jongyoul@gmail.com>
+ 2015-01-20 10:17:29 -0800
+ Commit: 9d9294a, github.com/apache/spark/pull/4119
+
+ [SPARK-4803] [streaming] Remove duplicate RegisterReceiver message
+ Ilayaperumal Gopinathan <igopinathan@pivotal.io>
+ 2015-01-20 01:41:10 -0800
+ Commit: 4afad9c, github.com/apache/spark/pull/3648
+
+ [SQL][minor] Add a log4j file for catalyst test.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-20 00:55:25 -0800
+ Commit: debc031, github.com/apache/spark/pull/4117
+
+ SPARK-5270 [CORE] Provide isEmpty() function in RDD API
+ Sean Owen <sowen@cloudera.com>
+ 2015-01-19 22:50:44 -0800
+ Commit: 306ff18, github.com/apache/spark/pull/4074
+
+ [SPARK-5214][Core] Add EventLoop and change DAGScheduler to an EventLoop
+ zsxwing <zsxwing@gmail.com>
+ 2015-01-19 18:15:51 -0800
+ Commit: e69fb8c, github.com/apache/spark/pull/4016
+
+ [SPARK-4504][Examples] fix run-example failure if multiple assembly jars exist
+ Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
+ 2015-01-19 11:58:16 -0800
+ Commit: 74de94e, github.com/apache/spark/pull/3377
+
+ [SPARK-5286][SQL] Fail to drop an invalid table when using the data source API
+ Yin Huai <yhuai@databricks.com>
+ 2015-01-19 10:45:29 -0800
+ Commit: 2604bc3, github.com/apache/spark/pull/4076
+
+ [SPARK-5284][SQL] Insert into Hive throws NPE when an inner complex type field has a null value
+ Yin Huai <yhuai@databricks.com>
+ 2015-01-19 10:44:12 -0800
+ Commit: cd5da42, github.com/apache/spark/pull/4077
+
+ [SPARK-5282][mllib]: RowMatrix easily gets int overflow in the memory size warning
+ Yuhao Yang <hhbyyh@gmail.com>
+ 2015-01-19 10:10:15 -0800
+ Commit: 4432568, github.com/apache/spark/pull/4069
+
+ MAINTENANCE: Automated closing of pull requests.
+ Patrick Wendell <patrick@databricks.com>
+ 2015-01-19 02:05:24 -0800
+ Commit: 1ac1c1d, github.com/apache/spark/pull/3584
+
+ [SPARK-5088] Use spark-class for running executors directly
+ Jongyoul Lee <jongyoul@gmail.com>
+ 2015-01-19 02:01:56 -0800
+ Commit: 4a4f9cc, github.com/apache/spark/pull/3897
+
+ [SPARK-3288] All fields in TaskMetrics should be private and use getters/setters
+ Ilya Ganelin <ilya.ganelin@capitalone.com>
+ 2015-01-19 01:32:22 -0800
+ Commit: 3453d57, github.com/apache/spark/pull/4020
+
+ SPARK-5217 Spark UI should report pending stages during job execution on AllStagesPage.
+ Prashant Sharma <prashant.s@imaginea.com>
+ 2015-01-19 01:28:42 -0800
+ Commit: 851b6a9, github.com/apache/spark/pull/4043
+
+ [SQL] fix typo in class description
+ Jacky Li <jacky.likun@gmail.com>
+ 2015-01-18 23:59:08 -0800
+ Commit: 7dbf1fd, github.com/apache/spark/pull/4100
+
+ [SQL][minor] Put DataTypes.java in java dir.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-18 16:35:40 -0800
+ Commit: 1955645, github.com/apache/spark/pull/4097
+
+ [SQL][Minor] Update sql doc according to data type APIs changes
+ scwf <wangfei1@huawei.com>
+ 2015-01-18 11:03:13 -0800
+ Commit: 1a200a3, github.com/apache/spark/pull/4095
+
+ [SPARK-5279][SQL] Use java.math.BigDecimal as the exposed Decimal type.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-18 11:01:42 -0800
+ Commit: 1727e08, github.com/apache/spark/pull/4092
+
+ [HOTFIX]: Minor clean up regarding skipped artifacts in build files.
+ Patrick Wendell <patrick@databricks.com>
+ 2015-01-17 23:15:12 -0800
+ Commit: ad16da1, github.com/apache/spark/pull/4080
+
+ MAINTENANCE: Automated closing of pull requests.
+ Patrick Wendell <patrick@databricks.com>
+ 2015-01-17 20:39:54 -0800
+ Commit: e12b5b6, github.com/apache/spark/pull/681
+
+ [SQL][Minor] Added comments and examples to explain BooleanSimplification
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-17 17:35:53 -0800
+ Commit: e7884bc, github.com/apache/spark/pull/4090
+
+ [SPARK-5096] Use sbt tasks instead of vals to get hadoop version
+ Michael Armbrust <michael@databricks.com>
+ 2015-01-17 17:03:07 -0800
+ Commit: 6999910, github.com/apache/spark/pull/3905
+
+ [SPARK-4937][SQL] Comment for the newly optimization rules in `BooleanSimplification`
+ scwf <wangfei1@huawei.com>
+ 2015-01-17 15:51:24 -0800
+ Commit: c1f3c27, github.com/apache/spark/pull/4086
+
+ [SQL][minor] Improved Row documentation.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-17 00:11:08 -0800
+ Commit: f3bfc76, github.com/apache/spark/pull/4085
+
+ [SPARK-5193][SQL] Remove Spark SQL Java-specific API.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-16 21:09:06 -0800
+ Commit: 61b427d, github.com/apache/spark/pull/4065
+
+ [SPARK-4937][SQL] Adding optimization to simplify the And, Or condition in spark sql
+ scwf <wangfei1@huawei.com>, wangfei <wangfei1@huawei.com>
+ 2015-01-16 14:01:22 -0800
+ Commit: ee1c1f3, github.com/apache/spark/pull/3778
+
+ [SPARK-733] Add documentation on use of accumulators in lazy transformation
+ Ilya Ganelin <ilya.ganelin@capitalone.com>
+ 2015-01-16 13:25:17 -0800
+ Commit: fd3a8a1, github.com/apache/spark/pull/4022
+
+ [SPARK-4923][REPL] Add Developer API to REPL to allow re-publishing the REPL jar
+ Chip Senkbeil <rcsenkbe@us.ibm.com>, Chip Senkbeil <chip.senkbeil@gmail.com>
+ 2015-01-16 12:56:40 -0800
+ Commit: d05c9ee, github.com/apache/spark/pull/4034
+
+ [WebUI] Fix collapse of WebUI layout
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2015-01-16 12:19:08 -0800
+ Commit: ecf943d, github.com/apache/spark/pull/3995
+
+ [SPARK-5231][WebUI] History Server shows wrong job submission time.
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2015-01-16 10:05:11 -0800
+ Commit: e8422c5, github.com/apache/spark/pull/4029
+
+ [DOCS] Fix typo in return type of cogroup
+ Sean Owen <sowen@cloudera.com>
+ 2015-01-16 09:28:44 -0800
+ Commit: f6b852a, github.com/apache/spark/pull/4072
+
+ [SPARK-5201][CORE] deal with int overflow in the ParallelCollectionRDD.slice method
+ Ye Xianjin <advancedxy@gmail.com>
+ 2015-01-16 09:20:53 -0800
+ Commit: e200ac8, github.com/apache/spark/pull/4002
+
+ [SPARK-1507][YARN]specify # cores for ApplicationMaster
+ WangTaoTheTonic <barneystinson@aliyun.com>, WangTao <barneystinson@aliyun.com>
+ 2015-01-16 09:16:56 -0800
+ Commit: 2be82b1, github.com/apache/spark/pull/4018
+
+ [SPARK-4092] [CORE] Fix InputMetrics for coalesce'd Rdds
+ Kostas Sakellis <kostas@cloudera.com>
+ 2015-01-15 18:48:39 -0800
+ Commit: a79a9f9, github.com/apache/spark/pull/3120
+
+ [SPARK-4857] [CORE] Adds Executor membership events to SparkListener
+ Kostas Sakellis <kostas@cloudera.com>
+ 2015-01-15 17:53:42 -0800
+ Commit: 96c2c71, github.com/apache/spark/pull/3711
+
+ [Minor] Fix tiny typo in BlockManager
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2015-01-15 17:07:44 -0800
+ Commit: 65858ba, github.com/apache/spark/pull/4046
+
+ [SPARK-5274][SQL] Reconcile Java and Scala UDFRegistration.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-15 16:15:12 -0800
+ Commit: 1881431, github.com/apache/spark/pull/4056
+
+ [SPARK-5224] [PySpark] improve performance of parallelize list/ndarray
+ Davies Liu <davies@databricks.com>
+ 2015-01-15 11:40:41 -0800
+ Commit: 3c8650c, github.com/apache/spark/pull/4024
+
+ [SPARK-5193][SQL] Tighten up HiveContext API
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-14 20:31:02 -0800
+ Commit: 4b325c7, github.com/apache/spark/pull/4054
+
+ [SPARK-5254][MLLIB] remove developers section from spark.ml guide
+ Xiangrui Meng <meng@databricks.com>
+ 2015-01-14 18:54:17 -0800
+ Commit: 6abc45e, github.com/apache/spark/pull/4053
+
+ [SPARK-5193][SQL] Tighten up SQLContext API
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-14 18:36:15 -0800
+ Commit: cfa397c, github.com/apache/spark/pull/4049
+
+ [SPARK-5254][MLLIB] Update the user guide to position spark.ml better
+ Xiangrui Meng <meng@databricks.com>
+ 2015-01-14 17:50:33 -0800
+ Commit: 13d2406, github.com/apache/spark/pull/4052
+
+ [SPARK-5234][ml]examples for ml don't have sparkContext.stop
+ Yuhao Yang <yuhao@yuhaodevbox.sh.intel.com>
+ 2015-01-14 11:53:43 -0800
+ Commit: 76389c5, github.com/apache/spark/pull/4044
+
+ [SPARK-5235] Make SQLConf Serializable
+ Alex Baretta <alexbaretta@gmail.com>
+ 2015-01-14 11:51:55 -0800
+ Commit: 2fd7f72, github.com/apache/spark/pull/4031
+
+ [SPARK-4014] Add TaskContext.attemptNumber and deprecate TaskContext.attemptId
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-01-14 11:45:40 -0800
+ Commit: 259936b, github.com/apache/spark/pull/3849
+
+ [SPARK-5228][WebUI] Hide tables for "Active Jobs/Completed Jobs/Failed Jobs" when they are empty
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2015-01-14 11:10:29 -0800
+ Commit: 9d4449c, github.com/apache/spark/pull/4028
+
+ [SPARK-2909] [MLlib] [PySpark] SparseVector in pyspark now supports indexing
+ MechCoder <manojkumarsivaraj334@gmail.com>
+ 2015-01-14 11:03:11 -0800
+ Commit: 5840f54, github.com/apache/spark/pull/4025
+
+ [SQL] some comments fix for GROUPING SETS
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2015-01-14 09:50:01 -0800
+ Commit: 38bdc99, github.com/apache/spark/pull/4000
+
+ [SPARK-5211][SQL]Restore HiveMetastoreTypes.toDataType
+ Yin Huai <yhuai@databricks.com>
+ 2015-01-14 09:47:30 -0800
+ Commit: 81f72a0, github.com/apache/spark/pull/4026
+
+ [SPARK-5248] [SQL] move sql.types.decimal.Decimal to sql.types.Decimal
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2015-01-14 09:36:59 -0800
+ Commit: a3f7421, github.com/apache/spark/pull/4041
+
+ [SPARK-5167][SQL] Move Row into sql package and make it usable for Java.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-14 00:38:55 -0800
+ Commit: d5eeb35, github.com/apache/spark/pull/4030
+
+ [SPARK-5123][SQL] Reconcile Java/Scala API for data types.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-13 17:16:41 -0800
+ Commit: f996909, github.com/apache/spark/pull/3958
+
+ [SPARK-5168] Make SQLConf a field rather than mixin in SQLContext
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-13 13:30:35 -0800
+ Commit: 14e3f11, github.com/apache/spark/pull/3965
+
+ [SPARK-4912][SQL] Persistent tables for the Spark SQL data sources api
+ Yin Huai <yhuai@databricks.com>, Michael Armbrust <michael@databricks.com>
+ 2015-01-13 13:01:27 -0800
+ Commit: 6463e0b, github.com/apache/spark/pull/3960
+
+ [SPARK-5223] [MLlib] [PySpark] fix MapConverter and ListConverter in MLlib
+ Davies Liu <davies@databricks.com>
+ 2015-01-13 12:50:31 -0800
+ Commit: 8ead999, github.com/apache/spark/pull/4023
+
+ [SPARK-5131][Streaming][DOC]: There is a discrepancy in WAL implementation and configuration doc.
+ uncleGen <hustyugm@gmail.com>
+ 2015-01-13 10:07:19 -0800
+ Commit: 39e333e, github.com/apache/spark/pull/3930
+
+ [SPARK-4697][YARN]System properties should override environment variables
+ WangTaoTheTonic <barneystinson@aliyun.com>, WangTao <barneystinson@aliyun.com>
+ 2015-01-13 09:43:48 -0800
+ Commit: 9dea64e, github.com/apache/spark/pull/3557
+
+ [SPARK-5006][Deploy]spark.port.maxRetries doesn't work
+ WangTaoTheTonic <barneystinson@aliyun.com>, WangTao <barneystinson@aliyun.com>
+ 2015-01-13 09:28:21 -0800
+ Commit: f7741a9, github.com/apache/spark/pull/3841
+
+ [SPARK-5138][SQL] Ensure schema can be inferred from a namedtuple
+ Gabe Mulley <gabe@edx.org>
+ 2015-01-12 21:44:51 -0800
+ Commit: 1e42e96, github.com/apache/spark/pull/3978
+
+ [SPARK-5049][SQL] Fix ordering of partition columns in ParquetTableScan
+ Michael Armbrust <michael@databricks.com>
+ 2015-01-12 15:19:09 -0800
+ Commit: 5d9fa55, github.com/apache/spark/pull/3990
+
+ [SPARK-4999][Streaming] Change storeInBlockManager to false by default
+ jerryshao <saisai.shao@intel.com>
+ 2015-01-12 13:14:44 -0800
+ Commit: 3aed305, github.com/apache/spark/pull/3906
+
+ SPARK-5172 [BUILD] spark-examples-***.jar shades a wrong Hadoop distribution
+ Sean Owen <sowen@cloudera.com>
+ 2015-01-12 12:15:34 -0800
+ Commit: aff49a3, github.com/apache/spark/pull/3992
+
+ [SPARK-5078] Optionally read from SPARK_LOCAL_HOSTNAME
+ Michael Armbrust <michael@databricks.com>
+ 2015-01-12 11:57:59 -0800
+ Commit: a3978f3, github.com/apache/spark/pull/3893
+
+ SPARK-4159 [BUILD] Addendum: improve running of single test after enabling Java tests
+ Sean Owen <sowen@cloudera.com>
+ 2015-01-12 11:00:56 -0800
+ Commit: 13e610b, github.com/apache/spark/pull/3993
+
+ [SPARK-5102][Core]subclass of MapStatus needs to be registered with Kryo
+ lianhuiwang <lianhuiwang09@gmail.com>
+ 2015-01-12 10:57:12 -0800
+ Commit: ef9224e, github.com/apache/spark/pull/4007
+
+ [SPARK-5200] Disable web UI in Hive ThriftServer tests
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-01-12 10:47:12 -0800
+ Commit: 82fd38d, github.com/apache/spark/pull/3998
+
+ SPARK-5018 [MLlib] [WIP] Make MultivariateGaussian public
+ Travis Galoppo <tjg2107@columbia.edu>
+ 2015-01-11 21:31:16 -0800
+ Commit: 2130de9, github.com/apache/spark/pull/3923
+
+ [SPARK-4033][Examples]Input of the SparkPi too big causes the emption exception
+ huangzhaowei <carlmartinmax@gmail.com>
+ 2015-01-11 16:32:47 -0800
+ Commit: f38ef65, github.com/apache/spark/pull/2874
+
+ [SPARK-4951][Core] Fix the issue that a busy executor may be killed
+ zsxwing <zsxwing@gmail.com>
+ 2015-01-11 16:23:28 -0800
+ Commit: 6942b97, github.com/apache/spark/pull/3783
+
+ [SPARK-5073] spark.storage.memoryMapThreshold has two default values
+ lewuathe <lewuathe@me.com>
+ 2015-01-11 13:50:42 -0800
+ Commit: 1656aae, github.com/apache/spark/pull/3900
+
+ [SPARK-5032] [graphx] Remove GraphX MIMA exclude for 1.3
+ Joseph K. Bradley <joseph@databricks.com>
+ 2015-01-10 17:25:39 -0800
+ Commit: 3313260, github.com/apache/spark/pull/3856
+
+ [SPARK-5029][SQL] Enable from follow multiple brackets
+ scwf <wangfei1@huawei.com>
+ 2015-01-10 17:07:34 -0800
+ Commit: d22a31f, github.com/apache/spark/pull/3853
+
+ [SPARK-4871][SQL] Show sql statement in spark ui when run sql with spark-sql
+ wangfei <wangfei1@huawei.com>
+ 2015-01-10 17:04:56 -0800
+ Commit: 92d9a70, github.com/apache/spark/pull/3718
+
+ [Minor]Resolve sbt warnings during build (MQTTStreamSuite.scala).
+ GuoQiang Li <witgo@qq.com>
+ 2015-01-10 15:38:43 -0800
+ Commit: 8a29dc7, github.com/apache/spark/pull/3989
+
+ [SPARK-5181] do not print writing WAL log when WAL is disabled
+ CodingCat <zhunansjtu@gmail.com>
+ 2015-01-10 15:35:41 -0800
+ Commit: f0d558b, github.com/apache/spark/pull/3985
+
+ [SPARK-4692] [SQL] Support ! boolean logic operator like NOT
+ YanTangZhai <hakeemzhai@tencent.com>, Michael Armbrust <michael@databricks.com>
+ 2015-01-10 15:05:23 -0800
+ Commit: 0ca51cc, github.com/apache/spark/pull/3555
+
+ [SPARK-5187][SQL] Fix caching of tables with HiveUDFs in the WHERE clause
+ Michael Armbrust <michael@databricks.com>
+ 2015-01-10 14:25:45 -0800
+ Commit: 3684fd2, github.com/apache/spark/pull/3987
+
+ SPARK-4963 [SQL] Add copy to SQL's Sample operator
+ Yanbo Liang <yanbohappy@gmail.com>
+ 2015-01-10 14:16:37 -0800
+ Commit: 77106df, github.com/apache/spark/pull/3827
+
+ [SPARK-4861][SQL] Refactor command in spark sql
+ scwf <wangfei1@huawei.com>
+ 2015-01-10 14:08:04 -0800
+ Commit: b3e86dc, github.com/apache/spark/pull/3948
+
+ [SPARK-4574][SQL] Adding support for defining schema in foreign DDL commands.
+ scwf <wangfei1@huawei.com>, Yin Huai <yhuai@databricks.com>, Fei Wang <wangfei1@huawei.com>, wangfei <wangfei1@huawei.com>
+ 2015-01-10 13:53:21 -0800
+ Commit: 693a323, github.com/apache/spark/pull/3431
+
+ [SPARK-4943][SQL] Allow table name having dot for db/catalog
+ Alex Liu <alex_liu68@yahoo.com>
+ 2015-01-10 13:23:09 -0800
+ Commit: 4b39fd1, github.com/apache/spark/pull/3941
+
+ [SPARK-4925][SQL] Publish Spark SQL hive-thriftserver maven artifact
+ Alex Liu <alex_liu68@yahoo.com>
+ 2015-01-10 13:19:12 -0800
+ Commit: 1e56eba, github.com/apache/spark/pull/3766
+
+ [SPARK-5141][SQL]CaseInsensitiveMap throws java.io.NotSerializableException
+ luogankun <luogankun@gmail.com>
+ 2015-01-09 20:38:41 -0800
+ Commit: 545dfcb, github.com/apache/spark/pull/3944
+
+ [SPARK-4406] [MLlib] FIX: Validate k in SVD
+ MechCoder <manojkumarsivaraj334@gmail.com>
+ 2015-01-09 17:45:18 -0800
+ Commit: 4554529, github.com/apache/spark/pull/3945
+
+ [SPARK-4990][Deploy]to find default properties file, search SPARK_CONF_DIR first
+ WangTaoTheTonic <barneystinson@aliyun.com>, WangTao <barneystinson@aliyun.com>
+ 2015-01-09 17:10:02 -0800
+ Commit: 8782eb9, github.com/apache/spark/pull/3823
+
+ [Minor] Fix import order and other coding style
+ bilna <bilnap@am.amrita.edu>, Bilna P <bilna.p@gmail.com>
+ 2015-01-09 14:45:28 -0800
+ Commit: 4e1f12d, github.com/apache/spark/pull/3966
+
+ [DOC] Fixed Mesos version in doc from 0.18.1 to 0.21.0
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2015-01-09 14:40:45 -0800
+ Commit: ae62872, github.com/apache/spark/pull/3982
+
+ [SPARK-4737] Task set manager properly handles serialization errors
+ mcheah <mcheah@palantir.com>
+ 2015-01-09 14:16:20 -0800
+ Commit: e0f28e0, github.com/apache/spark/pull/3638
+
+ [SPARK-1953][YARN]yarn client mode Application Master memory size is same as driver memory...
+ WangTaoTheTonic <barneystinson@aliyun.com>
+ 2015-01-09 13:20:32 -0800
+ Commit: e966452, github.com/apache/spark/pull/3607
+
+ [SPARK-5015] [mllib] Random seed for GMM + make test suite deterministic
+ Joseph K. Bradley <joseph@databricks.com>
+ 2015-01-09 13:00:15 -0800
+ Commit: 7e8e62a, github.com/apache/spark/pull/3981
+
+ [SPARK-3619] Upgrade to Mesos 0.21 to work around MESOS-1688
+ Jongyoul Lee <jongyoul@gmail.com>
+ 2015-01-09 10:47:08 -0800
+ Commit: 454fe12, github.com/apache/spark/pull/3934
+
+ [SPARK-5145][Mllib] Add BLAS.dsyr and use it in GaussianMixtureEM
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-01-09 10:27:33 -0800
+ Commit: e9ca16e, github.com/apache/spark/pull/3949
+
+ [SPARK-1143] Separate pool tests into their own suite.
+ Kay Ousterhout <kayousterhout@gmail.com>
+ 2015-01-09 09:47:06 -0800
+ Commit: b6aa557, github.com/apache/spark/pull/3967
+
+ HOTFIX: Minor improvements to make-distribution.sh
+ Patrick Wendell <pwendell@gmail.com>
+ 2015-01-09 09:40:18 -0800
+ Commit: 1790b38, github.com/apache/spark/pull/3973
+
+ SPARK-5136 [DOCS] Improve documentation around setting up Spark IntelliJ project
+ Sean Owen <sowen@cloudera.com>
+ 2015-01-09 09:35:46 -0800
+ Commit: 547df97, github.com/apache/spark/pull/3952
+
+ [Minor] Fix test RetryingBlockFetcherSuite after changed config name
+ Aaron Davidson <aaron@databricks.com>
+ 2015-01-09 09:20:16 -0800
+ Commit: b4034c3, github.com/apache/spark/pull/3972
+
+ [SPARK-5169][YARN]fetch the correct max attempts
+ WangTaoTheTonic <barneystinson@aliyun.com>
+ 2015-01-09 08:10:09 -0600
+ Commit: f3da4bd, github.com/apache/spark/pull/3942
+
+ [SPARK-5122] Remove Shark from spark-ec2
+ Nicholas Chammas <nicholas.chammas@gmail.com>
+ 2015-01-08 17:42:08 -0800
+ Commit: 167a5ab, github.com/apache/spark/pull/3939
+
+ [SPARK-4048] Enhance and extend hadoop-provided profile.
+ Marcelo Vanzin <vanzin@cloudera.com>
+ 2015-01-08 17:15:13 -0800
+ Commit: 48cecf6, github.com/apache/spark/pull/2982
+
+ [SPARK-4891][PySpark][MLlib] Add gamma/log normal/exp dist sampling to P...
+ RJ Nowling <rnowling@gmail.com>
+ 2015-01-08 15:03:43 -0800
+ Commit: c9c8b21, github.com/apache/spark/pull/3955
+
+ [SPARK-4973][CORE] Local directory in the driver of client-mode continues remaining even if application finished when external shuffle is enabled
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2015-01-08 13:43:09 -0800
+ Commit: a00af6b, github.com/apache/spark/pull/3811
+
+ SPARK-5148 [MLlib] Make usersOut/productsOut storagelevel in ALS configurable
+ Fernando Otero (ZeoS) <fotero@gmail.com>
+ 2015-01-08 12:42:54 -0800
+ Commit: 72df5a3, github.com/apache/spark/pull/3953
+
+ Document that groupByKey will OOM for large keys
+ Eric Moyer <eric_moyer@yahoo.com>
+ 2015-01-08 11:55:23 -0800
+ Commit: 538f221, github.com/apache/spark/pull/3936
+
+ [SPARK-5130][Deploy]Take yarn-cluster as cluster mode in spark-submit
+ WangTaoTheTonic <barneystinson@aliyun.com>
+ 2015-01-08 11:45:42 -0800
+ Commit: 0760787, github.com/apache/spark/pull/3929
+
+ [Minor] Fix the value represented by spark.executor.id for consistency.
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2015-01-08 11:35:56 -0800
+ Commit: 0a59727, github.com/apache/spark/pull/3812
+
+ [SPARK-4989][CORE] avoid wrong eventlog conf cause cluster down in standalone mode
+ Zhang, Liye <liye.zhang@intel.com>
+ 2015-01-08 10:40:26 -0800
+ Commit: 06dc4b5, github.com/apache/spark/pull/3824
+
+ [SPARK-4917] Add a function to convert into a graph with canonical edges in GraphOps
+ Takeshi Yamamuro <linguin.m.s@gmail.com>
+ 2015-01-08 09:55:12 -0800
+ Commit: f825e19, github.com/apache/spark/pull/3760
+
+ SPARK-5087. [YARN] Merge yarn.Client and yarn.ClientBase
+ Sandy Ryza <sandy@cloudera.com>
+ 2015-01-08 09:25:43 -0800
+ Commit: 8d45834, github.com/apache/spark/pull/3896
+
+ MAINTENANCE: Automated closing of pull requests.
+ Patrick Wendell <pwendell@gmail.com>
+ 2015-01-07 23:25:56 -0800
+ Commit: c082385, github.com/apache/spark/pull/3880
+
+ [SPARK-5116][MLlib] Add extractor for SparseVector and DenseVector
+ Shuo Xiang <shuoxiangpub@gmail.com>
+ 2015-01-07 23:22:37 -0800
+ Commit: c66a976, github.com/apache/spark/pull/3919
+
+ [SPARK-5126][Core] Verify Spark urls before creating Actors so that invalid urls can crash the process.
+ zsxwing <zsxwing@gmail.com>
+ 2015-01-07 23:01:30 -0800
+ Commit: 2b729d2, github.com/apache/spark/pull/3927
+
+ [SPARK-5132][Core]Correct stage Attempt Id key in stageInfofromJson
+ hushan[胡珊] <hushan@xiaomi.com>
+ 2015-01-07 12:09:12 -0800
+ Commit: d345ebe, github.com/apache/spark/pull/3932
+
+ [SPARK-5128][MLLib] Add common used log1pExp API in MLUtils
+ DB Tsai <dbtsai@alpinenow.com>
+ 2015-01-07 10:13:41 -0800
+ Commit: 60e2d9e, github.com/apache/spark/pull/3915
+
+ [SPARK-2458] Make failed application log visible on History Server
+ Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
+ 2015-01-07 07:32:16 -0800
+ Commit: 6e74ede, github.com/apache/spark/pull/3467
+
+ [SPARK-2165][YARN]add support for setting maxAppAttempts in the ApplicationSubmissionContext
+ WangTaoTheTonic <barneystinson@aliyun.com>
+ 2015-01-07 08:14:39 -0600
+ Commit: 8fdd489, github.com/apache/spark/pull/3878
+
+ [YARN][SPARK-4929] Bug fix: fix the yarn-client code to support HA
+ huangzhaowei <carlmartinmax@gmail.com>
+ 2015-01-07 08:10:42 -0600
+ Commit: 5fde661, github.com/apache/spark/pull/3771
+
+ [SPARK-5099][Mllib] Simplify logistic loss function
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-01-06 21:23:31 -0800
+ Commit: e21acc1, github.com/apache/spark/pull/3899
+
+ [SPARK-5050][Mllib] Add unit test for sqdist
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2015-01-06 14:00:45 -0800
+ Commit: bb38ebb, github.com/apache/spark/pull/3869
+
+ SPARK-5017 [MLlib] - Use SVD to compute determinant and inverse of covariance matrix
+ Travis Galoppo <tjg2107@columbia.edu>
+ 2015-01-06 13:57:42 -0800
+ Commit: 4108e5f, github.com/apache/spark/pull/3871
+
+ SPARK-4159 [CORE] Maven build doesn't run JUnit test suites
+ Sean Owen <sowen@cloudera.com>
+ 2015-01-06 12:02:08 -0800
+ Commit: 4cba6eb, github.com/apache/spark/pull/3651
+
+ [Minor] Fix comments for GraphX 2D partitioning strategy
+ kj-ki <kikushima.kenji@lab.ntt.co.jp>
+ 2015-01-06 09:49:37 -0800
+ Commit: 5e3ec11, github.com/apache/spark/pull/3904
+
+ [SPARK-1600] Refactor FileInputStream tests to remove Thread.sleep() calls and SystemClock usage
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-01-06 00:31:19 -0800
+ Commit: a6394bc, github.com/apache/spark/pull/3801
+
+ SPARK-4843 [YARN] Squash ExecutorRunnableUtil and ExecutorRunnable
+ Kostas Sakellis <kostas@cloudera.com>
+ 2015-01-05 23:26:33 -0800
+ Commit: 451546a, github.com/apache/spark/pull/3696
+
+ [SPARK-5040][SQL] Support expressing unresolved attributes using $"attribute name" notation in SQL DSL.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-05 15:34:22 -0800
+ Commit: 04d55d8, github.com/apache/spark/pull/3862
+
+ [SPARK-5093] Set spark.network.timeout to 120s consistently.
+ Reynold Xin <rxin@databricks.com>
+ 2015-01-05 15:19:53 -0800
+ Commit: bbcba3a, github.com/apache/spark/pull/3903
+
+ [SPARK-5089][PYSPARK][MLLIB] Fix vector convert
+ freeman <the.freeman.lab@gmail.com>
+ 2015-01-05 13:10:59 -0800
+ Commit: 6c6f325, github.com/apache/spark/pull/3902
+
+ [SPARK-4465] runAsSparkUser doesn't affect TaskRunner in Mesos environme...
+ Jongyoul Lee <jongyoul@gmail.com>
+ 2015-01-05 12:05:09 -0800
+ Commit: 1c0e7ce, github.com/apache/spark/pull/3741
+
+ [SPARK-5057] Log message in failed askWithReply attempts
+ WangTao <barneystinson@aliyun.com>, WangTaoTheTonic <barneystinson@aliyun.com>
+ 2015-01-05 11:59:38 -0800
+ Commit: ce39b34, github.com/apache/spark/pull/3875
+
+ [SPARK-4688] Have a single shared network timeout in Spark
+ Varun Saxena <vsaxena.varun@gmail.com>, varunsaxena <vsaxena.varun@gmail.com>
+ 2015-01-05 10:32:37 -0800
+ Commit: d3f07fd, github.com/apache/spark/pull/3562
+
+ [SPARK-5074][Core] Fix a non-deterministic test failure
+ zsxwing <zsxwing@gmail.com>
+ 2015-01-04 21:18:33 -0800
+ Commit: 5c506ce, github.com/apache/spark/pull/3889
+
+ [SPARK-5083][Core] Fix a flaky test in TaskResultGetterSuite
+ zsxwing <zsxwing@gmail.com>
+ 2015-01-04 21:09:21 -0800
+ Commit: 27e7f5a, github.com/apache/spark/pull/3894
+
+ [SPARK-5069][Core] Fix the race condition of TaskSchedulerImpl.dagScheduler
+ zsxwing <zsxwing@gmail.com>
+ 2015-01-04 21:06:04 -0800
+ Commit: 6c726a3, github.com/apache/spark/pull/3887
+
+ [SPARK-5067][Core] Use '===' to compare well-defined case class
+ zsxwing <zsxwing@gmail.com>
+ 2015-01-04 21:03:17 -0800
+ Commit: 7239652, github.com/apache/spark/pull/3886
+
+ [SPARK-4835] Disable validateOutputSpecs for Spark Streaming jobs
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-01-04 20:26:18 -0800
+ Commit: 939ba1f, github.com/apache/spark/pull/3832
+
+ [SPARK-4631] unit test for MQTT
+ bilna <bilnap@am.amrita.edu>, Bilna P <bilna.p@gmail.com>
+ 2015-01-04 19:37:48 -0800
+ Commit: e767d7d, github.com/apache/spark/pull/3844
+
+ [SPARK-4787] Stop SparkContext if a DAGScheduler init error occurs
+ Dale <tigerquoll@outlook.com>
+ 2015-01-04 13:28:37 -0800
+ Commit: 3fddc94, github.com/apache/spark/pull/3809
+
+ [SPARK-794][Core] Remove sleep() in ClusterScheduler.stop
+ Brennon York <brennon.york@capitalone.com>
+ 2015-01-04 12:40:39 -0800
+ Commit: b96008d, github.com/apache/spark/pull/3851
+
+ [SPARK-5058] Updated broken links
+ sigmoidanalytics <mayur@sigmoidanalytics.com>
+ 2015-01-03 19:46:08 -0800
+ Commit: 342612b, github.com/apache/spark/pull/3877
+
+ Fixed typos in streaming-kafka-integration.md
+ Akhil Das <akhld@darktech.ca>
+ 2015-01-02 15:12:27 -0800
+ Commit: cdccc26, github.com/apache/spark/pull/3876
+
+ [SPARK-3325][Streaming] Add a parameter to the method print in class DStream
+ Yadong Qi <qiyadong2010@gmail.com>, q00251598 <qiyadong@huawei.com>, Tathagata Das <tathagata.das1565@gmail.com>, wangfei <wangfei1@huawei.com>
+ 2015-01-02 15:09:41 -0800
+ Commit: bd88b71, github.com/apache/spark/pull/3865
+
+ [HOTFIX] Bind web UI to ephemeral port in DriverSuite
+ Josh Rosen <joshrosen@databricks.com>
+ 2015-01-01 15:03:54 -0800
+ Commit: 0128398, github.com/apache/spark/pull/3873
+
+ [SPARK-5038] Add explicit return type for implicit functions.
+ Reynold Xin <rxin@databricks.com>
+ 2014-12-31 17:07:47 -0800
+ Commit: 7749dd6, github.com/apache/spark/pull/3860
+
+ SPARK-2757 [BUILD] [STREAMING] Add Mima test for Spark Sink after 1.10 is released
+ Sean Owen <sowen@cloudera.com>
+ 2014-12-31 16:59:17 -0800
+ Commit: 4bb1248, github.com/apache/spark/pull/3842
+
+ [SPARK-5035] [Streaming] ReceiverMessage trait should extend Serializable
+ Josh Rosen <joshrosen@databricks.com>
+ 2014-12-31 16:02:47 -0800
+ Commit: fe6efac, github.com/apache/spark/pull/3857
+
+ SPARK-5020 [MLlib] GaussianMixtureModel.predictMembership() should take an RDD only
+ Travis Galoppo <tjg2107@columbia.edu>
+ 2014-12-31 15:39:58 -0800
+ Commit: c4f0b4f, github.com/apache/spark/pull/3854
+
+ [SPARK-5028][Streaming]Add total received and processed records metrics to Streaming UI
+ jerryshao <saisai.shao@intel.com>
+ 2014-12-31 14:45:31 -0800
+ Commit: fdc2aa4, github.com/apache/spark/pull/3852
+
+ [SPARK-4790][STREAMING] Fix ReceivedBlockTrackerSuite waits for old file...
+ Hari Shreedharan <hshreedharan@apache.org>
+ 2014-12-31 14:35:07 -0800
+ Commit: 3610d3c, github.com/apache/spark/pull/3726
+
+ [SPARK-5038][SQL] Add explicit return type for implicit functions in Spark SQL
+ Reynold Xin <rxin@databricks.com>
+ 2014-12-31 14:25:03 -0800
+ Commit: c88a3d7, github.com/apache/spark/pull/3859
+
+ [HOTFIX] Disable Spark UI in SparkSubmitSuite tests
+ Josh Rosen <joshrosen@databricks.com>
+ 2014-12-12 12:38:37 -0800
+ Commit: e24d3a9
+
+ SPARK-4547 [MLLIB] OOM when making bins in BinaryClassificationMetrics
+ Sean Owen <sowen@cloudera.com>
+ 2014-12-31 13:37:04 -0800
+ Commit: 3d194cc, github.com/apache/spark/pull/3702
+
+ [SPARK-4298][Core] - The spark-submit cannot read Main-Class from Manifest.
+ Brennon York <brennon.york@capitalone.com>
+ 2014-12-31 11:54:10 -0800
+ Commit: 8e14c5e, github.com/apache/spark/pull/3561
+
+ [SPARK-4797] Replace breezeSquaredDistance
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2014-12-31 11:50:53 -0800
+ Commit: 06a9aa5, github.com/apache/spark/pull/3643
+
+ [SPARK-1010] Clean up uses of System.setProperty in unit tests
+ Josh Rosen <joshrosen@databricks.com>
+ 2014-12-30 18:12:20 -0800
+ Commit: 352ed6b, github.com/apache/spark/pull/3739
+
+ [SPARK-4998][MLlib]delete the "train" function
+ Liu Jiongzhou <ljzzju@163.com>
+ 2014-12-30 15:55:56 -0800
+ Commit: 035bac8, github.com/apache/spark/pull/3836
+
+ [SPARK-4813][Streaming] Fix the issue that ContextWaiter didn't handle 'spurious wakeup'
+ zsxwing <zsxwing@gmail.com>
+ 2014-12-30 14:39:13 -0800
+ Commit: 6a89782, github.com/apache/spark/pull/3661
+
+ [Spark-4995] Replace Vector.toBreeze.activeIterator with foreachActive
+ Jakub Dubovsky <dubovsky@avast.com>
+ 2014-12-30 14:19:07 -0800
+ Commit: 0f31992, github.com/apache/spark/pull/3846
+
+ SPARK-3955 part 2 [CORE] [HOTFIX] Different versions between jackson-mapper-asl and jackson-core-asl
+ Sean Owen <sowen@cloudera.com>
+ 2014-12-30 14:00:57 -0800
+ Commit: b239ea1, github.com/apache/spark/pull/3829
+
+ [SPARK-4570][SQL]add BroadcastLeftSemiJoinHash
+ wangxiaojing <u9jing@gmail.com>
+ 2014-12-30 13:54:12 -0800
+ Commit: 07fa191, github.com/apache/spark/pull/3442
+
+ [SPARK-4935][SQL] When hive.cli.print.header configured, spark-sql aborted if passed in a invalid sql
+ wangfei <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
+ 2014-12-30 13:44:30 -0800
+ Commit: 8f29b7c, github.com/apache/spark/pull/3761
+
+ [SPARK-4386] Improve performance when writing Parquet files
+ Michael Davies <Michael.BellDavies@gmail.com>
+ 2014-12-30 13:40:51 -0800
+ Commit: 7425bec, github.com/apache/spark/pull/3843
+
+ [SPARK-4937][SQL] Normalizes conjunctions and disjunctions to eliminate common predicates
+ Cheng Lian <lian@databricks.com>
+ 2014-12-30 13:38:27 -0800
+ Commit: 61a99f6, github.com/apache/spark/pull/3784
+
+ [SPARK-4928][SQL] Fix: Operator '>,<,>=,<=' with decimal between different precision report error
+ guowei2 <guowei2@asiainfo.com>
+ 2014-12-30 12:21:00 -0800
+ Commit: a75dd83, github.com/apache/spark/pull/3767
+
+ [SPARK-4930][SQL][DOCS]Update SQL programming guide, CACHE TABLE is eager
+ luogankun <luogankun@gmail.com>
+ 2014-12-30 12:18:55 -0800
+ Commit: 2deac74, github.com/apache/spark/pull/3773
+
+ [SPARK-4916][SQL][DOCS]Update SQL programming guide about cache section
+ luogankun <luogankun@gmail.com>
+ 2014-12-30 12:17:49 -0800
+ Commit: f7a41a0, github.com/apache/spark/pull/3759
+
+ [SPARK-4493][SQL] Tests for IsNull / IsNotNull in the ParquetFilterSuite
+ Cheng Lian <lian@databricks.com>
+ 2014-12-30 12:16:45 -0800
+ Commit: 19a8802, github.com/apache/spark/pull/3748
+
+ [Spark-4512] [SQL] Unresolved Attribute Exception in Sort By
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-12-30 12:11:44 -0800
+ Commit: 53f0a00, github.com/apache/spark/pull/3386
+
+ [SPARK-5002][SQL] Using ascending by default when not specify order in order by
+ wangfei <wangfei1@huawei.com>
+ 2014-12-30 12:07:24 -0800
+ Commit: daac221, github.com/apache/spark/pull/3838
+
+ [SPARK-4904] [SQL] Remove the unnecessary code change in Generic UDF
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-12-30 11:47:08 -0800
+ Commit: 63b84b7, github.com/apache/spark/pull/3745
+
+ [SPARK-4959] [SQL] Attributes are case sensitive when using a select query from a projection
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-12-30 11:33:47 -0800
+ Commit: 5595eaa, github.com/apache/spark/pull/3796
+
+ [SPARK-4975][SQL] Fix HiveInspectorSuite test failure
+ scwf <wangfei1@huawei.com>, Fei Wang <wangfei1@huawei.com>
+ 2014-12-30 11:30:47 -0800
+ Commit: 65357f1, github.com/apache/spark/pull/3814
+
+ [SQL] enable view test
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2014-12-30 11:29:13 -0800
+ Commit: 94d60b7, github.com/apache/spark/pull/3826
+
+ [SPARK-4908][SQL] Prevent multiple concurrent hive native commands
+ Michael Armbrust <michael@databricks.com>
+ 2014-12-30 11:24:46 -0800
+ Commit: 480bd1d, github.com/apache/spark/pull/3834
+
+ [SPARK-4882] Register PythonBroadcast with Kryo so that PySpark works with KryoSerializer
+ Josh Rosen <joshrosen@databricks.com>
+ 2014-12-30 09:29:52 -0800
+ Commit: efa80a53, github.com/apache/spark/pull/3831
+
+ [SPARK-4920][UI] add version on master and worker page for standalone mode
+ Zhang, Liye <liye.zhang@intel.com>
+ 2014-12-30 09:19:47 -0800
+ Commit: 9077e72, github.com/apache/spark/pull/3769
+
+ [SPARK-4972][MLlib] Updated the scala doc for lasso and ridge regression for the change of LeastSquaresGradient
+ DB Tsai <dbtsai@alpinenow.com>
+ 2014-12-29 17:17:12 -0800
+ Commit: 040d6f2, github.com/apache/spark/pull/3808
+
+ Added setMinCount to Word2Vec.scala
+ ganonp <ganonp@gmail.com>
+ 2014-12-29 15:31:19 -0800
+ Commit: 343db39, github.com/apache/spark/pull/3693
+
+ SPARK-4156 [MLLIB] EM algorithm for GMMs
+ Travis Galoppo <tjg2107@columbia.edu>, Travis Galoppo <travis@localhost.localdomain>, tgaloppo <tjg2107@columbia.edu>, FlytxtRnD <meethu.mathew@flytxt.com>
+ 2014-12-29 15:29:15 -0800
+ Commit: 6cf6fdf, github.com/apache/spark/pull/3022
+
+ SPARK-4968: takeOrdered to skip reduce step in case mappers return no partitions
+ Yash Datta <Yash.Datta@guavus.com>
+ 2014-12-29 13:49:45 -0800
+ Commit: 9bc0df6, github.com/apache/spark/pull/3830
+
+ [SPARK-4409][MLlib] Additional Linear Algebra Utils
+ Burak Yavuz <brkyvz@gmail.com>, Xiangrui Meng <meng@databricks.com>
+ 2014-12-29 13:24:26 -0800
+ Commit: 02b55de, github.com/apache/spark/pull/3319
+
+ [Minor] Fix a typo of type parameter in JavaUtils.scala
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2014-12-29 12:05:08 -0800
+ Commit: 8d72341, github.com/apache/spark/pull/3789
+
+ [SPARK-4946] [CORE] Using AkkaUtils.askWithReply in MapOutputTracker.askTracker to reduce the chance of the communicating problem
+ YanTangZhai <hakeemzhai@tencent.com>, yantangzhai <tyz0303@163.com>
+ 2014-12-29 11:30:54 -0800
+ Commit: 815de54, github.com/apache/spark/pull/3785
+
+ Added LICENSE Header to build/mvn, build/sbt and sbt/sbt
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2014-12-29 10:48:53 -0800
+ Commit: 4cef05e, github.com/apache/spark/pull/3817
+
+ [SPARK-4982][DOC] `spark.ui.retainedJobs` description is wrong in Spark UI configuration guide
+ wangxiaojing <u9jing@gmail.com>
+ 2014-12-29 10:45:14 -0800
+ Commit: 6645e52, github.com/apache/spark/pull/3818
+
+ [SPARK-4966][YARN]The MemoryOverhead value is not set correctly
+ meiyoula <1039320815@qq.com>
+ 2014-12-29 08:20:30 -0600
+ Commit: 14fa87b, github.com/apache/spark/pull/3797
+
+ [SPARK-4501][Core] - Create build/mvn to automatically download maven/zinc/scalac
+ Brennon York <brennon.york@capitalone.com>
+ 2014-12-27 13:25:18 -0800
+ Commit: a3e51cc, github.com/apache/spark/pull/3707
+
+ [SPARK-4952][Core]Handle ConcurrentModificationExceptions in SparkEnv.environmentDetails
+ GuoQiang Li <witgo@qq.com>
+ 2014-12-26 23:31:29 -0800
+ Commit: 080ceb7, github.com/apache/spark/pull/3788
+
+ [SPARK-4954][Core] add spark version information in log for standalone mode
+ Zhang, Liye <liye.zhang@intel.com>
+ 2014-12-26 23:23:13 -0800
+ Commit: 786808a, github.com/apache/spark/pull/3790
+
+ [SPARK-3955] Different versions between jackson-mapper-asl and jackson-c...
+ Jongyoul Lee <jongyoul@gmail.com>
+ 2014-12-26 22:59:34 -0800
+ Commit: 2483c1e, github.com/apache/spark/pull/3716
+
+ HOTFIX: Slight tweak on previous commit.
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-12-26 22:55:04 -0800
+ Commit: 82bf4be
+
+ [SPARK-3787][BUILD] Assembly jar name is wrong when we build with sbt omitting -Dhadoop.version
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2014-12-26 22:52:04 -0800
+ Commit: de95c57, github.com/apache/spark/pull/3046
+
+ MAINTENANCE: Automated closing of pull requests.
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-12-26 22:39:56 -0800
+ Commit: 534f24b, github.com/apache/spark/pull/3456
+
+ SPARK-4971: Fix typo in BlockGenerator comment
+ CodingCat <zhunansjtu@gmail.com>
+ 2014-12-26 12:03:22 -0800
+ Commit: fda4331, github.com/apache/spark/pull/3807
+
+ [SPARK-4608][Streaming] Reorganize StreamingContext implicit to improve API convenience
+ zsxwing <zsxwing@gmail.com>
+ 2014-12-25 19:46:05 -0800
+ Commit: f9ed2b6, github.com/apache/spark/pull/3464
+
+ [SPARK-4537][Streaming] Expand StreamingSource to add more metrics
+ jerryshao <saisai.shao@intel.com>
+ 2014-12-25 19:39:49 -0800
+ Commit: f205fe4, github.com/apache/spark/pull/3466
+
+ [EC2] Update mesos/spark-ec2 branch to branch-1.3
+ Nicholas Chammas <nicholas.chammas@gmail.com>
+ 2014-12-25 14:16:50 -0800
+ Commit: ac82785, github.com/apache/spark/pull/3804
+
+ [EC2] Update default Spark version to 1.2.0
+ Nicholas Chammas <nicholas.chammas@gmail.com>
+ 2014-12-25 14:13:12 -0800
+ Commit: b6b6393, github.com/apache/spark/pull/3793
+
+ Fix "Building Spark With Maven" link in README.md
+ Denny Lee <denny.g.lee@gmail.com>
+ 2014-12-25 14:05:55 -0800
+ Commit: 08b18c7, github.com/apache/spark/pull/3802
+
+ [SPARK-4953][Doc] Fix the description of building Spark with YARN
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2014-12-25 07:05:43 -0800
+ Commit: 11dd993, github.com/apache/spark/pull/3787
+
+ [SPARK-4873][Streaming] Use `Future.zip` instead of `Future.flatMap`(for-loop) in WriteAheadLogBasedBlockHandler
+ zsxwing <zsxwing@gmail.com>
+ 2014-12-24 19:49:41 -0800
+ Commit: b4d0db8, github.com/apache/spark/pull/3721
+
+ SPARK-4297 [BUILD] Build warning fixes omnibus
+ Sean Owen <sowen@cloudera.com>
+ 2014-12-24 13:32:51 -0800
+ Commit: 29fabb1, github.com/apache/spark/pull/3157
+
+ [SPARK-4881][Minor] Use SparkConf#getBoolean instead of get().toBoolean
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2014-12-23 19:14:34 -0800
+ Commit: 199e59a, github.com/apache/spark/pull/3733
+
+ [SPARK-4860][pyspark][sql] speeding up `sample()` and `takeSample()`
+ jbencook <jbenjamincook@gmail.com>, J. Benjamin Cook <jbenjamincook@gmail.com>
+ 2014-12-23 17:46:24 -0800
+ Commit: fd41eb9, github.com/apache/spark/pull/3764
+
+ [SPARK-4606] Send EOF to child JVM when there's no more data to read.
+ Marcelo Vanzin <vanzin@cloudera.com>
+ 2014-12-23 16:02:59 -0800
+ Commit: 7e2deb7, github.com/apache/spark/pull/3460
+
+ [SPARK-4671][Streaming]Do not replicate streaming block when WAL is enabled
+ jerryshao <saisai.shao@intel.com>
+ 2014-12-23 15:45:53 -0800
+ Commit: 3f5f4cc, github.com/apache/spark/pull/3534
+
+ [SPARK-4802] [streaming] Remove receiverInfo once receiver is de-registered
+ Ilayaperumal Gopinathan <igopinathan@pivotal.io>
+ 2014-12-23 15:14:54 -0800
+ Commit: 10d69e9, github.com/apache/spark/pull/3647
+
+ [SPARK-4913] Fix incorrect event log path
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2014-12-23 14:58:33 -0800
+ Commit: 96281cd, github.com/apache/spark/pull/3755
+
+ [SPARK-4730][YARN] Warn against deprecated YARN settings
+ Andrew Or <andrew@databricks.com>
+ 2014-12-23 14:28:36 -0800
+ Commit: 27c5399, github.com/apache/spark/pull/3590
+
+ [SPARK-4914][Build] Cleans lib_managed before compiling with Hive 0.13.1
+ Cheng Lian <lian@databricks.com>
+ 2014-12-23 12:54:20 -0800
+ Commit: 395b771, github.com/apache/spark/pull/3756
+
+ [SPARK-4932] Add help comments in Analytics
+ Takeshi Yamamuro <linguin.m.s@gmail.com>
+ 2014-12-23 12:39:41 -0800
+ Commit: 9c251c5, github.com/apache/spark/pull/3775
+
+ [SPARK-4834] [standalone] Clean up application files after app finishes.
+ Marcelo Vanzin <vanzin@cloudera.com>
+ 2014-12-23 12:02:08 -0800
+ Commit: dd15536, github.com/apache/spark/pull/3705
+
+ [SPARK-4931][Yarn][Docs] Fix the format of running-on-yarn.md
+ zsxwing <zsxwing@gmail.com>
+ 2014-12-23 11:18:06 -0800
+ Commit: 2d215ae, github.com/apache/spark/pull/3774
+
+ [SPARK-4890] Ignore downloaded EC2 libs
+ Nicholas Chammas <nicholas.chammas@gmail.com>
+ 2014-12-23 11:12:16 -0800
+ Commit: 2823c7f, github.com/apache/spark/pull/3770
+
+ [Docs] Minor typo fixes
+ Nicholas Chammas <nicholas.chammas@gmail.com>
+ 2014-12-22 22:54:32 -0800
+ Commit: 0e532cc, github.com/apache/spark/pull/3772
+
+ [SPARK-4907][MLlib] Inconsistent loss and gradient in LeastSquaresGradient compared with R
+ DB Tsai <dbtsai@alpinenow.com>
+ 2014-12-22 16:42:55 -0800
+ Commit: a96b727, github.com/apache/spark/pull/3746
+
+ [SPARK-4818][Core] Add 'iterator' to reduce memory consumed by join
+ zsxwing <zsxwing@gmail.com>
+ 2014-12-22 14:26:28 -0800
+ Commit: c233ab3, github.com/apache/spark/pull/3671
+
+ [SPARK-4920][UI]:current spark version in UI is not striking.
+ genmao.ygm <genmao.ygm@alibaba-inc.com>
+ 2014-12-22 14:14:39 -0800
+ Commit: de9d7d2, github.com/apache/spark/pull/3763
+
+ [Minor] Fix scala doc
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2014-12-22 14:13:31 -0800
+ Commit: a61aa66, github.com/apache/spark/pull/3751
+
+ [SPARK-4864] Add documentation to Netty-based configs
+ Aaron Davidson <aaron@databricks.com>
+ 2014-12-22 13:09:22 -0800
+ Commit: fbca6b6, github.com/apache/spark/pull/3713
+
+ [SPARK-4079] [CORE] Consolidates Errors if a CompressionCodec is not available
+ Kostas Sakellis <kostas@cloudera.com>
+ 2014-12-22 13:07:01 -0800
+ Commit: 7c0ed13, github.com/apache/spark/pull/3119
+
+ SPARK-4447. Remove layers of abstraction in YARN code no longer needed after dropping yarn-alpha
+ Sandy Ryza <sandy@cloudera.com>
+ 2014-12-22 12:23:43 -0800
+ Commit: d62da64, github.com/apache/spark/pull/3652
+
+ [SPARK-4733] Add missing parameter comments in ShuffleDependency
+ Takeshi Yamamuro <linguin.m.s@gmail.com>
+ 2014-12-22 12:19:23 -0800
+ Commit: fb8e85e, github.com/apache/spark/pull/3594
+
+ [Minor] Improve some code in BroadcastTest for short
+ carlmartin <carlmartinmax@gmail.com>
+ 2014-12-22 12:13:53 -0800
+ Commit: 1d9788e, github.com/apache/spark/pull/3750
+
+ [SPARK-4883][Shuffle] Add a name to the directoryCleaner thread
+ zsxwing <zsxwing@gmail.com>
+ 2014-12-22 12:11:36 -0800
+ Commit: 8773705, github.com/apache/spark/pull/3734
+
+ [SPARK-4870] Add spark version to driver log
+ Zhang, Liye <liye.zhang@intel.com>
+ 2014-12-22 11:36:49 -0800
+ Commit: 39272c8, github.com/apache/spark/pull/3717
+
+ [SPARK-4915][YARN] Fix classname to be specified for external shuffle service.
+ Tsuyoshi Ozawa <ozawa.tsuyoshi@lab.ntt.co.jp>
+ 2014-12-22 11:28:05 -0800
+ Commit: 96606f6, github.com/apache/spark/pull/3757
+
+ [SPARK-4918][Core] Reuse Text in saveAsTextFile
+ zsxwing <zsxwing@gmail.com>
+ 2014-12-22 11:20:00 -0800
+ Commit: 93b2f3a, github.com/apache/spark/pull/3762
+
+ [SPARK-2075][Core] Make the compiler generate same bytes code for Hadoop 1.+ and Hadoop 2.+
+ zsxwing <zsxwing@gmail.com>
+ 2014-12-21 22:10:19 -0800
+ Commit: 6ee6aa7, github.com/apache/spark/pull/3740
+
+ SPARK-4910 [CORE] build failed (use of FileStatus.isFile in Hadoop 1.x)
+ Sean Owen <sowen@cloudera.com>
+ 2014-12-21 13:16:57 -0800
+ Commit: c6a3c0d, github.com/apache/spark/pull/3754
+
+ [Minor] Build Failed: value defaultProperties not found
+ huangzhaowei <carlmartinmax@gmail.com>
+ 2014-12-19 23:32:56 -0800
+ Commit: a764960, github.com/apache/spark/pull/3749
+
+ [SPARK-4140] Document dynamic allocation
+ Andrew Or <andrew@databricks.com>, Tsuyoshi Ozawa <ozawa.tsuyoshi@gmail.com>
+ 2014-12-19 19:36:20 -0800
+ Commit: 15c03e1, github.com/apache/spark/pull/3731
+
+ [SPARK-4831] Do not include SPARK_CLASSPATH if empty
+ Daniel Darabos <darabos.daniel@gmail.com>
+ 2014-12-19 19:32:39 -0800
+ Commit: 7cb3f54, github.com/apache/spark/pull/3678
+
+ SPARK-2641: Passing num executors to spark arguments from properties file
+ Kanwaljit Singh <kanwaljit.singh@guavus.com>
+ 2014-12-19 19:25:39 -0800
+ Commit: 1d64812, github.com/apache/spark/pull/1657
+
+ [SPARK-3060] spark-shell.cmd doesn't accept application options in Windows OS
+ Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
+ 2014-12-19 19:19:53 -0800
+ Commit: 8d93247, github.com/apache/spark/pull/3350
+
+ change signature of example to match released code
+ Eran Medan <ehrann.mehdan@gmail.com>
+ 2014-12-19 18:29:36 -0800
+ Commit: c25c669, github.com/apache/spark/pull/3747
+
+ [SPARK-2261] Make event logger use a single file.
+ Marcelo Vanzin <vanzin@cloudera.com>
+ 2014-12-19 18:21:15 -0800
+ Commit: 4564519, github.com/apache/spark/pull/1222
+
+ [SPARK-4890] Upgrade Boto to 2.34.0; automatically download Boto from PyPi instead of packaging it
+ Josh Rosen <joshrosen@databricks.com>
+ 2014-12-19 17:02:37 -0800
+ Commit: c28083f, github.com/apache/spark/pull/3737
+
+ [SPARK-4896] don’t redundantly overwrite executor JAR deps
+ Ryan Williams <ryan.blake.williams@gmail.com>
+ 2014-12-19 15:24:41 -0800
+ Commit: 7981f96, github.com/apache/spark/pull/2848
+
+ [SPARK-4889] update history server example cmds
+ Ryan Williams <ryan.blake.williams@gmail.com>
+ 2014-12-19 13:56:04 -0800
+ Commit: cdb2c64, github.com/apache/spark/pull/3736
+
+ Small refactoring to pass SparkEnv into Executor rather than creating SparkEnv in Executor.
+ Reynold Xin <rxin@databricks.com>
+ 2014-12-19 12:51:12 -0800
+ Commit: 336cd34, github.com/apache/spark/pull/3738
+
+ [Build] Remove spark-staging-1038
+ scwf <wangfei1@huawei.com>
+ 2014-12-19 08:29:38 -0800
+ Commit: 8e253eb, github.com/apache/spark/pull/3743
+
+ [SPARK-4901] [SQL] Hot fix for ByteWritables.copyBytes
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-12-19 08:04:41 -0800
+ Commit: 5479450, github.com/apache/spark/pull/3742
+
+ SPARK-3428. TaskMetrics for running tasks is missing GC time metrics
+ Sandy Ryza <sandy@cloudera.com>
+ 2014-12-18 22:40:44 -0800
+ Commit: 283263f, github.com/apache/spark/pull/3684
+
+ [SPARK-4674] Refactor getCallSite
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2014-12-18 21:41:02 -0800
+ Commit: d7fc69a, github.com/apache/spark/pull/3532
+
+ [SPARK-4728][MLLib] Add exponential, gamma, and log normal sampling to MLlib da...
+ RJ Nowling <rnowling@gmail.com>
+ 2014-12-18 21:00:49 -0800
+ Commit: ee1fb97, github.com/apache/spark/pull/3680
+
+ [SPARK-4861][SQL] Refactor command in spark sql
+ wangfei <wangfei1@huawei.com>, scwf <wangfei1@huawei.com>
+ 2014-12-18 20:24:56 -0800
+ Commit: c3d91da, github.com/apache/spark/pull/3712
+
+ [SPARK-4573] [SQL] Add SettableStructObjectInspector support in "wrap" function
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-12-18 20:21:52 -0800
+ Commit: ae9f128, github.com/apache/spark/pull/3429
+
+ [SPARK-2554][SQL] Supporting SumDistinct partial aggregation
+ ravipesala <ravindra.pesala@huawei.com>
+ 2014-12-18 20:19:10 -0800
+ Commit: 7687415, github.com/apache/spark/pull/3348
+
+ [SPARK-4693] [SQL] PruningPredicates may be wrong if predicates contains an empty AttributeSet() references
+ YanTangZhai <hakeemzhai@tencent.com>, yantangzhai <tyz0303@163.com>
+ 2014-12-18 20:13:46 -0800
+ Commit: e7de7e5, github.com/apache/spark/pull/3556
+
+ [SPARK-4756][SQL] FIX: sessionToActivePool grow infinitely, even as sessions expire
+ guowei2 <guowei2@asiainfo.com>
+ 2014-12-18 20:10:23 -0800
+ Commit: 22ddb6e, github.com/apache/spark/pull/3617
+
+ [SPARK-3928][SQL] Support wildcard matches on Parquet files.
+ Thu Kyaw <trk007@gmail.com>
+ 2014-12-18 20:08:32 -0800
+ Commit: b68bc6d, github.com/apache/spark/pull/3407
+
+ [SPARK-2663] [SQL] Support the Grouping Set
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-12-18 18:58:29 -0800
+ Commit: f728e0f, github.com/apache/spark/pull/1567
+
+ [SPARK-4754] Refactor SparkContext into ExecutorAllocationClient
+ Andrew Or <andrew@databricks.com>
+ 2014-12-18 17:37:42 -0800
+ Commit: 9804a75, github.com/apache/spark/pull/3614
+
+ [SPARK-4837] NettyBlockTransferService should use spark.blockManager.port config
+ Aaron Davidson <aaron@databricks.com>
+ 2014-12-18 16:43:16 -0800
+ Commit: 105293a, github.com/apache/spark/pull/3688
+
+ SPARK-4743 - Use SparkEnv.serializer instead of closureSerializer in aggregateByKey and foldByKey
+ Ivan Vergiliev <ivan@leanplum.com>
+ 2014-12-18 16:29:36 -0800
+ Commit: f9f58b9, github.com/apache/spark/pull/3605
+
+ [SPARK-4884]: Improve Partition docs
+ Madhu Siddalingaiah <madhu@madhu.com>
+ 2014-12-18 16:00:53 -0800
+ Commit: d5a596d, github.com/apache/spark/pull/3722
+
+ [SPARK-4880] remove spark.locality.wait in Analytics
+ Ernest <earneyzxl@gmail.com>
+ 2014-12-18 15:42:26 -0800
+ Commit: a7ed6f3, github.com/apache/spark/pull/3730
+
+ [SPARK-4887][MLlib] Fix a bad unittest in LogisticRegressionSuite
+ DB Tsai <dbtsai@alpinenow.com>
+ 2014-12-18 13:55:49 -0800
+ Commit: 59a49db, github.com/apache/spark/pull/3735
+
+ [SPARK-3607] ConnectionManager threads.max configs on the thread pools don't work
+ Ilya Ganelin <ilya.ganelin@capitalone.com>
+ 2014-12-18 12:53:18 -0800
+ Commit: 3720057, github.com/apache/spark/pull/3664
+
+ Add mesos specific configurations into doc
+ Timothy Chen <tnachen@gmail.com>
+ 2014-12-18 12:15:53 -0800
+ Commit: d9956f8, github.com/apache/spark/pull/3349
+
+ SPARK-3779. yarn spark.yarn.applicationMaster.waitTries config should be...
+ Sandy Ryza <sandy@cloudera.com>
+ 2014-12-18 12:19:07 -0600
+ Commit: 253b72b, github.com/apache/spark/pull/3471
+
+ [SPARK-4461][YARN] pass extra java options to yarn application master
+ Zhan Zhang <zhazhan@gmail.com>
+ 2014-12-18 10:01:46 -0600
+ Commit: 3b76469, github.com/apache/spark/pull/3409
+
+ [SPARK-4822] Use sphinx tags for Python doc annotations
+ lewuathe <lewuathe@me.com>
+ 2014-12-17 17:31:24 -0800
+ Commit: 3cd5161, github.com/apache/spark/pull/3685
+
+ MAINTENANCE: Automated closing of pull requests.
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-12-17 15:50:10 -0800
+ Commit: ca12608, github.com/apache/spark/pull/3137
+
+ [SPARK-3891][SQL] Add array support to percentile, percentile_approx and constant inspectors support
+ Venkata Ramana G <ramana.gollamudi@huawei.com>, Venkata Ramana Gollamudi <ramana.gollamudi@huawei.com>
+ 2014-12-17 15:41:35 -0800
+ Commit: f33d550, github.com/apache/spark/pull/2802
+
+ [SPARK-4856] [SQL] NullType instead of StringType when sampling against empty string or nul...
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-12-17 15:01:59 -0800
+ Commit: 8d0d2a6, github.com/apache/spark/pull/3708
+
+ [HOTFIX][SQL] Fix parquet filter suite
+ Michael Armbrust <michael@databricks.com>
+ 2014-12-17 14:27:02 -0800
+ Commit: 19c0faa, github.com/apache/spark/pull/3727
+
+ [SPARK-4821] [mllib] [python] [docs] Fix for pyspark.mllib.rand doc
+ Joseph K. Bradley <joseph@databricks.com>
+ 2014-12-17 14:12:46 -0800
+ Commit: affc3f4, github.com/apache/spark/pull/3669
+
+ [SPARK-3739] [SQL] Update the split num base on block size for table scanning
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-12-17 13:39:36 -0800
+ Commit: 636d9fc, github.com/apache/spark/pull/2589
+
+ [SPARK-4755] [SQL] sqrt(negative value) should return null
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2014-12-17 12:51:27 -0800
+ Commit: 902e4d5, github.com/apache/spark/pull/3616
+
+ [SPARK-4493][SQL] Don't pushdown Eq, NotEq, Lt, LtEq, Gt and GtEq predicates with nulls for Parquet
+ Cheng Lian <lian@databricks.com>
+ 2014-12-17 12:48:04 -0800
+ Commit: 6277135, github.com/apache/spark/pull/3367
+
+ [SPARK-3698][SQL] Fix case insensitive resolution of GetField.
+ Michael Armbrust <michael@databricks.com>
+ 2014-12-17 12:43:51 -0800
+ Commit: 7ad579e, github.com/apache/spark/pull/3724
+
+ [SPARK-4694]Fix HiveThriftServer2 can't stop in Yarn HA mode.
+ carlmartin <carlmartinmax@gmail.com>
+ 2014-12-17 12:24:03 -0800
+ Commit: 4782def, github.com/apache/spark/pull/3576
+
+ [SPARK-4625] [SQL] Add sort by for DSL & SimpleSqlParser
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-12-17 12:01:57 -0800
+ Commit: 5fdcbdc, github.com/apache/spark/pull/3481
+
+ [SPARK-4595][Core] Fix MetricsServlet not work issue
+ Saisai Shao <saisai.shao@intel.com>, Josh Rosen <joshrosen@databricks.com>, jerryshao <saisai.shao@intel.com>
+ 2014-12-17 11:47:44 -0800
+ Commit: cf50631, github.com/apache/spark/pull/3444
+
+ [HOTFIX] Fix RAT exclusion for known_translations file
+ Josh Rosen <joshrosen@databricks.com>
+ 2014-12-16 23:00:25 -0800
+ Commit: 3d0c37b, github.com/apache/spark/pull/3719
+
+ [Release] Update contributors list format and sort it
+ Andrew Or <andrew@databricks.com>
+ 2014-12-16 22:11:03 -0800
+ Commit: 4e1112e
+
+ [SPARK-4618][SQL] Make foreign DDL commands options case-insensitive
+ scwf <wangfei1@huawei.com>, wangfei <wangfei1@huawei.com>
+ 2014-12-16 21:26:36 -0800
+ Commit: 6069880, github.com/apache/spark/pull/3470
+
+ [SPARK-4866] support StructType as key in MapType
+ Davies Liu <davies@databricks.com>
+ 2014-12-16 21:23:28 -0800
+ Commit: ec5c427, github.com/apache/spark/pull/3714
+
+ [SPARK-4375] [SQL] Add 0 argument support for udf
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-12-16 21:21:11 -0800
+ Commit: 770d815, github.com/apache/spark/pull/3595
+
+ [SPARK-4720][SQL] Remainder should also return null if the divider is 0.
+ Takuya UESHIN <ueshin@happy-camper.st>
+ 2014-12-16 21:19:57 -0800
+ Commit: ddc7ba3, github.com/apache/spark/pull/3581
+
+ [SPARK-4744] [SQL] Short circuit evaluation for AND & OR in CodeGen
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-12-16 21:18:39 -0800
+ Commit: 0aa834a, github.com/apache/spark/pull/3606
+
+ [SPARK-4798][SQL] A new set of Parquet testing API and test suites
+ Cheng Lian <lian@databricks.com>
+ 2014-12-16 21:16:03 -0800
+ Commit: 3b395e1, github.com/apache/spark/pull/3644
+
+ [Release] Cache known author translations locally
+ Andrew Or <andrew@databricks.com>
+ 2014-12-16 19:28:43 -0800
+ Commit: b85044e
+
+ [Release] Major improvements to generate contributors script
+ Andrew Or <andrew@databricks.com>
+ 2014-12-16 17:55:27 -0800
+ Commit: 6f80b74
+
+ [SPARK-4269][SQL] make wait time configurable in BroadcastHashJoin
+ Jacky Li <jacky.likun@huawei.com>
+ 2014-12-16 15:34:59 -0800
+ Commit: fa66ef6, github.com/apache/spark/pull/3133
+
+ [SPARK-4827][SQL] Fix resolution of deeply nested Project(attr, Project(Star,...)).
+ Michael Armbrust <michael@databricks.com>
+ 2014-12-16 15:31:19 -0800
+ Commit: a66c23e, github.com/apache/spark/pull/3674
+
+ [SPARK-4483][SQL]Optimization about reduce memory costs during the HashOuterJoin
+ tianyi <tianyi@asiainfo-linkage.com>, tianyi <tianyi.asiainfo@gmail.com>
+ 2014-12-16 15:22:29 -0800
+ Commit: 30f6b85, github.com/apache/spark/pull/3375
+
+ [SPARK-4527][SQL]Add BroadcastNestedLoopJoin operator selection testsuite
+ wangxiaojing <u9jing@gmail.com>
+ 2014-12-16 14:45:56 -0800
+ Commit: ea1315e, github.com/apache/spark/pull/3395
+
+ SPARK-4767: Add support for launching in a specified placement group to spark_ec2
+ Holden Karau <holden@pigscanfly.ca>
+ 2014-12-16 14:37:04 -0800
+ Commit: b0dfdbd, github.com/apache/spark/pull/3623
+
+ [SPARK-4812][SQL] Fix the initialization issue of 'codegenEnabled'
+ zsxwing <zsxwing@gmail.com>
+ 2014-12-16 14:13:40 -0800
+ Commit: 6530243, github.com/apache/spark/pull/3660
+
+ [SPARK-4847][SQL]Fix "extraStrategies cannot take effect in SQLContext" issue
+ jerryshao <saisai.shao@intel.com>
+ 2014-12-16 14:08:28 -0800
+ Commit: dc8280d, github.com/apache/spark/pull/3698
+
+ [DOCS][SQL] Add a Note on jsonFile having separate JSON objects per line
+ Peter Vandenabeele <peter@vandenabeele.com>
+ 2014-12-16 13:57:55 -0800
+ Commit: 1a9e35e, github.com/apache/spark/pull/3517
+
+ [SQL] SPARK-4700: Add HTTP protocol spark thrift server
+ Judy Nash <judynash@microsoft.com>, judynash <judynash@microsoft.com>
+ 2014-12-16 12:37:26 -0800
+ Commit: 17688d1, github.com/apache/spark/pull/3672
+
+ [SPARK-3405] add subnet-id and vpc-id options to spark_ec2.py
+ Mike Jennings <mvj101@gmail.com>, Mike Jennings <mvj@google.com>
+ 2014-12-16 12:13:21 -0800
+ Commit: d12c071, github.com/apache/spark/pull/2872
+
+ [SPARK-4855][mllib] testing the Chi-squared hypothesis test
+ jbencook <jbenjamincook@gmail.com>
+ 2014-12-16 11:37:23 -0800
+ Commit: cb48447, github.com/apache/spark/pull/3679
+
+ [SPARK-4437] update doc for WholeCombineFileRecordReader
+ Davies Liu <davies@databricks.com>, Josh Rosen <joshrosen@databricks.com>
+ 2014-12-16 11:19:36 -0800
+ Commit: ed36200, github.com/apache/spark/pull/3301
+
+ [SPARK-4841] fix zip with textFile()
+ Davies Liu <davies@databricks.com>
+ 2014-12-15 22:58:26 -0800
+ Commit: c246b95, github.com/apache/spark/pull/3706
+
+ [SPARK-4792] Add error message when making local dir unsuccessfully
+ meiyoula <1039320815@qq.com>
+ 2014-12-15 22:30:18 -0800
+ Commit: c762877, github.com/apache/spark/pull/3635
+
+ SPARK-4814 [CORE] Enable assertions in SBT, Maven tests / AssertionError from Hive's LazyBinaryInteger
+ Sean Owen <sowen@cloudera.com>
+ 2014-12-15 17:12:05 -0800
+ Commit: 81112e4, github.com/apache/spark/pull/3692
+
+ [Minor][Core] fix comments in MapOutputTracker
+ wangfei <wangfei1@huawei.com>
+ 2014-12-15 16:46:21 -0800
+ Commit: 5c24759, github.com/apache/spark/pull/3700
+
+ SPARK-785 [CORE] ClosureCleaner not invoked on most PairRDDFunctions
+ Sean Owen <sowen@cloudera.com>
+ 2014-12-15 16:06:15 -0800
+ Commit: 2a28bc6, github.com/apache/spark/pull/3690
+
+ [SPARK-4668] Fix some documentation typos.
+ Ryan Williams <ryan.blake.williams@gmail.com>
+ 2014-12-15 14:52:17 -0800
+ Commit: 8176b7a, github.com/apache/spark/pull/3523
+
+ [SPARK-1037] The name of findTaskFromList & findTask in TaskSetManager.scala is confusing
+ Ilya Ganelin <ilya.ganelin@capitalone.com>
+ 2014-12-15 14:51:15 -0800
+ Commit: 38703bb, github.com/apache/spark/pull/3665
+
+ [SPARK-4826] Fix generation of temp file names in WAL tests
+ Josh Rosen <joshrosen@databricks.com>
+ 2014-12-15 14:33:43 -0800
+ Commit: f6b8591, github.com/apache/spark/pull/3695
+
+ [SPARK-4494][mllib] IDFModel.transform() add support for single vector
+ Yuu ISHIKAWA <yuu.ishikawa@gmail.com>
+ 2014-12-15 13:44:15 -0800
+ Commit: 8098fab, github.com/apache/spark/pull/3603
+
+ HOTFIX: Disabling failing block manager test
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-12-15 10:54:45 -0800
+ Commit: 4c06738
+
+ fixed spelling errors in documentation
+ Peter Klipfel <peter@klipfel.me>
+ 2014-12-14 00:01:16 -0800
+ Commit: 2a2983f, github.com/apache/spark/pull/3691
+
+ MAINTENANCE: Automated closing of pull requests.
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-12-11 23:38:40 -0800
+ Commit: ef84dab, github.com/apache/spark/pull/3488
+
+ [SPARK-4829] [SQL] add rule to fold count(expr) if expr is not null
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2014-12-11 22:56:42 -0800
+ Commit: 41a3f93, github.com/apache/spark/pull/3676
+
+ [SPARK-4742][SQL] The name of Parquet File generated by AppendingParquetOutputFormat should be zero padded
+ Sasaki Toru <sasakitoa@nttdata.co.jp>
+ 2014-12-11 22:54:21 -0800
+ Commit: 8091dd6, github.com/apache/spark/pull/3602
+
+ [SPARK-4825] [SQL] CTAS fails to resolve when created using saveAsTable
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-12-11 22:51:49 -0800
+ Commit: 0abbff2, github.com/apache/spark/pull/3673
+
+ [SQL] enable empty aggr test case
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2014-12-11 22:50:18 -0800
+ Commit: cbb634a, github.com/apache/spark/pull/3445
+
+ [SPARK-4828] [SQL] sum and avg on empty table should always return null
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2014-12-11 22:49:27 -0800
+ Commit: acb3be6, github.com/apache/spark/pull/3675
+
+ [SQL] Remove unnecessary case in HiveContext.toHiveString
+ scwf <wangfei1@huawei.com>
+ 2014-12-11 22:48:03 -0800
+ Commit: d8cf678, github.com/apache/spark/pull/3563
+
+ [SPARK-4293][SQL] Make Cast be able to handle complex types.
+ Takuya UESHIN <ueshin@happy-camper.st>
+ 2014-12-11 22:45:25 -0800
+ Commit: 3344803, github.com/apache/spark/pull/3150
+
+ [SPARK-4639] [SQL] Pass maxIterations in as a parameter in Analyzer
+ Jacky Li <jacky.likun@huawei.com>
+ 2014-12-11 22:44:27 -0800
+ Commit: c152dde, github.com/apache/spark/pull/3499
+
+ [SPARK-4662] [SQL] Whitelist more unittest
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-12-11 22:43:02 -0800
+ Commit: a7f07f5, github.com/apache/spark/pull/3522
+
+ [SPARK-4713] [SQL] SchemaRDD.unpersist() should not raise exception if it is not persisted
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-12-11 22:41:36 -0800
+ Commit: bf40cf8, github.com/apache/spark/pull/3572
+
+ [SPARK-4806] Streaming doc update for 1.2
+ Tathagata Das <tathagata.das1565@gmail.com>, Josh Rosen <joshrosen@databricks.com>, Josh Rosen <rosenville@gmail.com>
+ 2014-12-11 06:21:23 -0800
+ Commit: b004150, github.com/apache/spark/pull/3653
+
+ [SPARK-4791] [sql] Infer schema from case class with multiple constructors
+ Joseph K. Bradley <joseph@databricks.com>
+ 2014-12-10 23:41:15 -0800
+ Commit: 2a5b5fd, github.com/apache/spark/pull/3646
+
+ [CORE]codeStyle: uniform ConcurrentHashMap define in StorageLevel.scala with other places
+ Zhang, Liye <liye.zhang@intel.com>
+ 2014-12-10 20:44:59 -0800
+ Commit: 57d37f9, github.com/apache/spark/pull/2793
+
+ SPARK-3526 Add section about data locality to the tuning guide
+ Andrew Ash <andrew@andrewash.com>
+ 2014-12-10 15:01:15 -0800
+ Commit: 652b781, github.com/apache/spark/pull/2519
+
+ MAINTENANCE: Automated closing of pull requests.
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-12-10 14:41:16 -0800
+ Commit: 36bdb5b, github.com/apache/spark/pull/2883
+
+ [SPARK-4759] Fix driver hanging from coalescing partitions
+ Andrew Or <andrew@databricks.com>
+ 2014-12-10 14:27:53 -0800
+ Commit: 4f93d0c, github.com/apache/spark/pull/3633
+
+ [SPARK-4569] Rename 'externalSorting' in Aggregator
+ Ilya Ganelin <ilya.ganelin@capitalone.com>
+ 2014-12-10 14:19:37 -0800
+ Commit: 447ae2d, github.com/apache/spark/pull/3666
+
+ [SPARK-4793] [Deploy] ensure .jar at end of line
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2014-12-10 13:29:27 -0800
+ Commit: e230da1, github.com/apache/spark/pull/3641
+
+ [SPARK-4215] Allow requesting / killing executors only in YARN mode
+ Andrew Or <andrew@databricks.com>
+ 2014-12-10 12:48:24 -0800
+ Commit: faa8fd8, github.com/apache/spark/pull/3615
+
+ [SPARK-4771][Docs] Document standalone cluster supervise mode
+ Andrew Or <andrew@databricks.com>
+ 2014-12-10 12:41:36 -0800
+ Commit: 5621283, github.com/apache/spark/pull/3627
+
+ [SPARK-4329][WebUI] HistoryPage pagination
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2014-12-10 12:29:00 -0800
+ Commit: 0fc637b, github.com/apache/spark/pull/3194
+
+ [SPARK-4161]Spark shell class path is not correctly set if "spark.driver.extraClassPath" is set in defaults.conf
+ GuoQiang Li <witgo@qq.com>
+ 2014-12-10 12:24:04 -0800
+ Commit: 742e709, github.com/apache/spark/pull/3050
+
+ [SPARK-4772] Clear local copies of accumulators as soon as we're done with them
+ Nathan Kronenfeld <nkronenfeld@oculusinfo.com>
+ 2014-12-09 23:53:17 -0800
+ Commit: 94b377f, github.com/apache/spark/pull/3570
+
+ [Minor] Use <sup> tag for help icon in web UI page header
+ Josh Rosen <joshrosen@databricks.com>
+ 2014-12-09 23:47:05 -0800
+ Commit: f79c1cf, github.com/apache/spark/pull/3659
+
+ Config updates for the new shuffle transport.
+ Reynold Xin <rxin@databricks.com>
+ 2014-12-09 19:29:09 -0800
+ Commit: 9bd9334, github.com/apache/spark/pull/3657
+
+ [SPARK-4740] Create multiple concurrent connections between two peer nodes in Netty.
+ Reynold Xin <rxin@databricks.com>
+ 2014-12-09 17:49:59 -0800
+ Commit: 2b9b726, github.com/apache/spark/pull/3625
+
+ SPARK-4805 [CORE] BlockTransferMessage.toByteArray() trips assertion
+ Sean Owen <sowen@cloudera.com>
+ 2014-12-09 16:38:27 -0800
+ Commit: d8f84f2, github.com/apache/spark/pull/3650
+
+ SPARK-4567. Make SparkJobInfo and SparkStageInfo serializable
+ Sandy Ryza <sandy@cloudera.com>
+ 2014-12-09 16:26:07 -0800
+ Commit: 5e4c06f, github.com/apache/spark/pull/3426
+
+ [SPARK-4714] BlockManager.dropFromMemory() should check whether block has been removed after synchronizing on BlockInfo instance.
+ hushan[胡珊] <hushan@xiaomi.com>
+ 2014-12-09 15:11:20 -0800
+ Commit: 30dca92, github.com/apache/spark/pull/3574
+
+ [SPARK-4765] Make GC time always shown in UI.
+ Kay Ousterhout <kayousterhout@gmail.com>
+ 2014-12-09 15:10:36 -0800
+ Commit: 1f51106, github.com/apache/spark/pull/3622
+
+ [SPARK-4691][shuffle] Restructure a few lines in shuffle code
+ maji2014 <maji3@asiainfo.com>
+ 2014-12-09 13:13:12 -0800
+ Commit: b310744, github.com/apache/spark/pull/3553
+
+ [SPARK-874] adding a --wait flag
+ jbencook <jbenjamincook@gmail.com>
+ 2014-12-09 12:16:19 -0800
+ Commit: 61f1a70, github.com/apache/spark/pull/3567
+
+ SPARK-4338. [YARN] Ditch yarn-alpha.
+ Sandy Ryza <sandy@cloudera.com>
+ 2014-12-09 11:02:43 -0800
+ Commit: 912563a, github.com/apache/spark/pull/3215
+
+ [SPARK-4785][SQL] Initialize Hive UDFs on the driver and serialize them with a wrapper
+ Cheng Hao <hao.cheng@intel.com>, Cheng Lian <lian@databricks.com>
+ 2014-12-09 10:28:15 -0800
+ Commit: 383c555, github.com/apache/spark/pull/3640
+
+ [SPARK-3154][STREAMING] Replace ConcurrentHashMap with mutable.HashMap and remove @volatile from 'stopped'
+ zsxwing <zsxwing@gmail.com>
+ 2014-12-08 23:54:15 -0800
+ Commit: bcb5cda, github.com/apache/spark/pull/3634
+
+ [SPARK-4769] [SQL] CTAS does not work when reading from temporary tables
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-12-08 17:39:12 -0800
+ Commit: 51b1fe1, github.com/apache/spark/pull/3336
+
+ [SQL] remove unnecessary import in spark-sql
+ Jacky Li <jacky.likun@huawei.com>
+ 2014-12-08 17:27:46 -0800
+ Commit: 9443843, github.com/apache/spark/pull/3630
+
+ SPARK-4770. [DOC] [YARN] spark.scheduler.minRegisteredResourcesRatio doc...
+ Sandy Ryza <sandy@cloudera.com>
+ 2014-12-08 16:28:36 -0800
+ Commit: cda94d1, github.com/apache/spark/pull/3624
+
+ SPARK-3926 [CORE] Reopened: result of JavaRDD collectAsMap() is not serializable
+ Sean Owen <sowen@cloudera.com>
+ 2014-12-08 16:13:03 -0800
+ Commit: e829bfa, github.com/apache/spark/pull/3587
+
+ [SPARK-4750] Dynamic allocation - synchronize kills
+ Andrew Or <andrew@databricks.com>
+ 2014-12-08 16:02:33 -0800
+ Commit: 65f929d, github.com/apache/spark/pull/3612
+
+ [SPARK-4774] [SQL] Makes HiveFromSpark more portable
+ Kostas Sakellis <kostas@cloudera.com>
+ 2014-12-08 15:44:18 -0800
+ Commit: d6a972b, github.com/apache/spark/pull/3628
+
+ [SPARK-4764] Ensure that files are fetched atomically
+ Christophe Préaud <christophe.preaud@kelkoo.com>
+ 2014-12-08 11:44:54 -0800
+ Commit: ab2abcb, github.com/apache/spark/pull/2855
+
+ [SPARK-4620] Add unpersist in Graph and GraphImpl
+ Takeshi Yamamuro <linguin.m.s@gmail.com>
+ 2014-12-07 19:42:02 -0800
+ Commit: 8817fc7, github.com/apache/spark/pull/3476
+
+ [SPARK-4646] Replace Scala.util.Sorting.quickSort with Sorter(TimSort) in Spark
+ Takeshi Yamamuro <linguin.m.s@gmail.com>
+ 2014-12-07 19:36:08 -0800
+ Commit: 2e6b736, github.com/apache/spark/pull/3507
+
+ [SPARK-3623][GraphX] GraphX should support the checkpoint operation
+ GuoQiang Li <witgo@qq.com>
+ 2014-12-06 00:56:51 -0800
+ Commit: e895e0c, github.com/apache/spark/pull/2631
+
+ Streaming doc : do you mean inadvertently?
+ CrazyJvm <crazyjvm@gmail.com>
+ 2014-12-05 13:42:13 -0800
+ Commit: 6eb1b6f, github.com/apache/spark/pull/3620
+
+ [SPARK-4005][CORE] handle message replies in receive instead of in the individual private methods
+ Zhang, Liye <liye.zhang@intel.com>
+ 2014-12-05 12:00:32 -0800
+ Commit: 98a7d09, github.com/apache/spark/pull/2853
+
+ [SPARK-4761][SQL] Enables Kryo by default in Spark SQL Thrift server
+ Cheng Lian <lian@databricks.com>
+ 2014-12-05 10:27:40 -0800
+ Commit: 6f61e1f, github.com/apache/spark/pull/3621
+
+ [SPARK-4753][SQL] Use catalyst for partition pruning in newParquet.
+ Michael Armbrust <michael@databricks.com>
+ 2014-12-04 22:25:21 -0800
+ Commit: f5801e8, github.com/apache/spark/pull/3613
+
+ Revert "SPARK-2624 add datanucleus jars to the container in yarn-cluster"
+ Andrew Or <andrew@databricks.com>
+ 2014-12-04 21:53:49 -0800
+ Commit: fd85253
+
+ Revert "[HOT FIX] [YARN] Check whether `/lib` exists before listing its files"
+ Andrew Or <andrew@databricks.com>
+ 2014-12-04 21:53:38 -0800
+ Commit: 87437df
+
+ [SPARK-4464] Description about configuration options need to be modified in docs.
+ Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
+ 2014-12-04 19:33:02 -0800
+ Commit: ca37903, github.com/apache/spark/pull/3329
+
+ Fix typo in Spark SQL docs.
+ Andy Konwinski <andykonwinski@gmail.com>
+ 2014-12-04 18:27:02 -0800
+ Commit: 15cf3b0, github.com/apache/spark/pull/3611
+
+ [SPARK-4421] Wrong link in spark-standalone.html
+ Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
+ 2014-12-04 18:14:36 -0800
+ Commit: ddfc09c, github.com/apache/spark/pull/3279
+
+ [SPARK-4397] Move object RDD to the front of RDD.scala.
+ Reynold Xin <rxin@databricks.com>
+ 2014-12-04 16:32:20 -0800
+ Commit: ed92b47, github.com/apache/spark/pull/3580
+
+ [SPARK-4652][DOCS] Add docs about spark-git-repo option
+ lewuathe <lewuathe@me.com>, Josh Rosen <joshrosen@databricks.com>
+ 2014-12-04 15:14:36 -0800
+ Commit: ab8177d, github.com/apache/spark/pull/3513
+
+ [SPARK-4459] Change groupBy type parameter from K to U
+ Saldanha <saldaal1@phusca-l24858.wlan.na.novartis.net>
+ 2014-12-04 14:22:09 -0800
+ Commit: 743a889, github.com/apache/spark/pull/3327
+
+ [SPARK-4745] Fix get_existing_cluster() function with multiple security groups
+ alexdebrie <alexdebrie1@gmail.com>
+ 2014-12-04 14:13:59 -0800
+ Commit: 794f3ae, github.com/apache/spark/pull/3596
+
+ [HOTFIX] Fixing two issues with the release script.
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-12-04 12:11:41 -0800
+ Commit: 8dae26f, github.com/apache/spark/pull/3608
+
+ [SPARK-4253] Ignore spark.driver.host in yarn-cluster and standalone-cluster modes
+ WangTaoTheTonic <barneystinson@aliyun.com>, WangTao <barneystinson@aliyun.com>
+ 2014-12-04 11:52:47 -0800
+ Commit: 8106b1e, github.com/apache/spark/pull/3112
+
+ [SPARK-4683][SQL] Add a beeline.cmd to run on Windows
+ Cheng Lian <lian@databricks.com>
+ 2014-12-04 10:21:03 -0800
+ Commit: 28c7aca, github.com/apache/spark/pull/3599
+
+ [FIX][DOC] Fix broken links in ml-guide.md
+ Xiangrui Meng <meng@databricks.com>
+ 2014-12-04 20:16:35 +0800
+ Commit: 7e758d7, github.com/apache/spark/pull/3601
+
+ [SPARK-4575] [mllib] [docs] spark.ml pipelines doc + bug fixes
+ Joseph K. Bradley <joseph@databricks.com>, jkbradley <joseph.kurata.bradley@gmail.com>, Xiangrui Meng <meng@databricks.com>
+ 2014-12-04 17:00:06 +0800
+ Commit: 469a6e5, github.com/apache/spark/pull/3588
+
+ [docs] Fix outdated comment in tuning guide
+ Joseph K. Bradley <joseph@databricks.com>
+ 2014-12-04 00:59:32 -0800
+ Commit: 529439b, github.com/apache/spark/pull/3592
+
+ [SQL] Minor: Avoid calling Seq#size in a loop
+ Aaron Davidson <aaron@databricks.com>
+ 2014-12-04 00:58:42 -0800
+ Commit: c6c7165, github.com/apache/spark/pull/3593
+
+ [SPARK-4685] Include all spark.ml and spark.mllib packages in JavaDoc's MLlib group
+ lewuathe <lewuathe@me.com>, Xiangrui Meng <meng@databricks.com>
+ 2014-12-04 16:51:41 +0800
+ Commit: 20bfea4, github.com/apache/spark/pull/3554
+
+ [SPARK-4719][API] Consolidate various narrow dep RDD classes with MapPartitionsRDD
+ Reynold Xin <rxin@databricks.com>
+ 2014-12-04 00:45:57 -0800
+ Commit: c3ad486, github.com/apache/spark/pull/3578
+
+ [SQL] remove unnecessary import
+ Jacky Li <jacky.likun@huawei.com>
+ 2014-12-04 00:43:55 -0800
+ Commit: ed88db4, github.com/apache/spark/pull/3585
+
+ MAINTENANCE: Automated closing of pull requests.
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-12-03 22:15:46 -0800
+ Commit: 3cdae03, github.com/apache/spark/pull/1875
+
+ [Release] Correctly translate contributors name in release notes
+ Andrew Or <andrew@databricks.com>
+ 2014-12-03 19:08:29 -0800
+ Commit: a4dfb4e
+
+ [SPARK-4580] [SPARK-4610] [mllib] [docs] Documentation for tree ensembles + DecisionTree API fix
+ Joseph K. Bradley <joseph@databricks.com>, Joseph K. Bradley <joseph.kurata.bradley@gmail.com>
+ 2014-12-04 09:57:50 +0800
+ Commit: 657a888, github.com/apache/spark/pull/3461
+
+ [SPARK-4711] [mllib] [docs] Programming guide advice on choosing optimizer
+ Joseph K. Bradley <joseph@databricks.com>
+ 2014-12-04 08:58:03 +0800
+ Commit: 27ab0b8, github.com/apache/spark/pull/3569
+
+ [SPARK-4085] Propagate FetchFailedException when Spark fails to read local shuffle file.
+ Reynold Xin <rxin@databricks.com>
+ 2014-12-03 16:28:24 -0800
+ Commit: 1826372, github.com/apache/spark/pull/3579
+
+ [SPARK-4498][core] Don't transition ExecutorInfo to RUNNING until Driver adds Executor
+ Mark Hamstra <markhamstra@gmail.com>
+ 2014-12-03 15:08:01 -0800
+ Commit: 96b2785, github.com/apache/spark/pull/3550
+
+ [SPARK-4552][SQL] Avoid exception when reading empty parquet data through Hive
+ Michael Armbrust <michael@databricks.com>
+ 2014-12-03 14:13:35 -0800
+ Commit: 513ef82, github.com/apache/spark/pull/3586
+
+ [HOT FIX] [YARN] Check whether `/lib` exists before listing its files
+ Andrew Or <andrew@databricks.com>
+ 2014-12-03 13:56:23 -0800
+ Commit: 90ec643, github.com/apache/spark/pull/3589
+
+ [SPARK-4642] Add description about spark.yarn.queue to running-on-YARN document.
+ Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
+ 2014-12-03 13:16:24 -0800
+ Commit: 692f493, github.com/apache/spark/pull/3500
+
+ [SPARK-4715][Core] Make sure tryToAcquire won't return a negative value
+ zsxwing <zsxwing@gmail.com>
+ 2014-12-03 12:19:40 -0800
+ Commit: edd3cd4, github.com/apache/spark/pull/3575
+
+ [SPARK-4701] Typo in sbt/sbt
+ Masayoshi TSUZUKI <tsudukim@oss.nttdata.co.jp>
+ 2014-12-03 12:08:00 -0800
+ Commit: 96786e3, github.com/apache/spark/pull/3560
+
+ SPARK-2624 add datanucleus jars to the container in yarn-cluster
+ Jim Lim <jim@quixey.com>
+ 2014-12-03 11:16:02 -0800
+ Commit: a975dc3, github.com/apache/spark/pull/3238
+
+ [SPARK-4717][MLlib] Optimize BLAS library to avoid de-reference multiple times in loop
+ DB Tsai <dbtsai@alpinenow.com>
+ 2014-12-03 22:31:39 +0800
+ Commit: d005429, github.com/apache/spark/pull/3577
+
+ [SPARK-4708][MLLib] Make k-mean runs two/three times faster with dense/sparse sample
+ DB Tsai <dbtsai@alpinenow.com>
+ 2014-12-03 19:01:56 +0800
+ Commit: 7fc49ed, github.com/apache/spark/pull/3565
+
+ [SPARK-4710] [mllib] Eliminate MLlib compilation warnings
+ Joseph K. Bradley <joseph@databricks.com>
+ 2014-12-03 18:50:03 +0800
+ Commit: 4ac2151, github.com/apache/spark/pull/3568
+
+ [SPARK-4397][Core] Change the 'since' value of '@deprecated' to '1.3.0'
+ zsxwing <zsxwing@gmail.com>
+ 2014-12-03 02:05:17 -0800
+ Commit: 8af551f, github.com/apache/spark/pull/3573
+
+ [SPARK-4672][Core]Checkpoint() should clear f to shorten the serialization chain
+ JerryLead <JerryLead@163.com>, Lijie Xu <csxulijie@gmail.com>
+ 2014-12-02 23:53:29 -0800
+ Commit: 77be8b9, github.com/apache/spark/pull/3545
+
+ [SPARK-4672][GraphX]Non-transient PartitionsRDDs will lead to StackOverflow error
+ JerryLead <JerryLead@163.com>, Lijie Xu <csxulijie@gmail.com>
+ 2014-12-02 17:14:11 -0800
+ Commit: 17c162f, github.com/apache/spark/pull/3544
+
+ [SPARK-4672][GraphX]Perform checkpoint() on PartitionsRDD to shorten the lineage
+ JerryLead <JerryLead@163.com>, Lijie Xu <csxulijie@gmail.com>
+ 2014-12-02 17:08:02 -0800
+ Commit: fc0a147, github.com/apache/spark/pull/3549
+
+ [Release] Translate unknown author names automatically
+ Andrew Or <andrew@databricks.com>
+ 2014-12-02 16:36:12 -0800
+ Commit: 5da21f0
+
+ Minor nit style cleanup in GraphX.
+ Reynold Xin <rxin@databricks.com>
+ 2014-12-02 14:40:26 -0800
+ Commit: 2d4f6e7
+
+ [SPARK-4695][SQL] Get result using executeCollect
+ wangfei <wangfei1@huawei.com>
+ 2014-12-02 14:30:44 -0800
+ Commit: 3ae0cda, github.com/apache/spark/pull/3547
+
+ [SPARK-4670] [SQL] wrong symbol for bitwise not
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2014-12-02 14:25:12 -0800
+ Commit: 1f5ddf1, github.com/apache/spark/pull/3528
+
+ [SPARK-4593][SQL] Return null when denominator is 0
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2014-12-02 14:21:12 -0800
+ Commit: f6df609, github.com/apache/spark/pull/3443
+
+ [SPARK-4676][SQL] JavaSchemaRDD.schema may throw NullType MatchError if sql has null
+ YanTangZhai <hakeemzhai@tencent.com>, yantangzhai <tyz0303@163.com>, Michael Armbrust <michael@databricks.com>
+ 2014-12-02 14:12:48 -0800
+ Commit: 1066427, github.com/apache/spark/pull/3538
+
+ [SPARK-4663][sql]add finally to avoid resource leak
+ baishuo <vc_java@hotmail.com>
+ 2014-12-02 12:12:03 -0800
+ Commit: 69b6fed, github.com/apache/spark/pull/3526
+
+ [SPARK-4536][SQL] Add sqrt and abs to Spark SQL DSL
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2014-12-02 12:07:52 -0800
+ Commit: e75e04f, github.com/apache/spark/pull/3401
+
+ Indent license header properly for interfaces.scala.
+ Reynold Xin <rxin@databricks.com>
+ 2014-12-02 11:59:15 -0800
+ Commit: b1f8fe3, github.com/apache/spark/pull/3552
+
+ [SPARK-4686] Link to allowed master URLs is broken
+ Kay Ousterhout <kayousterhout@gmail.com>
+ 2014-12-02 09:06:02 -0800
+ Commit: d9a148b, github.com/apache/spark/pull/3542
+
+ [SPARK-4397][Core] Cleanup 'import SparkContext._' in core
+ zsxwing <zsxwing@gmail.com>
+ 2014-12-02 00:18:41 -0800
+ Commit: 6dfe38a, github.com/apache/spark/pull/3530
+
+ [SPARK-4611][MLlib] Implement the efficient vector norm
+ DB Tsai <dbtsai@alpinenow.com>
+ 2014-12-02 11:40:43 +0800
+ Commit: 64f3175, github.com/apache/spark/pull/3462
+
+ MAINTENANCE: Automated closing of pull requests.
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-12-01 17:27:14 -0800
+ Commit: b0a46d8, github.com/apache/spark/pull/1612
+
+ [SPARK-4268][SQL] Use #::: to get benefit from Stream in SqlLexical.allCaseVersions
+ zsxwing <zsxwing@gmail.com>
+ 2014-12-01 16:39:54 -0800
+ Commit: d3e02dd, github.com/apache/spark/pull/3132
+
+ [SPARK-4529] [SQL] support view with column alias
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2014-12-01 16:08:51 -0800
+ Commit: 4df60a8, github.com/apache/spark/pull/3396
+
+ [SQL][DOC] Date type in SQL programming guide
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2014-12-01 14:03:57 -0800
+ Commit: 5edbcbf, github.com/apache/spark/pull/3535
+
+ [SQL] Minor fix for doc and comment
+ wangfei <wangfei1@huawei.com>
+ 2014-12-01 14:02:02 -0800
+ Commit: 7b79957, github.com/apache/spark/pull/3533
+
+ [SPARK-4658][SQL] Code documentation issue in DDL of datasource API
+ ravipesala <ravindra.pesala@huawei.com>
+ 2014-12-01 13:31:27 -0800
+ Commit: bc35381, github.com/apache/spark/pull/3516
+
+ [SPARK-4650][SQL] Support multiple columns in countDistinct function like count(distinct c1,c2..) in Spark SQL
+ ravipesala <ravindra.pesala@huawei.com>, Michael Armbrust <michael@databricks.com>
+ 2014-12-01 13:26:44 -0800
+ Commit: 6a9ff19, github.com/apache/spark/pull/3511
+
+ [SPARK-4358][SQL] Let BigDecimal do checking type compatibility
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2014-12-01 13:17:56 -0800
+ Commit: b57365a, github.com/apache/spark/pull/3208
+
+ [SQL] add @group tab in limit() and count()
+ Jacky Li <jacky.likun@gmail.com>
+ 2014-12-01 13:12:30 -0800
+ Commit: bafee67, github.com/apache/spark/pull/3458
+
+ [SPARK-4258][SQL][DOC] Documents spark.sql.parquet.filterPushdown
+ Cheng Lian <lian@databricks.com>
+ 2014-12-01 13:09:51 -0800
+ Commit: 5db8dca, github.com/apache/spark/pull/3440
+
+ Documentation: add description for repartitionAndSortWithinPartitions
+ Madhu Siddalingaiah <madhu@madhu.com>
+ 2014-12-01 08:45:34 -0800
+ Commit: 2b233f5, github.com/apache/spark/pull/3390
+
+ [SPARK-4661][Core] Minor code and docs cleanup
+ zsxwing <zsxwing@gmail.com>
+ 2014-12-01 00:35:01 -0800
+ Commit: 30a86ac, github.com/apache/spark/pull/3521
+
+ [SPARK-4664][Core] Throw an exception when spark.akka.frameSize > 2047
+ zsxwing <zsxwing@gmail.com>
+ 2014-12-01 00:32:54 -0800
+ Commit: 1d238f2, github.com/apache/spark/pull/3527
+
+ SPARK-2192 [BUILD] Examples Data Not in Binary Distribution
+ Sean Owen <sowen@cloudera.com>
+ 2014-12-01 16:31:04 +0800
+ Commit: 6384f42, github.com/apache/spark/pull/3480
+
+ Fix wrong file name pattern in .gitignore
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2014-12-01 00:29:28 -0800
+ Commit: 97eb6d7, github.com/apache/spark/pull/3529
+
+ [SPARK-4632] version update
+ Prabeesh K <prabsmails@gmail.com>
+ 2014-11-30 20:51:53 -0800
+ Commit: 5e7a6dc, github.com/apache/spark/pull/3495
+
+ MAINTENANCE: Automated closing of pull requests.
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-11-30 20:51:13 -0800
+ Commit: 06dc1b1, github.com/apache/spark/pull/2915
+
+ [DOC] Fixes formatting typo in SQL programming guide
+ Cheng Lian <lian@databricks.com>
+ 2014-11-30 19:04:07 -0800
+ Commit: 2a4d389, github.com/apache/spark/pull/3498
+
+ [SPARK-4656][Doc] Typo in Programming Guide markdown
+ lewuathe <lewuathe@me.com>
+ 2014-11-30 17:18:50 -0800
+ Commit: a217ec5, github.com/apache/spark/pull/3412
+
+ [SPARK-4623] Add some error information if using spark-sql in yarn-cluster mode
+ carlmartin <carlmartinmax@gmail.com>, huangzhaowei <carlmartinmax@gmail.com>
+ 2014-11-30 16:19:41 -0800
+ Commit: aea7a99, github.com/apache/spark/pull/3479
+
+ SPARK-2143 [WEB UI] Add Spark version to UI footer
+ Sean Owen <sowen@cloudera.com>
+ 2014-11-30 11:40:08 -0800
+ Commit: 048ecca, github.com/apache/spark/pull/3410
+
+ [DOCS][BUILD] Add instruction to use change-version-to-2.11.sh in 'Building for Scala 2.11'.
+ Takuya UESHIN <ueshin@happy-camper.st>
+ 2014-11-30 00:10:31 -0500
+ Commit: 0fcd24c, github.com/apache/spark/pull/3361
+
+ SPARK-4507: PR merge script should support closing multiple JIRA tickets
+ Takayuki Hasegawa <takayuki.hasegawa0311@gmail.com>
+ 2014-11-29 23:12:10 -0500
+ Commit: 4316a7b, github.com/apache/spark/pull/3428
+
+ [SPARK-4505][Core] Add a ClassTag parameter to CompactBuffer[T]
+ zsxwing <zsxwing@gmail.com>
+ 2014-11-29 20:23:08 -0500
+ Commit: c062224, github.com/apache/spark/pull/3378
+
+ [SPARK-4057] Use -agentlib instead of -Xdebug in sbt-launch-lib.bash for debugging
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2014-11-29 20:14:14 -0500
+ Commit: 938dc14, github.com/apache/spark/pull/2904
+
+ Include the key name when failing on an invalid value.
+ Stephen Haberman <stephen@exigencecorp.com>
+ 2014-11-29 20:12:05 -0500
+ Commit: 95290bf, github.com/apache/spark/pull/3514
+
+ [SPARK-3398] [SPARK-4325] [EC2] Use EC2 status checks.
+ Nicholas Chammas <nicholas.chammas@gmail.com>
+ 2014-11-29 00:31:06 -0800
+ Commit: 317e114, github.com/apache/spark/pull/3195
+
+ MAINTENANCE: Automated closing of pull requests.
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-11-29 00:24:35 -0500
+ Commit: 047ff57, github.com/apache/spark/pull/3451
+
+ [SPARK-4597] Use proper exception and reset variable in Utils.createTempDir()
+ Liang-Chi Hsieh <viirya@gmail.com>
+ 2014-11-28 18:04:05 -0800
+ Commit: 49fe879, github.com/apache/spark/pull/3449
+
+ SPARK-1450 [EC2] Specify the default zone in the EC2 script help
+ Sean Owen <sowen@cloudera.com>
+ 2014-11-28 17:43:38 -0500
+ Commit: 48223d8, github.com/apache/spark/pull/3454
+
+ [SPARK-4584] [yarn] Remove security manager from Yarn AM.
+ Marcelo Vanzin <vanzin@cloudera.com>
+ 2014-11-28 15:15:30 -0500
+ Commit: 915f8ee, github.com/apache/spark/pull/3484
+
+ [SPARK-4193][BUILD] Disable doclint in Java 8 to prevent build errors.
+ Takuya UESHIN <ueshin@happy-camper.st>
+ 2014-11-28 13:00:15 -0500
+ Commit: e464f0a, github.com/apache/spark/pull/3058
+
+ [SPARK-4643] [Build] Remove unneeded staging repositories from build
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2014-11-28 12:41:38 -0500
+ Commit: 53ed7f1, github.com/apache/spark/pull/3504
+
+ Delete unnecessary function
+ KaiXinXiaoLei <huleilei1@huawei.com>
+ 2014-11-28 12:34:07 -0500
+ Commit: 052e658, github.com/apache/spark/pull/3224
+
+ [SPARK-4645][SQL] Disables asynchronous execution in Hive 0.13.1 HiveThriftServer2
+ Cheng Lian <lian@databricks.com>
+ 2014-11-28 11:42:40 -0500
+ Commit: 5b99bf2, github.com/apache/spark/pull/3506
+
+ [SPARK-4619][Storage]delete redundant time suffix
+ maji2014 <maji3@asiainfo.com>
+ 2014-11-28 00:36:22 -0800
+ Commit: ceb6281, github.com/apache/spark/pull/3475
+
+ [SPARK-4613][Core] Java API for JdbcRDD
+ Cheng Lian <lian@databricks.com>
+ 2014-11-27 18:01:14 -0800
+ Commit: 120a350, github.com/apache/spark/pull/3478
+
+ [SPARK-4626] Kill a task only if the executorId is (still) registered with the scheduler
+ roxchkplusony <roxchkplusony@gmail.com>
+ 2014-11-27 15:54:40 -0800
+ Commit: 84376d3, github.com/apache/spark/pull/3483
+
+ SPARK-4170 [CORE] Closure problems when running Scala app that "extends App"
+ Sean Owen <sowen@cloudera.com>
+ 2014-11-27 09:03:17 -0800
+ Commit: 5d7fe17, github.com/apache/spark/pull/3497
+
+ [Release] Automate generation of contributors list
+ Andrew Or <andrew@databricks.com>
+ 2014-11-26 23:16:23 -0800
+ Commit: c86e9bc
+
+ [SPARK-732][SPARK-3628][CORE][RESUBMIT] eliminate duplicate update on accumulator
+ CodingCat <zhunansjtu@gmail.com>
+ 2014-11-26 16:52:04 -0800
+ Commit: 5af53ad, github.com/apache/spark/pull/2524
+
+ [SPARK-4614][MLLIB] Slight API changes in Matrix and Matrices
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-26 08:22:50 -0800
+ Commit: 561d31d, github.com/apache/spark/pull/3468
+
+ Removing confusing TripletFields
+ Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com>
+ 2014-11-26 00:55:28 -0800
+ Commit: 288ce58, github.com/apache/spark/pull/3472
+
+ [SPARK-4612] Reduce task latency and increase scheduling throughput by making configuration initialization lazy
+ Tathagata Das <tathagata.das1565@gmail.com>
+ 2014-11-25 23:15:58 -0800
+ Commit: e7f4d25, github.com/apache/spark/pull/3463
+
+ [SPARK-4516] Avoid allocating Netty PooledByteBufAllocators unnecessarily
+ Aaron Davidson <aaron@databricks.com>
+ 2014-11-26 00:32:45 -0500
+ Commit: 346bc17, github.com/apache/spark/pull/3465
+
+ [SPARK-4516] Cap default number of Netty threads at 8
+ Aaron Davidson <aaron@databricks.com>
+ 2014-11-25 23:57:04 -0500
+ Commit: f5f2d27, github.com/apache/spark/pull/3469
+
+ [SPARK-4604][MLLIB] make MatrixFactorizationModel public
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-25 20:11:40 -0800
+ Commit: b5fb141, github.com/apache/spark/pull/3459
+
+ [HOTFIX]: Adding back without-hive dist
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-11-25 23:10:19 -0500
+ Commit: 4d95526
+
+ [SPARK-4583] [mllib] LogLoss for GradientBoostedTrees fix + doc updates
+ Joseph K. Bradley <joseph@databricks.com>
+ 2014-11-25 20:10:15 -0800
+ Commit: c251fd7, github.com/apache/spark/pull/3439
+
+ [SPARK-4509] Revert EC2 tag-based cluster membership patch
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-25 16:07:09 -0800
+ Commit: 7eba0fb, github.com/apache/spark/pull/3453
+
+ Fix SPARK-4471: blockManagerIdFromJson function throws exception while B...
+ hushan[胡珊] <hushan@xiaomi.com>
+ 2014-11-25 15:51:08 -0800
+ Commit: 9bdf5da, github.com/apache/spark/pull/3340
+
+ [SPARK-4546] Improve HistoryServer first time user experience
+ Andrew Or <andrew@databricks.com>
+ 2014-11-25 15:48:02 -0800
+ Commit: 9afcbe4, github.com/apache/spark/pull/3411
+
+ [SPARK-4592] Avoid duplicate worker registrations in standalone mode
+ Andrew Or <andrew@databricks.com>
+ 2014-11-25 15:46:26 -0800
+ Commit: 1b2ab1c, github.com/apache/spark/pull/3447
+
+ [SPARK-4196][SPARK-4602][Streaming] Fix serialization issue in PairDStreamFunctions.saveAsNewAPIHadoopFiles
+ Tathagata Das <tathagata.das1565@gmail.com>
+ 2014-11-25 14:16:27 -0800
+ Commit: 8838ad7, github.com/apache/spark/pull/3457
+
+ [SPARK-4581][MLlib] Refactorize StandardScaler to improve the transformation performance
+ DB Tsai <dbtsai@alpinenow.com>
+ 2014-11-25 11:07:11 -0800
+ Commit: bf1a6aa, github.com/apache/spark/pull/3435
+
+ [SPARK-4601][Streaming] Set correct call site for streaming jobs so that it is displayed correctly on the Spark UI
+ Tathagata Das <tathagata.das1565@gmail.com>
+ 2014-11-25 06:50:36 -0800
+ Commit: 69cd53e, github.com/apache/spark/pull/3455
+
+ [SPARK-4344][DOCS] adding documentation on spark.yarn.user.classpath.first
+ arahuja <aahuja11@gmail.com>
+ 2014-11-25 08:23:41 -0600
+ Commit: d240760, github.com/apache/spark/pull/3209
+
+ [SPARK-4381][Streaming] Add warning log when user sets spark.master to local in Spark Streaming and there's no job executed
+ jerryshao <saisai.shao@intel.com>
+ 2014-11-25 05:36:29 -0800
+ Commit: fef27b2, github.com/apache/spark/pull/3244
+
+ [SPARK-4535][Streaming] Fix the error in comments
+ q00251598 <qiyadong@huawei.com>
+ 2014-11-25 04:01:56 -0800
+ Commit: a51118a, github.com/apache/spark/pull/3400
+
+ [SPARK-4526][MLLIB]GradientDescent get a wrong gradient value according to the gradient formula.
+ GuoQiang Li <witgo@qq.com>
+ 2014-11-25 02:01:19 -0800
+ Commit: f515f94, github.com/apache/spark/pull/3399
+
+ [SPARK-4596][MLLib] Refactorize Normalizer to make code cleaner
+ DB Tsai <dbtsai@alpinenow.com>
+ 2014-11-25 01:57:34 -0800
+ Commit: 89f9122, github.com/apache/spark/pull/3446
+
+ [DOC][Build] Wrong cmd for build spark with apache hadoop 2.4.X and hive 12
+ wangfei <wangfei1@huawei.com>
+ 2014-11-24 22:32:39 -0800
+ Commit: 0fe54cf, github.com/apache/spark/pull/3335
+
+ [SQL] Compute timeTaken correctly
+ w00228970 <wangfei1@huawei.com>
+ 2014-11-24 21:17:24 -0800
+ Commit: 723be60, github.com/apache/spark/pull/3423
+
+ [SPARK-4582][MLLIB] get raw vectors for further processing in Word2Vec
+ tkaessmann <tobias.kaessmann@s24.com>
+ 2014-11-24 19:58:01 -0800
+ Commit: 9ce2bf3, github.com/apache/spark/pull/3309
+
+ [SPARK-4525] Mesos should decline unused offers
+ Patrick Wendell <pwendell@gmail.com>, Jongyoul Lee <jongyoul@gmail.com>
+ 2014-11-24 19:14:14 -0800
+ Commit: f0afb62, github.com/apache/spark/pull/3436
+
+ Revert "[SPARK-4525] Mesos should decline unused offers"
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-11-24 19:16:53 -0800
+ Commit: a68d442
+
+ [SPARK-4525] Mesos should decline unused offers
+ Patrick Wendell <pwendell@gmail.com>, Jongyoul Lee <jongyoul@gmail.com>
+ 2014-11-24 19:14:14 -0800
+ Commit: b043c27, github.com/apache/spark/pull/3436
+
+ [SPARK-4266] [Web-UI] Reduce stage page load time.
+ Kay Ousterhout <kayousterhout@gmail.com>
+ 2014-11-24 18:03:10 -0800
+ Commit: d24d5bf, github.com/apache/spark/pull/3328
+
+ [SPARK-4548] [SPARK-4517] improve performance of python broadcast
+ Davies Liu <davies@databricks.com>
+ 2014-11-24 17:17:03 -0800
+ Commit: 6cf5076, github.com/apache/spark/pull/3417
+
+ [SPARK-4578] fix asDict() with nested Row()
+ Davies Liu <davies@databricks.com>
+ 2014-11-24 16:41:23 -0800
+ Commit: 050616b, github.com/apache/spark/pull/3434
+
+ [SPARK-4562] [MLlib] speedup vector
+ Davies Liu <davies@databricks.com>
+ 2014-11-24 16:37:14 -0800
+ Commit: b660de7, github.com/apache/spark/pull/3420
+
+ [SPARK-4518][SPARK-4519][Streaming] Refactored file stream to prevent files from being processed multiple times
+ Tathagata Das <tathagata.das1565@gmail.com>
+ 2014-11-24 13:50:20 -0800
+ Commit: cb0e9b0, github.com/apache/spark/pull/3419
+
+ [SPARK-4145] Web UI job pages
+ Josh Rosen <joshrosen@databricks.com>
+ 2014-11-24 13:18:14 -0800
+ Commit: 4a90276, github.com/apache/spark/pull/3009
+
+ [SPARK-4487][SQL] Fix attribute reference resolution error when using ORDER BY.
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2014-11-24 12:54:37 -0800
+ Commit: dd1c9cb, github.com/apache/spark/pull/3363
+
+ [SQL] Fix path in HiveFromSpark
+ scwf <wangfei1@huawei.com>
+ 2014-11-24 12:49:08 -0800
+ Commit: b384119, github.com/apache/spark/pull/3415
+
+ [SQL] Fix comment in HiveShim
+ Daniel Darabos <darabos.daniel@gmail.com>
+ 2014-11-24 12:45:07 -0800
+ Commit: d5834f0, github.com/apache/spark/pull/3432
+
+ [SPARK-4479][SQL] Avoids unnecessary defensive copies when sort based shuffle is on
+ Cheng Lian <lian@databricks.com>
+ 2014-11-24 12:43:45 -0800
+ Commit: a6d7b61, github.com/apache/spark/pull/3422
+
+ SPARK-4457. Document how to build for Hadoop versions greater than 2.4
+ Sandy Ryza <sandy@cloudera.com>
+ 2014-11-24 13:28:48 -0600
+ Commit: 29372b6, github.com/apache/spark/pull/3322
+
+ [SPARK-4377] Fixed serialization issue by switching to akka provided serializer.
+ Prashant Sharma <prashant.s@imaginea.com>
+ 2014-11-22 14:05:38 -0800
+ Commit: 9b2a3c6, github.com/apache/spark/pull/3402
+
+ [SPARK-4431][MLlib] Implement efficient foreachActive for dense and sparse vector
+ DB Tsai <dbtsai@alpinenow.com>
+ 2014-11-21 18:15:07 -0800
+ Commit: b5d17ef, github.com/apache/spark/pull/3288
+
+ [SPARK-4531] [MLlib] cache serialized java object
+ Davies Liu <davies@databricks.com>
+ 2014-11-21 15:02:31 -0800
+ Commit: ce95bd8, github.com/apache/spark/pull/3397
+
+ SPARK-4532: Fix bug in detection of Hive in Spark 1.2
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-11-21 12:10:04 -0800
+ Commit: a81918c, github.com/apache/spark/pull/3398
+
+ [SPARK-4397][Core] Reorganize 'implicit's to improve the API convenience
+ zsxwing <zsxwing@gmail.com>
+ 2014-11-21 10:06:30 -0800
+ Commit: 65b987c, github.com/apache/spark/pull/3262
+
+ [SPARK-4472][Shell] Print "Spark context available as sc." only when SparkContext is created...
+ zsxwing <zsxwing@gmail.com>
+ 2014-11-21 00:42:43 -0800
+ Commit: f1069b8, github.com/apache/spark/pull/3341
+
+ [Doc][GraphX] Remove unused png files.
+ Reynold Xin <rxin@databricks.com>
+ 2014-11-21 00:30:58 -0800
+ Commit: 28fdc6f
+
+ [Doc][GraphX] Remove Motivation section and did some minor update.
+ Reynold Xin <rxin@databricks.com>
+ 2014-11-21 00:29:02 -0800
+ Commit: b97070e
+
+ [SPARK-4522][SQL] Parse schema with missing metadata.
+ Michael Armbrust <michael@databricks.com>
+ 2014-11-20 20:34:43 -0800
+ Commit: 90a6a46, github.com/apache/spark/pull/3392
+
+ add Sphinx as a dependency of building docs
+ Davies Liu <davies@databricks.com>
+ 2014-11-20 19:12:45 -0800
+ Commit: 8cd6eea, github.com/apache/spark/pull/3388
+
+ [SPARK-4413][SQL] Parquet support through datasource API
+ Michael Armbrust <michael@databricks.com>
+ 2014-11-20 18:31:02 -0800
+ Commit: 02ec058, github.com/apache/spark/pull/3269
+
+ [SPARK-4244] [SQL] Support Hive Generic UDFs with constant object inspector parameters
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-11-20 16:50:59 -0800
+ Commit: 84d79ee, github.com/apache/spark/pull/3109
+
+ [SPARK-4477] [PySpark] remove numpy from RDDSampler
+ Davies Liu <davies@databricks.com>, Xiangrui Meng <meng@databricks.com>
+ 2014-11-20 16:40:25 -0800
+ Commit: d39f2e9, github.com/apache/spark/pull/3351
+
+ [SQL] fix function description mistake
+ Jacky Li <jacky.likun@gmail.com>
+ 2014-11-20 15:48:36 -0800
+ Commit: ad5f1f3, github.com/apache/spark/pull/3344
+
+ [SPARK-2918] [SQL] Support the CTAS in EXPLAIN command
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-11-20 15:46:00 -0800
+ Commit: 6aa0fc9, github.com/apache/spark/pull/3357
+
+ [SPARK-4318][SQL] Fix empty sum distinct.
+ Takuya UESHIN <ueshin@happy-camper.st>
+ 2014-11-20 15:41:24 -0800
+ Commit: 2c2e7a4, github.com/apache/spark/pull/3184
+
+ [SPARK-4513][SQL] Support relational operator '<=>' in Spark SQL
+ ravipesala <ravindra.pesala@huawei.com>
+ 2014-11-20 15:34:03 -0800
+ Commit: 98e9419, github.com/apache/spark/pull/3387
+
+ [SPARK-4439] [MLlib] add python api for random forest
+ Davies Liu <davies@databricks.com>
+ 2014-11-20 15:31:28 -0800
+ Commit: 1c53a5d, github.com/apache/spark/pull/3320
+
+ [SPARK-4228][SQL] SchemaRDD to JSON
+ Dan McClary <dan.mcclary@gmail.com>
+ 2014-11-20 13:36:50 -0800
+ Commit: b8e6886, github.com/apache/spark/pull/3213
+
+ [SPARK-3938][SQL] Names in-memory columnar RDD with corresponding table name
+ Cheng Lian <lian@databricks.com>
+ 2014-11-20 13:12:24 -0800
+ Commit: abf2918, github.com/apache/spark/pull/3383
+
+ [SPARK-4486][MLLIB] Improve GradientBoosting APIs and doc
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-20 00:48:59 -0800
+ Commit: 15cacc8, github.com/apache/spark/pull/3374
+
+ [SPARK-4446] [SPARK CORE]
+ Leolh <leosandylh@gmail.com>
+ 2014-11-19 18:18:55 -0800
+ Commit: e216ffa, github.com/apache/spark/pull/3306
+
+ [SPARK-4480] Avoid many small spills in external data structures
+ Andrew Or <andrew@databricks.com>
+ 2014-11-19 18:07:27 -0800
+ Commit: 0eb4a7f, github.com/apache/spark/pull/3353
+
+ [SPARK-4484] Treat maxResultSize as unlimited when set to 0; improve error message
+ Nishkam Ravi <nravi@cloudera.com>, nravi <nravi@c1704.halxg.cloudera.com>, nishkamravi2 <nishkamravi@gmail.com>
+ 2014-11-19 17:23:42 -0800
+ Commit: 73fedf5, github.com/apache/spark/pull/3360
+
+ [SPARK-4478] Keep totalRegisteredExecutors up-to-date
+ Akshat Aranya <aaranya@quantcast.com>
+ 2014-11-19 17:20:20 -0800
+ Commit: 9ccc53c, github.com/apache/spark/pull/3373
+
+ Updating GraphX programming guide and documentation
+ Joseph E. Gonzalez <joseph.e.gonzalez@gmail.com>
+ 2014-11-19 16:53:33 -0800
+ Commit: 377b068, github.com/apache/spark/pull/3359
+
+ [SPARK-4495] Fix memory leak in JobProgressListener
+ Josh Rosen <joshrosen@databricks.com>
+ 2014-11-19 16:50:21 -0800
+ Commit: 04d462f, github.com/apache/spark/pull/3372
+
+ [SPARK-4294][Streaming] UnionDStream stream should express the requirements in the same way as TransformedDStream
+ Yadong Qi <qiyadong2010@gmail.com>
+ 2014-11-19 15:53:06 -0800
+ Commit: c3002c4, github.com/apache/spark/pull/3152
+
+ [SPARK-4384] [PySpark] improve sort spilling
+ Davies Liu <davies@databricks.com>
+ 2014-11-19 15:45:37 -0800
+ Commit: 73c8ea8, github.com/apache/spark/pull/3252
+
+ [SPARK-4429][BUILD] Build for Scala 2.11 using sbt fails.
+ Takuya UESHIN <ueshin@happy-camper.st>
+ 2014-11-19 14:40:21 -0800
+ Commit: f9adda9, github.com/apache/spark/pull/3342
+
+ [DOC][PySpark][Streaming] Fix docstring for sphinx
+ Ken Takagiwa <ugw.gi.world@gmail.com>
+ 2014-11-19 14:23:18 -0800
+ Commit: 9b7bbce, github.com/apache/spark/pull/3311
+
+ SPARK-3962 Marked scope as provided for external projects.
+ Prashant Sharma <prashant.s@imaginea.com>, Prashant Sharma <scrapcodes@gmail.com>
+ 2014-11-19 14:18:10 -0800
+ Commit: 1c93841, github.com/apache/spark/pull/2959
+
+ [HOT FIX] MiMa tests are broken
+ Andrew Or <andrew@databricks.com>
+ 2014-11-19 14:03:44 -0800
+ Commit: 0df02ca, github.com/apache/spark/pull/3371
+
+ [SPARK-4481][Streaming][Doc] Fix the wrong description of updateFunc
+ zsxwing <zsxwing@gmail.com>
+ 2014-11-19 13:17:15 -0800
+ Commit: 3bf7cee, github.com/apache/spark/pull/3356
+
+ [SPARK-4482][Streaming] Disable ReceivedBlockTracker's write ahead log by default
+ Tathagata Das <tathagata.das1565@gmail.com>
+ 2014-11-19 13:06:48 -0800
+ Commit: 22fc4e7, github.com/apache/spark/pull/3358
+
+ [SPARK-4470] Validate number of threads in local mode
+ Kenichi Maehashi <webmaster@kenichimaehashi.com>
+ 2014-11-19 12:11:09 -0800
+ Commit: eacc788, github.com/apache/spark/pull/3337
+
+ [SPARK-4467] fix elements read count for ExternalSorter
+ Tianshuo Deng <tdeng@twitter.com>
+ 2014-11-19 10:01:09 -0800
+ Commit: d75579d, github.com/apache/spark/pull/3302
+
+ SPARK-4455 Exclude dependency on hbase-annotations module
+ tedyu <yuzhihong@gmail.com>
+ 2014-11-19 00:55:39 -0800
+ Commit: 5f5ac2d, github.com/apache/spark/pull/3286
+
+ MAINTENANCE: Automated closing of pull requests.
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-11-19 00:27:31 -0800
+ Commit: 8327df6, github.com/apache/spark/pull/2777
+
+ [SPARK-4432] Close InStream after the block is accessed
+ Mingfei <mingfei.shi@intel.com>
+ 2014-11-18 22:17:06 -0800
+ Commit: 165cec9, github.com/apache/spark/pull/3290
+
+ [SPARK-4441] Close Tachyon client when TachyonBlockManager is shutdown
+ Mingfei <mingfei.shi@intel.com>
+ 2014-11-18 22:16:36 -0800
+ Commit: 67e9876, github.com/apache/spark/pull/3299
+
+ Bumping version to 1.3.0-SNAPSHOT.
+ Marcelo Vanzin <vanzin@cloudera.com>
+ 2014-11-18 21:24:18 -0800
+ Commit: 397d3aa, github.com/apache/spark/pull/3277
+
+ [SPARK-4468][SQL] Fixes Parquet filter creation for inequality predicates with literals on the left hand side
+ Cheng Lian <lian@databricks.com>
+ 2014-11-18 17:41:54 -0800
+ Commit: 423baea, github.com/apache/spark/pull/3334
+
+ [SPARK-4327] [PySpark] Python API for RDD.randomSplit()
+ Davies Liu <davies@databricks.com>
+ 2014-11-18 16:37:35 -0800
+ Commit: 7f22fa8, github.com/apache/spark/pull/3193
+
+ [SPARK-4433] fix a race condition in zipWithIndex
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-18 16:25:44 -0800
+ Commit: bb46046, github.com/apache/spark/pull/3291
+
+ [SPARK-3721] [PySpark] broadcast objects larger than 2G
+ Davies Liu <davies@databricks.com>, Davies Liu <davies.liu@gmail.com>
+ 2014-11-18 16:17:51 -0800
+ Commit: 4a377af, github.com/apache/spark/pull/2659
+
+ [SPARK-4306] [MLlib] Python API for LogisticRegressionWithLBFGS
+ Davies Liu <davies@databricks.com>
+ 2014-11-18 15:57:33 -0800
+ Commit: d2e2951, github.com/apache/spark/pull/3307
+
+ [SPARK-4463] Add (de)select all button for add'l metrics.
+ Kay Ousterhout <kayousterhout@gmail.com>
+ 2014-11-18 15:01:06 -0800
+ Commit: 010bc86, github.com/apache/spark/pull/3331
+
+ [SPARK-4017] show progress bar in console
+ Davies Liu <davies@databricks.com>
+ 2014-11-18 13:37:21 -0800
+ Commit: e34f38f, github.com/apache/spark/pull/3029
+
+ [SPARK-4404] remove sys.exit() in shutdown hook
+ Davies Liu <davies@databricks.com>
+ 2014-11-18 13:11:38 -0800
+ Commit: 80f3177, github.com/apache/spark/pull/3289
+
+ [SPARK-4075][SPARK-4434] Fix the URI validation logic for Application Jar name.
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2014-11-18 12:17:33 -0800
+ Commit: bfebfd8, github.com/apache/spark/pull/3326
+
+ [SQL] Support partitioned parquet tables that have the key in both the directory and the file
+ Michael Armbrust <michael@databricks.com>
+ 2014-11-18 12:13:23 -0800
+ Commit: 90d72ec, github.com/apache/spark/pull/3272
+
+ [SPARK-4396] allow lookup by index in Python's Rating
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-18 10:35:29 -0800
+ Commit: b54c6ab, github.com/apache/spark/pull/3261
+
+ [SPARK-4435] [MLlib] [PySpark] improve classification
+ Davies Liu <davies@databricks.com>
+ 2014-11-18 10:11:13 -0800
+ Commit: 8fbf72b, github.com/apache/spark/pull/3305
+
+ ALS implicit: added missing parameter alpha in doc string
+ Felix Maximilian Möller <felixmaximilian.moeller@immobilienscout24.de>
+ 2014-11-18 10:08:24 -0800
+ Commit: cedc3b5, github.com/apache/spark/pull/3343
+
+ SPARK-4466: Provide support for publishing Scala 2.11 artifacts to Maven
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-11-17 21:07:50 -0800
+ Commit: c6e0c2a, github.com/apache/spark/pull/3332
+
+ [SPARK-4453][SPARK-4213][SQL] Simplifies Parquet filter generation code
+ Cheng Lian <lian@databricks.com>
+ 2014-11-17 16:55:12 -0800
+ Commit: 36b0956, github.com/apache/spark/pull/3317
+
+ [SPARK-4448] [SQL] unwrap for the ConstantObjectInspector
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-11-17 16:35:49 -0800
+ Commit: ef7c464, github.com/apache/spark/pull/3308
+
+ [SPARK-4443][SQL] Fix statistics for external table in spark sql hive
+ w00228970 <wangfei1@huawei.com>
+ 2014-11-17 16:33:50 -0800
+ Commit: 42389b1, github.com/apache/spark/pull/3304
+
+ [SPARK-4309][SPARK-4407][SQL] Date type support for Thrift server, and fixes for complex types
+ Cheng Lian <lian@databricks.com>
+ 2014-11-17 16:31:05 -0800
+ Commit: 6b7f2f7, github.com/apache/spark/pull/3298
+
+ [SQL] Construct the MutableRow from an Array
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-11-17 16:29:52 -0800
+ Commit: 69e858c, github.com/apache/spark/pull/3217
+
+ [SPARK-4425][SQL] Handle NaN or Infinity cast to Timestamp correctly.
+ Takuya UESHIN <ueshin@happy-camper.st>
+ 2014-11-17 16:28:07 -0800
+ Commit: 566c791, github.com/apache/spark/pull/3283
+
+ [SPARK-4420][SQL] Change nullability of Cast from DoubleType/FloatType to DecimalType.
+ Takuya UESHIN <ueshin@happy-camper.st>
+ 2014-11-17 16:26:48 -0800
+ Commit: 3a81a1c, github.com/apache/spark/pull/3278
+
+ [SQL] Makes conjunction pushdown more aggressive for in-memory table
+ Cheng Lian <lian@databricks.com>
+ 2014-11-17 15:33:13 -0800
+ Commit: 5ce7dae, github.com/apache/spark/pull/3318
+
+ [SPARK-4180] [Core] Prevent creation of multiple active SparkContexts
+ Josh Rosen <joshrosen@databricks.com>
+ 2014-11-17 12:48:18 -0800
+ Commit: 0f3ceb5, github.com/apache/spark/pull/3121
+
+ [DOCS][SQL] Fix broken link to Row class scaladoc
+ Andy Konwinski <andykonwinski@gmail.com>
+ 2014-11-17 11:52:23 -0800
+ Commit: cec1116, github.com/apache/spark/pull/3323
+
+ Revert "[SPARK-4075] [Deploy] Jar url validation is not enough for Jar file"
+ Andrew Or <andrew@databricks.com>
+ 2014-11-17 11:24:28 -0800
+ Commit: dbb9da5
+
+ [SPARK-4444] Drop VD type parameter from EdgeRDD
+ Ankur Dave <ankurdave@gmail.com>
+ 2014-11-17 11:06:31 -0800
+ Commit: 9ac2bb1, github.com/apache/spark/pull/3303
+
+ SPARK-2811 upgrade algebird to 0.8.1
+ Adam Pingel <adam@axle-lang.org>
+ 2014-11-17 10:47:29 -0800
+ Commit: e7690ed, github.com/apache/spark/pull/3282
+
+ SPARK-4445, Don't display storage level in toDebugString unless RDD is persisted.
+ Prashant Sharma <prashant.s@imaginea.com>
+ 2014-11-17 10:40:33 -0800
+ Commit: 5c92d47, github.com/apache/spark/pull/3310
+
+ [SPARK-4410][SQL] Add support for external sort
+ Michael Armbrust <michael@databricks.com>
+ 2014-11-16 21:55:57 -0800
+ Commit: 64c6b9b, github.com/apache/spark/pull/3268
+
+ [SPARK-4422][MLLIB]In some cases, Vectors.fromBreeze get wrong results.
+ GuoQiang Li <witgo@qq.com>
+ 2014-11-16 21:31:51 -0800
+ Commit: 5168c6c, github.com/apache/spark/pull/3281
+
+ Revert "[SPARK-4309][SPARK-4407][SQL] Date type support for Thrift server, and fixes for complex types"
+ Michael Armbrust <michael@databricks.com>
+ 2014-11-16 15:05:04 -0800
+ Commit: 45ce327, github.com/apache/spark/pull/3292
+
+ [SPARK-4309][SPARK-4407][SQL] Date type support for Thrift server, and fixes for complex types
+ Cheng Lian <lian@databricks.com>
+ 2014-11-16 14:26:41 -0800
+ Commit: cb6bd83, github.com/apache/spark/pull/3178
+
+ [SPARK-4393] Fix memory leak in ConnectionManager ACK timeout TimerTasks; use HashedWheelTimer
+ Josh Rosen <joshrosen@databricks.com>
+ 2014-11-16 00:44:15 -0800
+ Commit: 7850e0c, github.com/apache/spark/pull/3259
+
+ [SPARK-4426][SQL][Minor] The symbol of BitwiseOr is wrong, should not be '&'
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2014-11-15 22:23:47 -0800
+ Commit: 84468b2, github.com/apache/spark/pull/3284
+
+ [SPARK-4419] Upgrade snappy-java to 1.1.1.6
+ Josh Rosen <joshrosen@databricks.com>
+ 2014-11-15 22:22:34 -0800
+ Commit: 7d8e152, github.com/apache/spark/pull/3287
+
+ [SPARK-2321] Several progress API improvements / refactorings
+ Josh Rosen <joshrosen@databricks.com>
+ 2014-11-14 23:46:25 -0800
+ Commit: 40eb8b6, github.com/apache/spark/pull/3197
+
+ Added contains(key) to Metadata
+ kai <kaizeng@eecs.berkeley.edu>
+ 2014-11-14 23:44:23 -0800
+ Commit: cbddac2, github.com/apache/spark/pull/3273
+
+ [SPARK-4260] HttpBroadcast should set connection timeout.
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2014-11-14 22:36:56 -0800
+ Commit: 60969b0, github.com/apache/spark/pull/3122
+
+ [SPARK-4363][Doc] Update the Broadcast example
+ zsxwing <zsxwing@gmail.com>
+ 2014-11-14 22:28:48 -0800
+ Commit: 861223e, github.com/apache/spark/pull/3226
+
+ [SPARK-4379][Core] Change Exception to SparkException in checkpoint
+ zsxwing <zsxwing@gmail.com>
+ 2014-11-14 22:25:41 -0800
+ Commit: dba1405, github.com/apache/spark/pull/3241
+
+ [SPARK-4415] [PySpark] JVM should exit after Python exit
+ Davies Liu <davies@databricks.com>
+ 2014-11-14 20:13:46 -0800
+ Commit: 7fe08b4, github.com/apache/spark/pull/3274
+
+ [SPARK-4404]SparkSubmitDriverBootstrapper should stop after its SparkSubmit sub-proc...
+ WangTao <barneystinson@aliyun.com>, WangTaoTheTonic <barneystinson@aliyun.com>
+ 2014-11-14 20:11:51 -0800
+ Commit: 303a4e4, github.com/apache/spark/pull/3266
+
+ SPARK-4214. With dynamic allocation, avoid outstanding requests for more...
+ Sandy Ryza <sandy@cloudera.com>
+ 2014-11-14 15:51:05 -0800
+ Commit: ad42b28, github.com/apache/spark/pull/3204
+
+ [SPARK-4412][SQL] Fix Spark's control of Parquet logging.
+ Jim Carroll <jim@dontcallme.com>
+ 2014-11-14 15:33:21 -0800
+ Commit: 37482ce, github.com/apache/spark/pull/3271
+
+ [SPARK-4365][SQL] Remove unnecessary filter call on records returned from parquet library
+ Yash Datta <Yash.Datta@guavus.com>
+ 2014-11-14 15:16:36 -0800
+ Commit: 63ca3af, github.com/apache/spark/pull/3229
+
+ [SPARK-4386] Improve performance when writing Parquet files.
+ Jim Carroll <jim@dontcallme.com>
+ 2014-11-14 15:11:53 -0800
+ Commit: f76b968, github.com/apache/spark/pull/3254
+
+ [SPARK-4322][SQL] Enables struct fields as sub expressions of grouping fields
+ Cheng Lian <lian@databricks.com>
+ 2014-11-14 15:09:36 -0800
+ Commit: 0c7b66b, github.com/apache/spark/pull/3248
+
+ [SQL] Don't shuffle code generated rows
+ Michael Armbrust <michael@databricks.com>
+ 2014-11-14 15:03:23 -0800
+ Commit: 4b4b50c, github.com/apache/spark/pull/3263
+
+ [SQL] Minor cleanup of comments, errors and override.
+ Michael Armbrust <michael@databricks.com>
+ 2014-11-14 15:00:42 -0800
+ Commit: f805025, github.com/apache/spark/pull/3257
+
+ [SPARK-4391][SQL] Configure parquet filters using SQLConf
+ Michael Armbrust <michael@databricks.com>
+ 2014-11-14 14:59:35 -0800
+ Commit: e47c387, github.com/apache/spark/pull/3258
+
+ [SPARK-4390][SQL] Handle NaN cast to decimal correctly
+ Michael Armbrust <michael@databricks.com>
+ 2014-11-14 14:56:57 -0800
+ Commit: a0300ea, github.com/apache/spark/pull/3256
+
+ [SPARK-4062][Streaming]Add ReliableKafkaReceiver in Spark Streaming Kafka connector
+ jerryshao <saisai.shao@intel.com>, Tathagata Das <tathagata.das1565@gmail.com>, Saisai Shao <saisai.shao@intel.com>
+ 2014-11-14 14:33:37 -0800
+ Commit: 5930f64, github.com/apache/spark/pull/2991
+
+ [SPARK-4333][SQL] Correctly log number of iterations in RuleExecutor
+ DoingDone9 <799203320@qq.com>
+ 2014-11-14 14:28:06 -0800
+ Commit: 0cbdb01, github.com/apache/spark/pull/3180
+
+ SPARK-4375. no longer require -Pscala-2.10
+ Sandy Ryza <sandy@cloudera.com>
+ 2014-11-14 14:21:57 -0800
+ Commit: f5f757e, github.com/apache/spark/pull/3239
+
+ [SPARK-4245][SQL] Fix containsNull of the result ArrayType of CreateArray expression.
+ Takuya UESHIN <ueshin@happy-camper.st>
+ 2014-11-14 14:21:16 -0800
+ Commit: bbd8f5b, github.com/apache/spark/pull/3110
+
+ [SPARK-4239] [SQL] support view in HiveQl
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2014-11-14 13:51:20 -0800
+ Commit: ade72c4, github.com/apache/spark/pull/3131
+
+ Update failed assert text to match code in SizeEstimatorSuite
+ Jeff Hammerbacher <jeff.hammerbacher@gmail.com>
+ 2014-11-14 13:37:48 -0800
+ Commit: c258db9, github.com/apache/spark/pull/3242
+
+ [SPARK-4313][WebUI][Yarn] Fix link issue of the executor thread dump page in yarn-cluster mode
+ zsxwing <zsxwing@gmail.com>
+ 2014-11-14 13:36:13 -0800
+ Commit: 156cf33, github.com/apache/spark/pull/3183
+
+ SPARK-3663 Document SPARK_LOG_DIR and SPARK_PID_DIR
+ Andrew Ash <andrew@andrewash.com>
+ 2014-11-14 13:33:35 -0800
+ Commit: 5c265cc, github.com/apache/spark/pull/2518
+
+ [Spark Core] SPARK-4380 Edit spilling log from MB to B
+ Hong Shen <hongshen@tencent.com>
+ 2014-11-14 13:29:41 -0800
+ Commit: 0c56a03, github.com/apache/spark/pull/3243
+
+ [SPARK-4398][PySpark] specialize sc.parallelize(xrange)
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-14 12:43:17 -0800
+ Commit: abd5817, github.com/apache/spark/pull/3264
+
+ [SPARK-4394][SQL] Data Sources API Improvements
+ Michael Armbrust <michael@databricks.com>
+ 2014-11-14 12:00:08 -0800
+ Commit: 77e845c, github.com/apache/spark/pull/3260
+
+ [SPARK-3722][Docs]minor improvement and fix in docs
+ WangTao <barneystinson@aliyun.com>
+ 2014-11-14 08:09:42 -0600
+ Commit: e421072, github.com/apache/spark/pull/2579
+
+ [SPARK-4310][WebUI] Sort 'Submitted' column in Stage page by time
+ zsxwing <zsxwing@gmail.com>
+ 2014-11-13 14:37:04 -0800
+ Commit: 825709a, github.com/apache/spark/pull/3179
+
+ [SPARK-4372][MLLIB] Make LR and SVM's default parameters consistent in Scala and Python
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-13 13:54:16 -0800
+ Commit: 3221830, github.com/apache/spark/pull/3232
+
+ [SPARK-4326] fix unidoc
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-13 13:16:20 -0800
+ Commit: 4b0c1ed, github.com/apache/spark/pull/3253
+
+ [HOT FIX] make-distribution.sh fails if Yarn shuffle jar DNE
+ Andrew Or <andrew@databricks.com>
+ 2014-11-13 11:54:45 -0800
+ Commit: a0fa1ba, github.com/apache/spark/pull/3250
+
+ [SPARK-4378][MLLIB] make ALS more Java-friendly
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-13 11:42:27 -0800
+ Commit: ca26a21, github.com/apache/spark/pull/3240
+
+ [SPARK-4348] [PySpark] [MLlib] rename random.py to rand.py
+ Davies Liu <davies@databricks.com>
+ 2014-11-13 10:24:54 -0800
+ Commit: ce0333f, github.com/apache/spark/pull/3216
+
+ [SPARK-4256] Make Binary Evaluation Metrics functions defined in cases where there ar...
+ Andrew Bullen <andrew.bullen@workday.com>
+ 2014-11-12 22:14:44 -0800
+ Commit: 484fecb, github.com/apache/spark/pull/3118
+
+ [SPARK-4370] [Core] Limit number of Netty cores based on executor size
+ Aaron Davidson <aaron@databricks.com>
+ 2014-11-12 18:46:37 -0800
+ Commit: b9e1c2e, github.com/apache/spark/pull/3155
+
+ [SPARK-4373][MLLIB] fix MLlib maven tests
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-12 18:15:14 -0800
+ Commit: 23f5bdf, github.com/apache/spark/pull/3235
+
+ [Release] Bring audit scripts up-to-date
+ Andrew Or <andrew@databricks.com>
+ 2014-11-13 00:30:58 +0000
+ Commit: 723a86b
+
+ [SPARK-2672] support compressed file in wholeTextFile
+ Davies Liu <davies@databricks.com>
+ 2014-11-12 15:58:12 -0800
+ Commit: d7d54a4, github.com/apache/spark/pull/3005
+
+ [SPARK-4369] [MLLib] fix TreeModel.predict() with RDD
+ Davies Liu <davies@databricks.com>
+ 2014-11-12 13:56:41 -0800
+ Commit: bd86118, github.com/apache/spark/pull/3230
+
+ [SPARK-3666] Extract interfaces for EdgeRDD and VertexRDD
+ Ankur Dave <ankurdave@gmail.com>
+ 2014-11-12 13:49:20 -0800
+ Commit: a5ef581, github.com/apache/spark/pull/2530
+
+ [Release] Correct make-distribution.sh log path
+ Andrew Or <andrew@databricks.com>
+ 2014-11-12 13:46:26 -0800
+ Commit: c3afd32
+
+ Internal cleanup for aggregateMessages
+ Ankur Dave <ankurdave@gmail.com>
+ 2014-11-12 13:44:49 -0800
+ Commit: 0402be9, github.com/apache/spark/pull/3231
+
+ [SPARK-4281][Build] Package Yarn shuffle service into its own jar
+ Andrew Or <andrew@databricks.com>
+ 2014-11-12 13:39:45 -0800
+ Commit: aa43a8d, github.com/apache/spark/pull/3147
+
+ [Test] Better exception message from SparkSubmitSuite
+ Andrew Or <andrew@databricks.com>
+ 2014-11-12 13:35:48 -0800
+ Commit: 6e3c5a2, github.com/apache/spark/pull/3212
+
+ [SPARK-3660][STREAMING] Initial RDD for updateStateByKey transformation
+ Soumitra Kumar <kumar.soumitra@gmail.com>
+ 2014-11-12 12:25:31 -0800
+ Commit: 36ddeb7, github.com/apache/spark/pull/2665
+
+ [SPARK-3530][MLLIB] pipeline and parameters with examples
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-12 10:38:57 -0800
+ Commit: 4b736db, github.com/apache/spark/pull/3099
+
+ [SPARK-4355][MLLIB] fix OnlineSummarizer.merge when other.mean is zero
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-12 01:50:11 -0800
+ Commit: 84324fb, github.com/apache/spark/pull/3220
+
+ [SPARK-3936] Add aggregateMessages, which supersedes mapReduceTriplets
+ Ankur Dave <ankurdave@gmail.com>
+ 2014-11-11 23:38:27 -0800
+ Commit: faeb41d, github.com/apache/spark/pull/3100
+
+ [MLLIB] SPARK-4347: Reducing GradientBoostingSuite run time.
+ Manish Amde <manish9ue@gmail.com>
+ 2014-11-11 22:47:53 -0800
+ Commit: 2ef016b, github.com/apache/spark/pull/3214
+
+ Support cross building for Scala 2.11
+ Prashant Sharma <prashant.s@imaginea.com>, Patrick Wendell <pwendell@gmail.com>
+ 2014-11-11 21:36:48 -0800
+ Commit: daaca14, github.com/apache/spark/pull/3159
+
+ [Release] Log build output for each distribution
+ Andrew Or <andrew@databricks.com>
+ 2014-11-11 18:02:59 -0800
+ Commit: 2ddb141
+
+ SPARK-2269 Refactor mesos scheduler resourceOffers and add unit test
+ Timothy Chen <tnachen@gmail.com>
+ 2014-11-11 14:29:18 -0800
+ Commit: a878660, github.com/apache/spark/pull/1487
+
+ [SPARK-4282][YARN] Stopping flag in YarnClientSchedulerBackend should be volatile
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2014-11-11 12:33:53 -0600
+ Commit: 7f37188, github.com/apache/spark/pull/3143
+
+ SPARK-4305 [BUILD] yarn-alpha profile won't build due to network/yarn module
+ Sean Owen <sowen@cloudera.com>
+ 2014-11-11 12:30:35 -0600
+ Commit: f820b56, github.com/apache/spark/pull/3167
+
+ SPARK-1830 Deploy failover, Make Persistence engine and LeaderAgent Pluggable
+ Prashant Sharma <prashant.s@imaginea.com>
+ 2014-11-11 09:29:48 -0800
+ Commit: deefd9d, github.com/apache/spark/pull/771
+
+ [Streaming][Minor]Replace some 'if-else' in Clock
+ huangzhaowei <carlmartinmax@gmail.com>
+ 2014-11-11 03:02:12 -0800
+ Commit: 6e03de3, github.com/apache/spark/pull/3088
+
+ [SPARK-2492][Streaming] kafkaReceiver minor changes to align with Kafka 0.8
+ jerryshao <saisai.shao@intel.com>
+ 2014-11-11 02:22:23 -0800
+ Commit: c8850a3, github.com/apache/spark/pull/1420
+
+ [SPARK-4295][External]Fix exception in SparkSinkSuite
+ maji2014 <maji3@asiainfo.com>
+ 2014-11-11 02:18:27 -0800
+ Commit: f8811a5, github.com/apache/spark/pull/3177
+
+ [SPARK-4307] Initialize FileDescriptor lazily in FileRegion.
+ Reynold Xin <rxin@databricks.com>, Reynold Xin <rxin@apache.org>
+ 2014-11-11 00:25:31 -0800
+ Commit: ef29a9a, github.com/apache/spark/pull/3172
+
+ [SPARK-4324] [PySpark] [MLlib] support numpy.array for all MLlib API
+ Davies Liu <davies@databricks.com>
+ 2014-11-10 22:26:16 -0800
+ Commit: 65083e9, github.com/apache/spark/pull/3189
+
+ [SPARK-4330][Doc] Link to proper URL for YARN overview
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2014-11-10 22:18:00 -0800
+ Commit: 3c07b8f, github.com/apache/spark/pull/3196
+
+ [SPARK-3649] Remove GraphX custom serializers
+ Ankur Dave <ankurdave@gmail.com>
+ 2014-11-10 19:31:52 -0800
+ Commit: 300887b, github.com/apache/spark/pull/2503
+
+ [SPARK-4274] [SQL] Fix NPE in printing the details of the query plan
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-11-10 17:46:05 -0800
+ Commit: c764d0a, github.com/apache/spark/pull/3139
+
+ [SPARK-3954][Streaming] Optimization to FileInputDStream
+ surq <surq@asiainfo.com>
+ 2014-11-10 17:37:16 -0800
+ Commit: ce6ed2a, github.com/apache/spark/pull/2811
+
+ [SPARK-4149][SQL] ISO 8601 support for json date time strings
+ Daoyuan Wang <daoyuan.wang@intel.com>
+ 2014-11-10 17:26:03 -0800
+ Commit: a1fc059, github.com/apache/spark/pull/3012
+
+ [SPARK-4250] [SQL] Fix bug of constant null value mapping to ConstantObjectInspector
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-11-10 17:22:57 -0800
+ Commit: fa77783, github.com/apache/spark/pull/3114
+
+ [SQL] remove a decimal case branch that has no effect at runtime
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-10 17:20:52 -0800
+ Commit: d793d80, github.com/apache/spark/pull/3192
+
+ [SPARK-4308][SQL] Sets SQL operation state to ERROR when exception is thrown
+ Cheng Lian <lian@databricks.com>
+ 2014-11-10 16:56:36 -0800
+ Commit: acb55ae, github.com/apache/spark/pull/3175
+
+ [SPARK-4000][Build] Uploads HiveCompatibilitySuite logs
+ Cheng Lian <lian@databricks.com>
+ 2014-11-10 16:17:52 -0800
+ Commit: 534b231, github.com/apache/spark/pull/2993
+
+ [SPARK-4319][SQL] Enable an ignored test "null count".
+ Takuya UESHIN <ueshin@happy-camper.st>
+ 2014-11-10 15:55:15 -0800
+ Commit: dbf1058, github.com/apache/spark/pull/3185
+
+ Revert "[SPARK-2703][Core]Make Tachyon related unit tests execute without deploying a Tachyon system locally."
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-11-10 14:56:06 -0800
+ Commit: 6e7a309
+
+ [SPARK-4047] - Generate runtime warnings for example implementation of PageRank
+ Varadharajan Mukundan <srinathsmn@gmail.com>
+ 2014-11-10 14:32:29 -0800
+ Commit: 974d334, github.com/apache/spark/pull/2894
+
+ SPARK-1297 Upgrade HBase dependency to 0.98
+ tedyu <yuzhihong@gmail.com>
+ 2014-11-10 13:23:33 -0800
+ Commit: b32734e, github.com/apache/spark/pull/3115
+
+ SPARK-4230. Doc for spark.default.parallelism is incorrect
+ Sandy Ryza <sandy@cloudera.com>
+ 2014-11-10 12:40:41 -0800
+ Commit: c6f4e70, github.com/apache/spark/pull/3107
+
+ [SPARK-4312] bash doesn't have "die"
+ Jey Kottalam <jey@kottalam.net>
+ 2014-11-10 12:37:56 -0800
+ Commit: c5db8e2, github.com/apache/spark/pull/2898
+
+ Update RecoverableNetworkWordCount.scala
+ comcmipi <pitonak@fns.uniba.sk>
+ 2014-11-10 12:33:48 -0800
+ Commit: 0340c56, github.com/apache/spark/pull/2735
+
+ SPARK-2548 [STREAMING] JavaRecoverableWordCount is missing
+ Sean Owen <sowen@cloudera.com>
+ 2014-11-10 11:47:27 -0800
+ Commit: 3a02d41, github.com/apache/spark/pull/2564
+
+ [SPARK-4169] [Core] Accommodate non-English Locales in unit tests
+ Niklas Wilcke <1wilcke@informatik.uni-hamburg.de>
+ 2014-11-10 11:37:38 -0800
+ Commit: ed8bf1e, github.com/apache/spark/pull/3036
+
+ [SQL] support udt to hive types conversion (hive->udt is not supported)
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-10 11:04:12 -0800
+ Commit: 894a724, github.com/apache/spark/pull/3164
+
+ [SPARK-2703][Core]Make Tachyon related unit tests execute without deploying a Tachyon system locally.
+ RongGu <gurongwalker@gmail.com>
+ 2014-11-09 23:48:15 -0800
+ Commit: bd86cb1, github.com/apache/spark/pull/3030
+
+ MAINTENANCE: Automated closing of pull requests.
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-11-09 23:07:14 -0800
+ Commit: 227488d, github.com/apache/spark/pull/2898
+
+ SPARK-3179. Add task OutputMetrics.
+ Sandy Ryza <sandy@cloudera.com>
+ 2014-11-09 22:29:03 -0800
+ Commit: 3c2cff4, github.com/apache/spark/pull/2968
+
+ SPARK-1209 [CORE] (Take 2) SparkHadoop{MapRed,MapReduce}Util should not use package org.apache.hadoop
+ Sean Owen <sowen@cloudera.com>
+ 2014-11-09 22:11:20 -0800
+ Commit: f8e5732, github.com/apache/spark/pull/3048
+
+ MAINTENANCE: Automated closing of pull requests.
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-11-09 18:16:20 -0800
+ Commit: f73b56f, github.com/apache/spark/pull/464
+
+ SPARK-1344 [DOCS] Scala API docs for top methods
+ Sean Owen <sowen@cloudera.com>
+ 2014-11-09 17:42:08 -0800
+ Commit: d136265, github.com/apache/spark/pull/3168
+
+ SPARK-971 [DOCS] Link to Confluence wiki from project website / documentation
+ Sean Owen <sowen@cloudera.com>
+ 2014-11-09 17:40:48 -0800
+ Commit: 8c99a47, github.com/apache/spark/pull/3169
+
+ [SPARK-4301] StreamingContext should not allow start() to be called after calling stop()
+ Josh Rosen <joshrosen@databricks.com>
+ 2014-11-08 18:10:23 -0800
+ Commit: 7b41b17, github.com/apache/spark/pull/3160
+
+ [Minor] [Core] Don't NPE on closeQuietly(null)
+ Aaron Davidson <aaron@databricks.com>
+ 2014-11-08 13:03:51 -0800
+ Commit: 4af5c7e, github.com/apache/spark/pull/3166
+
+ [SPARK-4291][Build] Rename network module projects
+ Andrew Or <andrew@databricks.com>
+ 2014-11-07 23:16:13 -0800
+ Commit: 7afc856, github.com/apache/spark/pull/3148
+
+ [MLLIB] [PYTHON] SPARK-4221: Expose nonnegative ALS in the python API
+ Michelangelo D'Agostino <mdagostino@civisanalytics.com>
+ 2014-11-07 22:53:01 -0800
+ Commit: 7e9d975, github.com/apache/spark/pull/3095
+
+ [SPARK-4304] [PySpark] Fix sort on empty RDD
+ Davies Liu <davies@databricks.com>
+ 2014-11-07 20:53:03 -0800
+ Commit: 7779109, github.com/apache/spark/pull/3162
+
+ MAINTENANCE: Automated closing of pull requests.
+ Patrick Wendell <pwendell@gmail.com>
+ 2014-11-07 13:08:25 -0800
+ Commit: 5923dd9, github.com/apache/spark/pull/3016
+
+ Update JavaCustomReceiver.java
+ xiao321 <1042460381@qq.com>
+ 2014-11-07 12:56:49 -0800
+ Commit: 7c9ec52, github.com/apache/spark/pull/3153
+
+ [SPARK-4292][SQL] Result set iterator bug in JDBC/ODBC
+ wangfei <wangfei1@huawei.com>
+ 2014-11-07 12:55:11 -0800
+ Commit: d6e5552, github.com/apache/spark/pull/3149
+
+ [SPARK-4203][SQL] Partition directories in random order when inserting into hive table
+ Matthew Taylor <matthew.t@tbfe.net>
+ 2014-11-07 12:53:08 -0800
+ Commit: ac70c97, github.com/apache/spark/pull/3076
+
+ [SPARK-4270][SQL] Fix Cast from DateType to DecimalType.
+ Takuya UESHIN <ueshin@happy-camper.st>
+ 2014-11-07 12:30:47 -0800
+ Commit: a6405c5, github.com/apache/spark/pull/3134
+
+ [SPARK-4272] [SQL] Add more unwrapper functions for primitive type in TableReader
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-11-07 12:15:53 -0800
+ Commit: 60ab80f, github.com/apache/spark/pull/3136
+
+ [SPARK-4213][SQL] ParquetFilters - No support for LT, LTE, GT, GTE operators
+ Kousuke Saruta <sarutak@oss.nttdata.co.jp>
+ 2014-11-07 11:56:40 -0800
+ Commit: 14c54f1, github.com/apache/spark/pull/3083
+
+ [SQL] Modify keyword val location according to ordering
+ Jacky Li <jacky.likun@gmail.com>
+ 2014-11-07 11:52:08 -0800
+ Commit: 68609c5, github.com/apache/spark/pull/3080
+
+ [SQL] Support ScalaReflection of schema in different universes
+ Michael Armbrust <michael@databricks.com>
+ 2014-11-07 11:51:20 -0800
+ Commit: 8154ed7, github.com/apache/spark/pull/3096
+
+ [SPARK-4225][SQL] Resorts to SparkContext.version to inspect Spark version
+ Cheng Lian <lian@databricks.com>
+ 2014-11-07 11:45:25 -0800
+ Commit: 86e9eaa, github.com/apache/spark/pull/3105
+
+ [SQL][DOC][Minor] Spark SQL Hive now supports dynamic partitioning
+ wangfei <wangfei1@huawei.com>
+ 2014-11-07 11:43:35 -0800
+ Commit: 636d7bc, github.com/apache/spark/pull/3127
+
+ [SPARK-4187] [Core] Switch to binary protocol for external shuffle service messages
+ Aaron Davidson <aaron@databricks.com>
+ 2014-11-07 09:42:21 -0800
+ Commit: d4fa04e, github.com/apache/spark/pull/3146
+
+ [SPARK-4204][Core][WebUI] Change Utils.exceptionString to contain the inner exceptions and make the error information in Web UI more friendly
+ zsxwing <zsxwing@gmail.com>
+ 2014-11-06 21:52:12 -0800
+ Commit: 3abdb1b, github.com/apache/spark/pull/3073
+
+ [SPARK-4236] Cleanup removed applications' files in shuffle service
+ Aaron Davidson <aaron@databricks.com>
+ 2014-11-06 19:54:32 -0800
+ Commit: 48a19a6, github.com/apache/spark/pull/3126
+
+ [SPARK-4188] [Core] Perform network-level retry of shuffle file fetches
+ Aaron Davidson <aaron@databricks.com>
+ 2014-11-06 18:39:14 -0800
+ Commit: f165b2b, github.com/apache/spark/pull/3101
+
+ [SPARK-4277] Support external shuffle service on Standalone Worker
+ Aaron Davidson <aaron@databricks.com>
+ 2014-11-06 17:20:46 -0800
+ Commit: 6e9ef10, github.com/apache/spark/pull/3142
+
+ [SPARK-3797] Minor addendum to Yarn shuffle service
+ Andrew Or <andrew@databricks.com>
+ 2014-11-06 17:18:49 -0800
+ Commit: 96136f2, github.com/apache/spark/pull/3144
+
+ [HOT FIX] Make distribution fails
+ Andrew Or <andrew@databricks.com>
+ 2014-11-06 15:31:07 -0800
+ Commit: 470881b, github.com/apache/spark/pull/3145
+
+ [SPARK-4249][GraphX]fix a problem of EdgePartitionBuilder in GraphX
+ lianhuiwang <lianhuiwang09@gmail.com>
+ 2014-11-06 10:46:45 -0800
+ Commit: d15c6e9, github.com/apache/spark/pull/3138
+
+ [SPARK-4264] Completion iterator should only invoke callback once
+ Aaron Davidson <aaron@databricks.com>
+ 2014-11-06 10:45:46 -0800
+ Commit: 23eaf0e, github.com/apache/spark/pull/3128
+
+ [SPARK-4186] add binaryFiles and binaryRecords in Python
+ Davies Liu <davies@databricks.com>
+ 2014-11-06 00:22:19 -0800
+ Commit: b41a39e, github.com/apache/spark/pull/3078
+
+ [SPARK-4255] Fix incorrect table striping
+ Kay Ousterhout <kayousterhout@gmail.com>
+ 2014-11-06 00:03:03 -0800
+ Commit: 5f27ae1, github.com/apache/spark/pull/3117
+
+ [SPARK-4137] [EC2] Don't change working dir on user
+ Nicholas Chammas <nicholas.chammas@gmail.com>
+ 2014-11-05 20:45:35 -0800
+ Commit: db45f5a, github.com/apache/spark/pull/2988
+
+ [SPARK-4262][SQL] add .schemaRDD to JavaSchemaRDD
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-05 19:56:16 -0800
+ Commit: 3d2b5bc, github.com/apache/spark/pull/3125
+
+ [SPARK-4254] [mllib] MovieLensALS bug fix
+ Joseph K. Bradley <joseph@databricks.com>
+ 2014-11-05 19:51:18 -0800
+ Commit: c315d13, github.com/apache/spark/pull/3116
+
+ [SPARK-4158] Fix for missing resources.
+ Brenden Matthews <brenden@diddyinc.com>
+ 2014-11-05 16:02:44 -0800
+ Commit: cb0eae3, github.com/apache/spark/pull/3024
+
+ SPARK-3223 runAsSparkUser cannot change HDFS write permission properly i...
+ Jongyoul Lee <jongyoul@gmail.com>
+ 2014-11-05 15:49:42 -0800
+ Commit: f7ac8c2, github.com/apache/spark/pull/3034
+
+ SPARK-4040. Update documentation to exemplify use of local (n) value, fo...
+ jay@apache.org <jayunit100>
+ 2014-11-05 15:45:34 -0800
+ Commit: 868cd4c, github.com/apache/spark/pull/2964
+
+ [SPARK-3797] Run external shuffle service in Yarn NM
+ Andrew Or <andrew@databricks.com>
+ 2014-11-05 15:42:05 -0800
+ Commit: 61a5cce, github.com/apache/spark/pull/3082
+
+ SPARK-4222 [CORE] use readFully in FixedLengthBinaryRecordReader
+ industrial-sloth <industrial-sloth@users.noreply.github.com>
+ 2014-11-05 15:38:48 -0800
+ Commit: f37817b, github.com/apache/spark/pull/3093
+
+ [SPARK-3984] [SPARK-3983] Fix incorrect scheduler delay and display task deserialization time in UI
+ Kay Ousterhout <kayousterhout@gmail.com>
+ 2014-11-05 15:30:31 -0800
+ Commit: a46497e, github.com/apache/spark/pull/2832
+
+ [SPARK-4242] [Core] Add SASL to external shuffle service
+ Aaron Davidson <aaron@databricks.com>
+ 2014-11-05 14:38:43 -0800
+ Commit: 4c42986, github.com/apache/spark/pull/3108
+
+ [SPARK-4197] [mllib] GradientBoosting API cleanup and examples in Scala, Java
+ Joseph K. Bradley <joseph@databricks.com>
+ 2014-11-05 10:33:13 -0800
+ Commit: 5b3b6f6, github.com/apache/spark/pull/3094
+
+ [SPARK-4029][Streaming] Update streaming driver to reliably save and recover received block metadata on driver failures
+ Tathagata Das <tathagata.das1565@gmail.com>
+ 2014-11-05 01:21:53 -0800
+ Commit: 5f13759, github.com/apache/spark/pull/3026
+
+ [SPARK-3964] [MLlib] [PySpark] add Hypothesis test Python API
+ Davies Liu <davies@databricks.com>
+ 2014-11-04 21:35:52 -0800
+ Commit: c8abddc, github.com/apache/spark/pull/3091
+
+ [SQL] Add String option for DSL AS
+ Michael Armbrust <michael@databricks.com>
+ 2014-11-04 18:14:28 -0800
+ Commit: 515abb9, github.com/apache/spark/pull/3097
+
+ [SPARK-2938] Support SASL authentication in NettyBlockTransferService
+ Aaron Davidson <aaron@databricks.com>
+ 2014-11-04 16:15:38 -0800
+ Commit: 5e73138, github.com/apache/spark/pull/3087
+
+ [SPARK-4060] [MLlib] exposing special rdd functions to the public
+ Niklas Wilcke <1wilcke@informatik.uni-hamburg.de>
+ 2014-11-04 09:57:03 -0800
+ Commit: f90ad5d, github.com/apache/spark/pull/2907
+
+ fixed MLlib Naive-Bayes java example bug
+ Dariusz Kobylarz <darek.kobylarz@gmail.com>
+ 2014-11-04 09:53:43 -0800
+ Commit: bcecd73, github.com/apache/spark/pull/3081
+
+ [SPARK-3886] [PySpark] simplify serializer, use AutoBatchedSerializer by default.
+ Davies Liu <davies@databricks.com>
+ 2014-11-03 23:56:14 -0800
+ Commit: e4f4263, github.com/apache/spark/pull/2920
+
+ [SPARK-4166][Core] Add a backward compatibility test for ExecutorLostFailure
+ zsxwing <zsxwing@gmail.com>
+ 2014-11-03 22:47:45 -0800
+ Commit: b671ce0, github.com/apache/spark/pull/3085
+
+ [SPARK-4163][Core] Add a backward compatibility test for FetchFailed
+ zsxwing <zsxwing@gmail.com>
+ 2014-11-03 22:40:43 -0800
+ Commit: 9bdc841, github.com/apache/spark/pull/3086
+
+ [SPARK-3573][MLLIB] Make MLlib's Vector compatible with SQL's SchemaRDD
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-03 22:29:48 -0800
+ Commit: 1a9c6cd, github.com/apache/spark/pull/3070
+
+ [SPARK-4192][SQL] Internal API for Python UDT
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-03 19:29:11 -0800
+ Commit: 04450d1, github.com/apache/spark/pull/3068
+
+ [FIX][MLLIB] fix seed in BaggedPointSuite
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-03 18:50:37 -0800
+ Commit: c5912ec, github.com/apache/spark/pull/3084
+
+ [SPARK-611] Display executor thread dumps in web UI
+ Josh Rosen <joshrosen@databricks.com>
+ 2014-11-03 18:18:47 -0800
+ Commit: 4f035dd, github.com/apache/spark/pull/2944
+
+ [SPARK-4168][WebUI] web stages number should show correctly when stages are more than 1000
+ Zhang, Liye <liye.zhang@intel.com>
+ 2014-11-03 18:17:32 -0800
+ Commit: 97a466e, github.com/apache/spark/pull/3035
+
+ [SQL] Convert arguments to Scala UDFs
+ Michael Armbrust <michael@databricks.com>
+ 2014-11-03 18:04:51 -0800
+ Commit: 15b58a2, github.com/apache/spark/pull/3077
+
+ SPARK-4178. Hadoop input metrics ignore bytes read in RecordReader insta...
+ Sandy Ryza <sandy@cloudera.com>
+ 2014-11-03 15:19:01 -0800
+ Commit: 2812815, github.com/apache/spark/pull/3045
+
+ [SQL] More aggressive defaults
+ Michael Armbrust <michael@databricks.com>
+ 2014-11-03 14:08:27 -0800
+ Commit: 25bef7e, github.com/apache/spark/pull/3064
+
+ [SPARK-4152] [SQL] Avoid data change in CTAS while table already existed
+ Cheng Hao <hao.cheng@intel.com>
+ 2014-11-03 13:59:43 -0800
+ Commit: e83f13e, github.com/apache/spark/pull/3013
+
+ [SPARK-4202][SQL] Simple DSL support for Scala UDF
+ Cheng Lian <lian@databricks.com>
+ 2014-11-03 13:20:33 -0800
+ Commit: c238fb4, github.com/apache/spark/pull/3067
+
+ [SPARK-3594] [PySpark] [SQL] take more rows to infer schema or sampling
+ Davies Liu <davies.liu@gmail.com>, Davies Liu <davies@databricks.com>
+ 2014-11-03 13:17:09 -0800
+ Commit: 24544fb, github.com/apache/spark/pull/2716
+
+ [SPARK-4207][SQL] Query which has syntax like 'not like' is not working in Spark SQL
+ ravipesala <ravindra.pesala@huawei.com>
+ 2014-11-03 13:07:41 -0800
+ Commit: 2b6e1ce, github.com/apache/spark/pull/3075
+
+ [SPARK-4211][Build] Fixes hive.version in Maven profile hive-0.13.1
+ fi <coderfi@gmail.com>
+ 2014-11-03 12:56:56 -0800
+ Commit: df607da, github.com/apache/spark/pull/3072
+
+ [SPARK-4148][PySpark] fix seed distribution and add some tests for rdd.sample
+ Xiangrui Meng <meng@databricks.com>
+ 2014-11-03 12:24:24 -0800
+ Commit: 3cca196, github.com/apache/spark/pull/3010
+
+ [EC2] Factor out Mesos spark-ec2 branch
+ Nicholas Chammas <nicholas.chammas@gmail.com>
+ 2014-11-03 09:02:35 -0800
+ Commit: 2aca97c, github.com/apache/spark/pull/3008
+