path: root/python/pyspark
Commit message | Author | Age | Files | Lines
...
* [SPARK-18080][ML][PYTHON] Python API & Examples for Locality Sensitive Hashing | Yun Ni | 2017-02-15 | 1 | -0/+291
* [SPARK-18937][SQL] Timezone support in CSV/JSON parsing | Takuya UESHIN | 2017-02-15 | 2 | -24/+39
* [SPARK-19399][SPARKR] Add R coalesce API for DataFrame and Column | Felix Cheung | 2017-02-15 | 1 | -1/+9
* [SPARK-19160][PYTHON][SQL] Add udf decorator | zero323 | 2017-02-15 | 2 | -7/+91
* [SPARK-19590][PYSPARK][ML] Update the document for QuantileDiscretizer in pys... | VinceShieh | 2017-02-15 | 1 | -1/+11
* [SPARK-18541][PYTHON] Add metadata parameter to pyspark.sql.Column.alias() | Sheamus K. Parkes | 2017-02-14 | 2 | -3/+33
* [SPARK-19162][PYTHON][SQL] UserDefinedFunction should validate that func is c... | zero323 | 2017-02-14 | 2 | -0/+12
* [SPARK-19453][PYTHON][SQL][DOC] Correct and extend DataFrame.replace docstring | zero323 | 2017-02-14 | 1 | -6/+12
* [SPARK-19429][PYTHON][SQL] Support slice arguments in Column.__getitem__ | zero323 | 2017-02-13 | 2 | -3/+16
* [SPARK-19427][PYTHON][SQL] Support data type string as a returnType argument ... | zero323 | 2017-02-13 | 2 | -3/+20
* [SPARK-19506][ML][PYTHON] Import warnings in pyspark.ml.util | zero323 | 2017-02-13 | 1 | -0/+1
* [SPARK-16609] Add to_date/to_timestamp with format functions | anabranch | 2017-02-07 | 2 | -6/+55
* [SPARK-19467][ML][PYTHON] Remove cyclic imports from pyspark.ml.pipeline | zero323 | 2017-02-06 | 1 | -1/+1
* [SPARK-19421][ML][PYSPARK] Remove numClasses and numFeatures methods in Linea... | Zheng RuiFeng | 2017-02-05 | 1 | -16/+0
* [SPARK-19389][ML][PYTHON][DOC] Minor doc fixes for ML Python Params and Linea... | Joseph K. Bradley | 2017-02-02 | 2 | -17/+5
* [SPARK-14352][SQL] approxQuantile should support multi columns | Zheng RuiFeng | 2017-02-01 | 2 | -8/+52
* [SPARK-19163][PYTHON][SQL] Delay _judf initialization to the __call__ | zero323 | 2017-01-31 | 2 | -11/+68
* [SPARK-17161][PYSPARK][ML] Add PySpark-ML JavaWrapper convenience function to... | Bryan Cutler | 2017-01-31 | 3 | -3/+77
* [SPARK-19403][PYTHON][SQL] Correct pyspark.sql.column.__all__ list. | zero323 | 2017-01-30 | 1 | -1/+1
* [SPARK-19336][ML][PYSPARK] LinearSVC Python API | wm624@hotmail.com | 2017-01-27 | 3 | -1/+156
* [SPARK-18020][STREAMING][KINESIS] Checkpoint SHARD_END to finish reading clos... | Takeshi YAMAMURO | 2017-01-25 | 1 | -1/+1
* [SPARK-19307][PYSPARK] Make sure user conf is propagated to SparkContext. | Marcelo Vanzin | 2017-01-25 | 2 | -0/+23
* [SPARK-19229][SQL] Disallow Creating Hive Source Tables when Hive Support is ... | gatorsmile | 2017-01-22 | 1 | -4/+4
* [SPARK-18589][SQL] Fix Python UDF accessing attributes from both side of join | Davies Liu | 2017-01-20 | 1 | -0/+9
* [SPARK-14272][ML] Add Loglikelihood in GaussianMixtureSummary | Zheng RuiFeng | 2017-01-19 | 1 | -0/+10
* [SPARK-19223][SQL][PYSPARK] Fix InputFileBlockHolder for datasources which ar... | Liang-Chi Hsieh | 2017-01-18 | 1 | -0/+24
* [SPARK-19239][PYSPARK] Check parameters whether equals None when specify the ... | DjvuLee | 2017-01-17 | 1 | -3/+6
* [SPARK-19019] [PYTHON] Fix hijacked `collections.namedtuple` and port cloudpi... | hyukjinkwon | 2017-01-17 | 2 | -31/+87
* [SPARK-19148][SQL] do not expose the external table concept in Catalog | Wenchen Fan | 2017-01-17 | 1 | -3/+24
* [SPARK-18687][PYSPARK][SQL] Backward compatibility - creating a Dataframe on ... | Vinayak | 2017-01-13 | 2 | -2/+7
* [SPARK-19055][SQL][PYSPARK] Fix SparkSession initialization when SparkContext... | Liang-Chi Hsieh | 2017-01-12 | 2 | -6/+33
* [SPARK-19164][PYTHON][SQL] Remove unused UserDefinedFunction._broadcast | zero323 | 2017-01-12 | 1 | -6/+0
* [SPARK-19140][SS] Allow update mode for non-aggregation streaming queries | Shixiong Zhu | 2017-01-10 | 1 | -8/+19
* [SPARK-17645][MLLIB][ML][FOLLOW-UP] document minor change | Peng, Meng | 2017-01-10 | 2 | -7/+8
* [SPARK-17847][ML] Reduce shuffled data size of GaussianMixture & copy the imp... | Yanbo Liang | 2017-01-09 | 1 | -18/+8
* [SPARK-19126][DOCS] Update Join Documentation Across Languages | anabranch | 2017-01-08 | 1 | -2/+3
* [SPARK-19127][DOCS] Update Rank Function Documentation | anabranch | 2017-01-08 | 1 | -6/+10
* [SPARK-13748][PYSPARK][DOC] Add the description for explictly setting None fo... | hyukjinkwon | 2017-01-07 | 1 | -1/+3
* [MINOR][DOCS] Remove consecutive duplicated words/typo in Spark Repo | Niranjan Padmanabhan | 2017-01-04 | 3 | -4/+4
* [SPARK-17645][MLLIB][ML] add feature selector method based on: False Discover... | Peng | 2016-12-28 | 2 | -15/+109
* [SPARK-18949][SQL] Add recoverPartitions API to Catalog | gatorsmile | 2016-12-20 | 1 | -0/+5
* [SPARK-18576][PYTHON] Add basic TaskContext information to PySpark | Holden Karau | 2016-12-20 | 4 | -1/+165
* [SPARK-18281] [SQL] [PYSPARK] Remove timeout for reading data through socket ... | Liang-Chi Hsieh | 2016-12-20 | 2 | -6/+17
* [SPARK-18888] partitionBy in DataStreamWriter in Python throws _to_seq not de... | Burak Yavuz | 2016-12-15 | 2 | -3/+5
* [SPARK-18852][SS] StreamingQuery.lastProgress should be null when recentProgr... | Shixiong Zhu | 2016-12-14 | 2 | -3/+24
* [SPARK-18628][ML] Update Scala param and Python param to have quotes | krishnakalyan3 | 2016-12-11 | 1 | -2/+2
* [SPARK-18766][SQL] Push Down Filter Through BatchEvalPython (Python UDF) | gatorsmile | 2016-12-10 | 1 | -0/+9
* [SPARK-16589] [PYTHON] Chained cartesian produces incorrect number of records | Andrew Ray | 2016-12-08 | 2 | -23/+53
* [SPARK-18667][PYSPARK][SQL] Change the way to group row in BatchEvalPythonExe... | Liang-Chi Hsieh | 2016-12-08 | 1 | -0/+8
* [SPARK-18754][SS] Rename recentProgresses to recentProgress | Michael Armbrust | 2016-12-07 | 2 | -5/+5