path: root/python
Commit message | Author | Age | Files | Lines
* [SPARK-19701][SQL][PYTHON] Throws a correct exception for 'in' operator again... | hyukjinkwon | 2017-03-05 | 2 | -1/+6
* [SPARK-19595][SQL] Support json array in from_json | hyukjinkwon | 2017-03-05 | 1 | -3/+8
* [SPARK-19348][PYTHON] PySpark keyword_only decorator is not thread-safe | Bryan Cutler | 2017-03-03 | 11 | -120/+161
* [SPARK-18352][DOCS] wholeFile JSON update doc and programming guide | Felix Cheung | 2017-03-02 | 2 | -4/+4
* [SPARK-19734][PYTHON][ML] Correct OneHotEncoder doc string to say dropLast | Mark Grover | 2017-03-01 | 1 | -1/+1
* [MINOR][ML] Fix comments in LSH Examples and Python API | Yun Ni | 2017-03-01 | 1 | -1/+1
* [SPARK-19610][SQL] Support parsing multiline CSV files | hyukjinkwon | 2017-02-28 | 4 | -5/+22
* [SPARK-14489][ML][PYSPARK] ALS unknown user/item prediction strategy | Nick Pentreath | 2017-02-28 | 1 | -5/+25
* [SPARK-19660][CORE][SQL] Replace the configuration property names that are de... | Yuming Wang | 2017-02-28 | 1 | -23/+24
* [SPARK-13330][PYSPARK] PYTHONHASHSEED is not propgated to python worker | Jeff Zhang | 2017-02-24 | 2 | -5/+4
* [SPARK-19161][PYTHON][SQL] Improving UDF Docstrings | zero323 | 2017-02-24 | 2 | -11/+25
* [SPARK-14772][PYTHON][ML] Fixed Params.copy method to match Scala implementation | Bryan Cutler | 2017-02-23 | 2 | -6/+27
* [SPARK-19706][PYSPARK] add Column.contains in pyspark | Wenchen Fan | 2017-02-23 | 2 | -1/+3
* [SPARK-18699][SQL] Put malformed tokens into a new field when parsing CSV data | Takeshi Yamamuro | 2017-02-23 | 2 | -16/+48
* [SPARK-19497][SS] Implement streaming deduplication | Shixiong Zhu | 2017-02-23 | 1 | -0/+6
* [SPARK-19405][STREAMING] Support for cross-account Kinesis reads via STS | Adam Budde | 2017-02-22 | 1 | -2/+10
* [MINOR][PYTHON] Fix typo docstring: 'top' -> 'topic' | Rolando Espinoza | 2017-02-17 | 1 | -1/+1
* [SPARK-18352][SQL] Support parsing multiline json files | Nathan Howell | 2017-02-16 | 4 | -10/+37
* [SPARK-18080][ML][PYTHON] Python API & Examples for Locality Sensitive Hashing | Yun Ni | 2017-02-15 | 1 | -0/+291
* [SPARK-19604][TESTS] Log the start of every Python test | Yin Huai | 2017-02-15 | 1 | -1/+1
* [SPARK-18937][SQL] Timezone support in CSV/JSON parsing | Takuya UESHIN | 2017-02-15 | 2 | -24/+39
* [SPARK-19399][SPARKR] Add R coalesce API for DataFrame and Column | Felix Cheung | 2017-02-15 | 1 | -1/+9
* [SPARK-19160][PYTHON][SQL] Add udf decorator | zero323 | 2017-02-15 | 2 | -7/+91
* [SPARK-19590][PYSPARK][ML] Update the document for QuantileDiscretizer in pys... | VinceShieh | 2017-02-15 | 1 | -1/+11
* [SPARK-18541][PYTHON] Add metadata parameter to pyspark.sql.Column.alias() | Sheamus K. Parkes | 2017-02-14 | 2 | -3/+33
* [SPARK-19162][PYTHON][SQL] UserDefinedFunction should validate that func is c... | zero323 | 2017-02-14 | 2 | -0/+12
* [SPARK-19453][PYTHON][SQL][DOC] Correct and extend DataFrame.replace docstring | zero323 | 2017-02-14 | 1 | -6/+12
* [SPARK-19429][PYTHON][SQL] Support slice arguments in Column.__getitem__ | zero323 | 2017-02-13 | 2 | -3/+16
* [SPARK-19427][PYTHON][SQL] Support data type string as a returnType argument ... | zero323 | 2017-02-13 | 2 | -3/+20
* [SPARK-19506][ML][PYTHON] Import warnings in pyspark.ml.util | zero323 | 2017-02-13 | 1 | -0/+1
* [SPARK-16609] Add to_date/to_timestamp with format functions | anabranch | 2017-02-07 | 2 | -6/+55
* [SPARK-19467][ML][PYTHON] Remove cyclic imports from pyspark.ml.pipeline | zero323 | 2017-02-06 | 1 | -1/+1
* [SPARK-19421][ML][PYSPARK] Remove numClasses and numFeatures methods in Linea... | Zheng RuiFeng | 2017-02-05 | 1 | -16/+0
* [SPARK-19389][ML][PYTHON][DOC] Minor doc fixes for ML Python Params and Linea... | Joseph K. Bradley | 2017-02-02 | 2 | -17/+5
* [SPARK-14352][SQL] approxQuantile should support multi columns | Zheng RuiFeng | 2017-02-01 | 2 | -8/+52
* [SPARK-19163][PYTHON][SQL] Delay _judf initialization to the __call__ | zero323 | 2017-01-31 | 2 | -11/+68
* [SPARK-17161][PYSPARK][ML] Add PySpark-ML JavaWrapper convenience function to... | Bryan Cutler | 2017-01-31 | 3 | -3/+77
* [SPARK-19403][PYTHON][SQL] Correct pyspark.sql.column.__all__ list. | zero323 | 2017-01-30 | 1 | -1/+1
* [SPARK-19336][ML][PYSPARK] LinearSVC Python API | wm624@hotmail.com | 2017-01-27 | 3 | -1/+156
* [SPARK-18020][STREAMING][KINESIS] Checkpoint SHARD_END to finish reading clos... | Takeshi YAMAMURO | 2017-01-25 | 1 | -1/+1
* [SPARK-19064][PYSPARK] Fix pip installing of sub components | Holden Karau | 2017-01-25 | 1 | -0/+5
* [SPARK-19307][PYSPARK] Make sure user conf is propagated to SparkContext. | Marcelo Vanzin | 2017-01-25 | 2 | -0/+23
* [SPARK-19229][SQL] Disallow Creating Hive Source Tables when Hive Support is ... | gatorsmile | 2017-01-22 | 1 | -4/+4
* [SPARK-18589][SQL] Fix Python UDF accessing attributes from both side of join | Davies Liu | 2017-01-20 | 1 | -0/+9
* [SPARK-14272][ML] Add Loglikelihood in GaussianMixtureSummary | Zheng RuiFeng | 2017-01-19 | 1 | -0/+10
* [SPARK-19223][SQL][PYSPARK] Fix InputFileBlockHolder for datasources which ar... | Liang-Chi Hsieh | 2017-01-18 | 1 | -0/+24
* [SPARK-19239][PYSPARK] Check parameters whether equals None when specify the ... | DjvuLee | 2017-01-17 | 1 | -3/+6
* [SPARK-19019] [PYTHON] Fix hijacked `collections.namedtuple` and port cloudpi... | hyukjinkwon | 2017-01-17 | 2 | -31/+87
* [SPARK-19148][SQL] do not expose the external table concept in Catalog | Wenchen Fan | 2017-01-17 | 1 | -3/+24
* [SPARK-18687][PYSPARK][SQL] Backward compatibility - creating a Dataframe on ... | Vinayak | 2017-01-13 | 2 | -2/+7