Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Merge pull request #812 from shivaram/maven-mllib-tests | Matei Zaharia | 2013-08-12 | 6 | -7/+32 |
|\ | | | | | Create SparkContext in beforeAll for MLLib tests | ||||
| * | Create SparkContext in beforeAll for MLLib tests | Shivaram Venkataraman | 2013-08-11 | 6 | -7/+32 |
| | | | | | | | | This overcomes test failures that occur using Maven | ||||
* | | Clean up scaladoc in ML Lib. | Shivaram Venkataraman | 2013-08-11 | 17 | -60/+171 |
|/ | | | | | Also build and copy ML Lib scaladoc in Spark docs build. Some more minor cleanup with respect to naming, test locations etc. | ||||
* | Merge pull request #762 from shivaram/sgd-cleanup | Evan Sparks | 2013-08-11 | 21 | -417/+816 |
|\ | | | | | Refactor SGD options into a new class. | ||||
| * | Fix GLM code review comments and move java tests | Shivaram Venkataraman | 2013-08-10 | 4 | -6/+2 |
| | | |||||
| * | Add setters for optimizer, gradient in SGD. | Shivaram Venkataraman | 2013-08-08 | 2 | -8/+19 |
| | | | | | | | | Also remove java-specific constructor for LabeledPoint. | ||||
| * | Merge branch 'master' of git://github.com/mesos/spark into sgd-cleanup | Shivaram Venkataraman | 2013-08-06 | 6 | -6/+366 |
| |\ | | | | | | | | | | | | | Conflicts: mllib/src/main/scala/spark/mllib/util/MLUtils.scala | ||||
| * | | Refactor GLM algorithms and add Java tests | Shivaram Venkataraman | 2013-08-06 | 20 | -169/+540 |
| | | | | | | | | | | | | | | | | | | | | | | | | | | | This change adds Java examples and unit tests for all GLM algorithms to make sure the MLLib interface works from Java. Changes include - Introduce LabeledPoint and avoid using Doubles in train arguments - Rename train to run in class methods - Make the optimizer a member variable of GLM to make sure the builder pattern works | ||||
| * | | Move implicit arg to constructor for Java access. | Shivaram Venkataraman | 2013-08-03 | 1 | -4/+7 |
| | | | |||||
| * | | Refactor optimizers and create GLMs | Shivaram Venkataraman | 2013-08-02 | 10 | -286/+320 |
| | | | | | | | | | | | | | | | | | | | | | This change refactors the structure of GLMs to use mixins which maintain a similar interface to other ML lib algorithms. This change also creates an Optimizer trait which allows GLMs to be extended to use other optimization techniques. | ||||
| * | | Refactor SGD options into a new class. | Shivaram Venkataraman | 2013-07-31 | 8 | -159/+143 |
| | | | | | | | | | | | | | | | This refactoring pulls out code shared between SVM, Lasso, LR into a common GradientDescentOpts class. Some style cleanup as well | ||||
* | | | Merge pull request #786 from shivaram/mllib-java | Matei Zaharia | 2013-08-09 | 6 | -30/+285 |
|\ \ \ | | | | | | | | | Java fixes, tests and examples for ALS, KMeans | ||||
| * | | | Remove Java-specific constructor for Rating. | Shivaram Venkataraman | 2013-08-08 | 3 | -12/+3 |
| | | | | | | | | | | | | | | | | | | | | The scala constructor works for native type java types. Modify examples to match this. | ||||
| * | | | Add a test case for random initialization. | Shivaram Venkataraman | 2013-08-06 | 2 | -2/+13 |
| | | | | | | | | | | | | | | | | Also workaround a bug where double[][] class cast fails | ||||
| * | | | Java examples, tests for KMeans and ALS | Shivaram Venkataraman | 2013-08-06 | 6 | -27/+280 |
| | |/ | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Changes ALS to accept RDD[Rating] instead of (Int, Int, Double) making it easier to call from Java - Renames class methods from `train` to `run` to enable static methods to be called from Java. - Add unit tests which check if both static / class methods can be called. - Also add examples which port the main() function in ALS, KMeans to the examples project. Couple of minor changes to existing code: - Add a toJavaRDD method in RDD to convert scala RDD to java RDD easily - Workaround a bug where using double[] from Java leads to class cast exception in KMeans init | ||||
* / | | Fixed a typo in mllib inline documentation. | Reynold Xin | 2013-08-08 | 1 | -1/+1 |
|/ / | |||||
* | | fixing formatting, style, and input | Ginger Smith | 2013-08-05 | 1 | -36/+37 |
| | | |||||
* | | fixing formatting | Ginger Smith | 2013-08-05 | 1 | -16/+23 |
| | | |||||
* | | adding matrix factorization data generator | Ginger Smith | 2013-08-02 | 1 | -0/+105 |
| | | |||||
* | | Increase Kryo buffer size in ALS since some arrays become big | Matei Zaharia | 2013-08-02 | 1 | -0/+1 |
| | | |||||
* | | Merge pull request #761 from mateiz/kmeans-generator | shivaram | 2013-07-31 | 2 | -4/+85 |
|\ \ | | | | | | | Add data generator for K-means | ||||
| * | | Turn on caching in KMeans.main | Matei Zaharia | 2013-07-31 | 1 | -1/+1 |
| | | | |||||
| * | | Added data generator for K-means | Matei Zaharia | 2013-07-31 | 2 | -3/+84 |
| |/ | | | | | | | Also made it possible to specify the number of runs in KMeans.main(). | ||||
* | | Merge pull request #753 from shivaram/glm-refactor | Matei Zaharia | 2013-07-31 | 1 | -0/+165 |
|\ \ | | | | | | | Build changes for ML lib | ||||
| * | | Add bagel, mllib to SBT assembly. | Shivaram Venkataraman | 2013-07-30 | 1 | -0/+165 |
| | | | | | | | | | | | | Also add jblas dependency to mllib pom.xml | ||||
* | | | Use the Char version of split() instead of the String one for efficiency | Matei Zaharia | 2013-07-31 | 1 | -2/+2 |
| |/ |/| | |||||
* | | Minor style cleanup of mllib. | Reynold Xin | 2013-07-30 | 5 | -35/+39 |
| | | |||||
* | | Use a tigher bound in logistic regression unit test's prediction validation. | Reynold Xin | 2013-07-30 | 1 | -3/+4 |
| | | |||||
* | | Renamed Classification.scala to ClassificationModel.scala and ↵ | Reynold Xin | 2013-07-30 | 2 | -0/+0 |
|/ | | | | Regression.scala to RegressionModel.scala | ||||
* | made SimpleUpdater consistent with other updaters | Ameet Talwalkar | 2013-07-29 | 1 | -1/+2 |
| | |||||
* | Clarify how regVal is computed in Updater docs | Shivaram Venkataraman | 2013-07-29 | 1 | -8/+9 |
| | |||||
* | Remove duplicate loss history and clarify why. | Shivaram Venkataraman | 2013-07-29 | 3 | -13/+9 |
| | | | | Also some minor style fixes. | ||||
* | Style fix | Xinghao | 2013-07-29 | 2 | -2/+4 |
| | | | | Lines shortened to < 100 characters | ||||
* | Fix validatePrediction functions for Classification models | Xinghao | 2013-07-29 | 2 | -4/+2 |
| | | | | | Classifiers return categorical (Int) values that should be compared directly | ||||
* | Deleting extra LogisticRegressionGenerator and RidgeRegressionGenerator | Xinghao | 2013-07-29 | 2 | -96/+0 |
| | |||||
* | Fix rounding error in LogisticRegression.scala | Xinghao | 2013-07-29 | 1 | -2/+4 |
| | |||||
* | Replace map-reduce with dot operator using DoubleMatrix | Xinghao | 2013-07-28 | 4 | -8/+18 |
| | |||||
* | Fixed SVM and LR train functions to take Int instead of Double for ↵ | Xinghao | 2013-07-28 | 3 | -22/+21 |
| | | | | Classification | ||||
* | Changed Classification to return Int instead of Double | Xinghao | 2013-07-28 | 7 | -30/+28 |
| | | | | Also minor changes to formatting and comments | ||||
* | SVMSuite and LassoSuite rewritten to follow closely with LogisticRegressionSuite | Xinghao | 2013-07-28 | 2 | -35/+161 |
| | |||||
* | Move data generators to util | Xinghao | 2013-07-28 | 2 | -0/+0 |
| | |||||
* | Change *_LocalRandomSGD to *LocalRandomSGD | Xinghao | 2013-07-28 | 6 | -41/+24 |
| | |||||
* | Resolve conflicts with master, removed regParam for LogisticRegression | Xinghao | 2013-07-26 | 6 | -64/+412 |
| | |||||
* | New files from merge with master | Xinghao | 2013-07-26 | 15 | -7/+399 |
|\ | |||||
| * | Use a different validation dataset for Logistic Regression prediction testing. | Reynold Xin | 2013-07-23 | 1 | -12/+17 |
| | | |||||
| * | Made RegressionModel serializable and added unit tests to make sure predict ↵ | Reynold Xin | 2013-07-23 | 6 | -16/+42 |
| | | | | | | | | methods would work. | ||||
| * | Merge pull request #711 from shivaram/ml-generators | Matei Zaharia | 2013-07-19 | 2 | -39/+75 |
| |\ | | | | | | | Move ML lib data generator files to util/ | ||||
| | * | Rename classes to be called DataGenerator | Shivaram Venkataraman | 2013-07-18 | 2 | -3/+2 |
| | | | |||||
| | * | Refactor data generators to have a function that can be used in tests. | Shivaram Venkataraman | 2013-07-18 | 2 | -34/+71 |
| | | | |||||
| | * | Move ML lib data generator files to util/ | Shivaram Venkataraman | 2013-07-17 | 2 | -2/+2 |
| | | |