Wednesday, July 29, 2015

Spark MLlib

  1. Common algorithms
    1. classification
    2. regression
    3. clustering
    4. collaborative filtering
    5. dimensionality reduction
  2. data science algorithms (python)
    Module - Classificationpyspark.mllib.classification
    Linear Support Vector MachineThe SVMWithSGD class
    Naive BayesThe NaiveBayes class
    Logistic regressionThe LogisticRegressionWithSGD class
    Module - Regressionpyspark.mllib.regression
    Linear least squaresThe LinearRegressionWithSGD class
    Ridge regressionLinear least squares with L2 regularization, it is implemented with the RidgeRegressionWithSGD class
    LassoLinear least squares with L1 regularization, it is implemented with the LassoWithSGD class
    Module - Clusteringpyspark.mllib.clustering
    K-Meansthe KMeans class
    Module - Recommendationpyspark.mllib.recommendation
    Alternating Least Squaresthe ALS class

No comments:

Post a Comment

Note: Only a member of this blog may post a comment.