[WIP] Adds Linear, Ridge and Lasso regressions #10

Nimpruda · 2019-12-12T18:27:07Z

see #7

[WIP] Add Linear ridge and lasso regression

codecov · 2019-12-12T18:46:32Z

Codecov Report

Merging #10 into master will decrease coverage by 13.35%.
The diff coverage is 0%.

@@             Coverage Diff             @@
##           master      #10       +/-   ##
===========================================
- Coverage   96.68%   83.33%   -13.36%     
===========================================
  Files           7        9        +2     
  Lines         181      210       +29     
===========================================
  Hits          175      175               
- Misses          6       35       +29

Impacted Files	Coverage Δ
...infa-supervised/src/linear_regression/algorithm.rs	`0% <0%> (ø)`
linfa-supervised/src/ridge_regression/algorithm.rs	`0% <0%> (ø)`

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update ec5d491...74e30f9.

remram44 · 2019-12-14T01:45:06Z

Cargo.toml

@@ -13,6 +13,8 @@ keywords = ["machine-learning", "linfa", "ai", "ml"]
 categories = ["algorithms", "mathematics", "science"]

 [dependencies]
+
+linfa-supervised = {path = "linfa-supervised"}


You need to add a version number, or publishing the linfa crate to crates.io will fail 😉

Suggested change

-linfa-supervised = {path = "linfa-supervised"}
+linfa-supervised = { path = "linfa-supervised", version = "0.1" }

LukeMathWalker · 2019-12-15T18:13:43Z

linfa-supervised/Cargo.toml

+
+[dependencies]
+ndarray = { version = "0.13" , features = ["rayon"] }
+ndarray-linalg = { version = "0.12", features = ["openblas"] }


We don't want to force one BLAS implementation over the others (e.g. intel-mkl). I would suggest using a feature-based strategy similar to what I put down here: https://github.com/rust-ndarray/ndarray-examples/blob/d7d43e018cade1f8bf0d92220081ba66963bab07/linear_regression/Cargo.toml#L8

LukeMathWalker · 2019-12-15T18:15:09Z

linfa-supervised/examples/main.rs

+    let x_hat = array![[6.0], [7.0]];
+    println!("{:#?}", linear_regression.predict(&x_hat));
+
+    let mut linear_regression2 = LinearRegression::new(true);


I would suggest splitting this example (and the one below) into two separate examples, each with a descriptive name (e.g. with/without intercept).
Ideally they would also live in separate files under the examples folder.

LukeMathWalker · 2019-12-15T18:17:54Z

linfa-supervised/src/linear_regression/algorithm.rs

+use ndarray::{stack, Array, Array1, ArrayBase, Axis, Data, Ix1, Ix2};
+use ndarray_linalg::Solve;
+/* I will probably change the implementation for an enum for more type safety.
+I have to make sure it is a great idea when it comes to Python interoperability


There is considerable freedom in how to wrap the Rust version for Python consumption - as detailed in #8, we shouldn't let Python move our Rust design in directions which are not idiomatic. The wrapping code can do the bridging when required 😀

So I'd definitely suggest going with the commented-out version, which uses an enum: LinearRegression::new(false) is much more confusing than LinearRegression::new(Intercept::NoIntercept).
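
To make the readability argument concrete, here is a minimal sketch of what an enum-based constructor could look like. Only `LinearRegression::new` and `Intercept::NoIntercept` come from the comment above; the `WithIntercept` variant, the field layout and the `fits_intercept` helper are assumptions made for illustration and are not taken from the PR's code.

```rust
/// Whether the model should fit an intercept term.
/// `WithIntercept` is a hypothetical variant name.
#[derive(Debug, Clone, Copy, PartialEq)]
pub enum Intercept {
    NoIntercept,
    WithIntercept,
}

/// Illustrative stand-in for the estimator; the real struct holds more state.
pub struct LinearRegression {
    intercept: Intercept,
}

impl LinearRegression {
    /// The call site now says what the argument means.
    pub fn new(intercept: Intercept) -> Self {
        LinearRegression { intercept }
    }

    /// Hypothetical helper, only here so the example has something to check.
    pub fn fits_intercept(&self) -> bool {
        self.intercept == Intercept::WithIntercept
    }
}

fn main() {
    let without = LinearRegression::new(Intercept::NoIntercept);
    let with = LinearRegression::new(Intercept::WithIntercept);
    println!("{} {}", without.fits_intercept(), with.fits_intercept());
}
```

A reader of `LinearRegression::new(Intercept::NoIntercept)` does not need to look up the signature, and the compiler rejects an unrelated `bool` passed by mistake.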

LukeMathWalker · 2019-12-15T18:19:32Z

linfa-supervised/src/ridge_regression/algorithm.rs

+use ndarray::{Array, Array1, ArrayBase, Data, Ix1, Ix2};
+use ndarray_linalg::Solve;
+/* The difference between a linear regression and a Ridge regression is
+ that ridge regression has an L2 penalisation term to having some features


Typo? Should this read "to avoid having"?

LukeMathWalker · 2019-12-15T18:20:33Z

linfa-supervised/src/ridge_regression/algorithm.rs

+}
+
+impl RidgeRegression {
+    pub fn new(alpha: f64) -> RidgeRegression {


Shouldn't we have an intercept parameter here as well?
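
As a rough illustration of what an intercept option means for ridge regression, the sketch below solves the single-feature case in closed form, leaving the intercept unpenalised by centring the data first. It uses plain slices rather than ndarray, and `RidgeParams`, `fit_intercept` and `solve` are hypothetical names, not the API in this PR.

```rust
/// Hyperparameters for a single-feature ridge regression.
/// The struct and its field names are hypothetical.
pub struct RidgeParams {
    /// Strength of the L2 penalty on the slope.
    pub alpha: f64,
    /// Whether to fit an (unpenalised) intercept.
    pub fit_intercept: bool,
}

impl RidgeParams {
    /// Returns `(intercept, slope)` minimising
    /// sum((y - intercept - slope * x)^2) + alpha * slope^2.
    /// Empty input yields NaN values; a real implementation should reject it.
    pub fn solve(&self, x: &[f64], y: &[f64]) -> (f64, f64) {
        assert_eq!(x.len(), y.len(), "x and y must have the same length");
        let n = x.len() as f64;
        // Centring the data is what keeps the intercept out of the penalty.
        let (x_mean, y_mean) = if self.fit_intercept {
            (x.iter().sum::<f64>() / n, y.iter().sum::<f64>() / n)
        } else {
            (0.0, 0.0)
        };
        let sxy: f64 = x
            .iter()
            .zip(y)
            .map(|(xi, yi)| (xi - x_mean) * (yi - y_mean))
            .sum();
        let sxx: f64 = x.iter().map(|xi| (xi - x_mean).powi(2)).sum();
        let slope = sxy / (sxx + self.alpha);
        (y_mean - slope * x_mean, slope)
    }
}

fn main() {
    let x = [1.0, 2.0, 3.0, 4.0];
    let y = [3.0, 5.0, 7.0, 9.0];
    let params = RidgeParams { alpha: 5.0, fit_intercept: true };
    let (intercept, slope) = params.solve(&x, &y);
    println!("intercept = {}, slope = {}", intercept, slope);
}
```

With `alpha` set to zero this reduces to ordinary least squares, which is one way to sanity-check the two estimators against each other.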

LukeMathWalker · 2019-12-15T18:21:37Z

linfa-supervised/src/utils.rs

@@ -0,0 +1,6 @@
+trait GradientDescent {


Is this used anywhere?

LukeMathWalker · 2019-12-15T18:24:23Z

linfa-supervised/Cargo.toml

@@ -0,0 +1,16 @@
+[package]
+name = "linfa-supervised"


I would probably prefer this to be named linfa-linear: we will have many more algorithms belonging to the family of supervised learning and we don't want all of them in the same sub-crate.

LukeMathWalker · 2019-12-15T18:31:06Z

linfa-supervised/src/linear_regression/algorithm.rs

+            let dummy_column: Array<f64, _> = Array::ones((n_samples, 1));
+            let X = stack(Axis(1), &[dummy_column.view(), X.view()]).unwrap();
+            match &self.beta {
+                None => panic!("The linear regression estimator has to be fitted first!"),


This is not ideal - can we refactor the way we build LinearRegression to make sure that beta is always there when a user calls predict?

The easiest way to do this is re-using the same approach I put down in KMeans: a structure to hold the hyperparameters of the model (built with/without builder pattern, depending on how many hyperparameters we have there) and a fit method on it that returns a fitted LinearRegression - basically, using fit as constructor. In this way we can be sure that beta is there and we have one less panic condition 😁
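
The "fit as constructor" idea can be sketched as follows. This is not the KMeans code and not the implementation in this PR: it is a single-feature least-squares toy on plain slices, and `LinearRegressionParams`, `FittedLinearRegression` and the `Result` error type are assumptions chosen for illustration.

```rust
/// Hyperparameters only; nothing fitted is stored here.
/// All names in this sketch are hypothetical.
pub struct LinearRegressionParams {
    fit_intercept: bool,
}

/// A fitted model. It can only be obtained through `fit`,
/// so the coefficients are always present when `predict` is called.
pub struct FittedLinearRegression {
    intercept: f64,
    slope: f64,
}

impl LinearRegressionParams {
    pub fn new(fit_intercept: bool) -> Self {
        LinearRegressionParams { fit_intercept }
    }

    /// Single-feature ordinary least squares; returns the fitted model.
    pub fn fit(&self, x: &[f64], y: &[f64]) -> Result<FittedLinearRegression, &'static str> {
        if x.is_empty() || x.len() != y.len() {
            return Err("x and y must be non-empty and of equal length");
        }
        let n = x.len() as f64;
        let (x_mean, y_mean) = if self.fit_intercept {
            (x.iter().sum::<f64>() / n, y.iter().sum::<f64>() / n)
        } else {
            (0.0, 0.0)
        };
        let sxy: f64 = x
            .iter()
            .zip(y)
            .map(|(xi, yi)| (xi - x_mean) * (yi - y_mean))
            .sum();
        let sxx: f64 = x.iter().map(|xi| (xi - x_mean).powi(2)).sum();
        if sxx == 0.0 {
            return Err("x has no variation, the slope is undefined");
        }
        let slope = sxy / sxx;
        Ok(FittedLinearRegression {
            intercept: y_mean - slope * x_mean,
            slope,
        })
    }
}

impl FittedLinearRegression {
    /// No `Option` to unwrap and no "fit first" panic: the type guarantees it.
    pub fn predict(&self, x: &[f64]) -> Vec<f64> {
        x.iter().map(|xi| self.intercept + self.slope * xi).collect()
    }
}

fn main() {
    let params = LinearRegressionParams::new(true);
    match params.fit(&[1.0, 2.0, 3.0, 4.0], &[3.0, 5.0, 7.0, 9.0]) {
        Ok(model) => println!("{:?}", model.predict(&[6.0, 7.0])),
        Err(e) => eprintln!("fit failed: {}", e),
    }
}
```

Because `predict` only exists on the fitted type, calling it before `fit` is a compile-time error rather than a runtime panic.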

LukeMathWalker · 2019-12-15T18:33:16Z

Thanks for working on it @Nimpruda!
I left some comments here and there to give you pointers on the next step to make this PR ready to be merged - it's already half-way there 😁

I see we have some issues in CI due to ndarray-linalg requiring a BLAS implementation to build correctly - I'll work in a separate PR to get this issue solved so that your branch can compile correctly on Travis 👍

Nimpruda · 2019-12-15T20:04:06Z

I'll work on it whenever I have time! Thanks for all the feedback

bytesnake · 2020-07-20T14:32:32Z

superseded by #20

Nimpruda and others added 3 commits December 2, 2019 21:44

First attempt to implement linear regression

6e20f09

Added ridge regression and some examples

8860880

Merge pull request #1 from LukeMathWalker/master

00ba794

[WIP] Add Linear ridge and lasso regression

Nimpruda added 2 commits December 12, 2019 20:05

Miscellaneous modifications

24c9117

Merge branch 'master' of github.com:Nimpruda/linfa

74e30f9


LukeMathWalker mentioned this pull request Dec 15, 2019: Roadmap #7 (Open, 24 tasks)

paulkoerbitz mentioned this pull request May 20, 2020: Add linfa-linear package with ordinary least squares regression #20 (Merged)

bytesnake closed this Jul 20, 2020
