docs: Use logistic regression instead of SVC on the Iris dataset #1018

sylvaincom · 2024-12-24T10:13:07Z

Which part of the documentation needs improvement?

README, quick start example

Describe the problem found in the documentation

Currently, on the Iris dataset, for classification, we use SVC.

Suggested improvement

@glemaitre suggests using logistic regression.

Additional context

Related to #1004.
Waiting on this PR to be merged: #1009

glemaitre · 2024-12-24T10:31:23Z

Some background here: SVC does not really scale with the number of samples. It was useful at the time that kernels were popular. But nowadays, I would even advocate with a kernel approximation and a logistic regression to achieve the same thing and it will scale.

So as a general rule, I think that we should really either show:

LogisticRegression when it comes to classification with linear model
HistGradientBoosting in regression and classification when we want to show the direct state of the art models.

Then, we can use any other classifier or regressor but it means that we want to show a specific feature of this particular model.

Fix #1018 In quick start example, do not use SVC on the Iris dataset, but rather logistic regression

sylvaincom added documentation Improvements or additions to documentation needs-triage This has been recently submitted and needs attention labels Dec 24, 2024

sylvaincom self-assigned this Dec 24, 2024

sylvaincom mentioned this issue Dec 24, 2024

docs: Modify the doc following the v0.5 release, part 1 #1004

Merged

tuscland removed the needs-triage This has been recently submitted and needs attention label Jan 3, 2025

sylvaincom mentioned this issue Jan 10, 2025

docs: Use logistic regression instead of SVC on the Iris dataset #1087

Merged

sylvaincom linked a pull request Jan 10, 2025 that will close this issue

docs: Use logistic regression instead of SVC on the Iris dataset #1087

Merged

sylvaincom closed this as completed in 5314915 Jan 12, 2025

sylvaincom closed this as completed in #1087 Jan 12, 2025

thomass-dev pushed a commit that referenced this issue Jan 13, 2025

docs: Use logistic regression instead of SVC on the Iris dataset (#1087)

d25fd2d

Fix #1018 In quick start example, do not use SVC on the Iris dataset, but rather logistic regression

rouk1 pushed a commit that referenced this issue Jan 14, 2025

docs: Use logistic regression instead of SVC on the Iris dataset (#1087)

9ce24f6

Fix #1018 In quick start example, do not use SVC on the Iris dataset, but rather logistic regression

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: Use logistic regression instead of SVC on the Iris dataset #1018

docs: Use logistic regression instead of SVC on the Iris dataset #1018

sylvaincom commented Dec 24, 2024 •

edited

Loading

glemaitre commented Dec 24, 2024

docs: Use logistic regression instead of SVC on the Iris dataset #1018

docs: Use logistic regression instead of SVC on the Iris dataset #1018

Comments

sylvaincom commented Dec 24, 2024 • edited Loading

Which part of the documentation needs improvement?

Describe the problem found in the documentation

Suggested improvement

Additional context

glemaitre commented Dec 24, 2024

sylvaincom commented Dec 24, 2024 •

edited

Loading