Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

term 'sample' in chapter 4 may be confusing #41

Open
micuat opened this issue Apr 1, 2018 · 0 comments
Open

term 'sample' in chapter 4 may be confusing #41

micuat opened this issue Apr 1, 2018 · 0 comments

Comments

@micuat
Copy link
Contributor

micuat commented Apr 1, 2018

Hi @genekogan, in this line of chapter 4:
https://github.com/ml4a/ml4a.github.io/blame/master/_chapters/how_neural_networks_are_trained.md#L62
(sorry for using blame mode, it's just because github does not allow users to point to a line of rendered markdown)

This principle is closely related to what we call in machine learning “the curse of dimensionality.” Each dimension we add into a search space exponentially blows up the number of samples we require to get good generalization for any model learned from it. The curse of dimensionality is more often applied to datasets; simply put, the more columns or variables a dataset is represented with, the exponentially more samples in that dataset we need to understand it. In our case, we are thinking about the weights rather than the inputs, but the principle remains the same; high-dimensional space is enormous!

the first 'samples' refers to the sampling of the space, and the second one refers to elements of the dataset (if I understand correctly). Since in this chapter the former usage is more common, I suggest that the latter should be replaced by 'data' / 'element' etc to avoid confusion.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant