-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Usage to build CNN Network #14
Comments
Unfortunately there really is no good high-level documentation at this point. This is on my TODO list, but is likely to take some time as there's a lot to document ;) For your particular case, there are two examples of how you might go about building a full network in the models section. In general, models using this code are going to be quite slow in comparison to any keras/tf/torch/theano implementations - the code here is optimized for readability over speed / efficiency. That said, I think it's a great idea to have some simple examples to show how the NN code corresponds to other packages. |
In general, if you want to implement a model, you'll probably want the following methods as a bare-minimum: _build_network(self, ...):
# initialize the network layers and store them within an
# OrderedDict so you can reliably iterate over them during the
# forward / backward passes
forward(self, X):
# perform a forward pass. this is where the specific model architecture comes
# into play, since you'll need to define how outputs from early layers flow to
# inputs of subsequent layers
backward(self, dLdy):
# perform a backward pass. again, the route the gradients take through the network
# will be specific to the particular model architecture |
So basically numpy-ml follows some kind of |
Yeah, more or less. The major difference is that this code won't have a built-in |
Is there any documentation for usage to build a network?
I want to try to implement some simple network based on for example MNIST dataset.
If there is no documentation, i think we can write one. For example, in keras, we can have model built like this:
The text was updated successfully, but these errors were encountered: