Clarification on Neural Network Architecture with TrainWithClassifier Class #655
-
Hello Kevin and the community! I have a question regarding the neural network architecture when using TrainWithClassifier with triplet margin loss. I understand it takes a triplet of an anchor, positive, and negative image as input during training. However, the library abstracts away a lot of these details so elegantly that I wanted to confirm my understanding :) If I were to draw a diagram of the neural network architecture, would it be correct to show the input as 3 images (anchor, positive, negative) that then pass through the CNN trunk, embedding layers, and classifier? Or does the library handle the triplet sampling under the hood in a way that the neural network itself just sees a batch of images as input? I would greatly appreciate any clarification or confirmation on the architecture. Thank you!
-
During both training and testing, the neural network just sees a batch of images. For each image, the neural network computes an embedding, independently of all the other images. You could have a batch size of 1, and the neural network wouldn't know the difference.

The triplet construction happens only to compute the loss value during training. Given the embeddings of a batch, the loss function is calculated by comparing the distances between the anchor, positive, and negative embeddings. But this happens after the embeddings for the batch are computed.

So the steps are:

1. Pass the batch of images through the network to get one embedding per image.
2. Form (anchor, positive, negative) triplets from those embeddings, using the labels.
3. Compute the triplet margin loss from the distances within each triplet.
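To make this concrete, here is a minimal sketch of that flow using the library's TripletMarginLoss. The trunk model, embedding size, margin, and batch shape are illustrative assumptions, not defaults from the library:

```python
import torch
from torchvision import models
from pytorch_metric_learning import losses

# Illustrative trunk: any CNN that maps a batch of images to embeddings.
trunk = models.resnet18()
trunk.fc = torch.nn.Linear(trunk.fc.in_features, 128)  # 128-dim embeddings (arbitrary choice)

images = torch.randn(32, 3, 224, 224)  # a plain batch of images, no triplet structure
labels = torch.randint(0, 10, (32,))   # one class label per image

# Step 1: embeddings are computed per image, with no notion of triplets.
embeddings = trunk(images)

# Steps 2-3: the loss function forms triplets from the batch (via the labels)
# and compares anchor-positive vs. anchor-negative distances.
loss_func = losses.TripletMarginLoss(margin=0.1)
loss = loss_func(embeddings, labels)
loss.backward()
```

Note that the forward pass never sees triplets; the triplet structure exists only inside the loss computation.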
Adding the classifier layer just means that the triplet loss is computed on the output of the 2nd-to-last layer (the embeddings) instead of the final layer, while the classification loss is computed at the final (classifier) layer.
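As a rough sketch of how the two losses combine in a TrainWithClassifier-style setup (the layer sizes and the unweighted sum below are assumptions for illustration; the actual trainer lets you configure models and loss weights):

```python
import torch
from pytorch_metric_learning import losses

embedder = torch.nn.Linear(512, 128)   # assumed: maps trunk features to embeddings
classifier = torch.nn.Linear(128, 10)  # assumed: maps embeddings to class logits

trunk_output = torch.randn(32, 512)    # stand-in for the CNN trunk's features
labels = torch.randint(0, 10, (32,))

embeddings = embedder(trunk_output)    # 2nd-to-last layer: triplet loss applies here
logits = classifier(embeddings)        # final layer: classification loss applies here

metric_loss = losses.TripletMarginLoss()(embeddings, labels)
classifier_loss = torch.nn.functional.cross_entropy(logits, labels)
total_loss = metric_loss + classifier_loss  # assumed equal weighting
total_loss.backward()
```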
-
Hi Kevin, thanks for your detailed explanation. It is very clear. As I understand it, Triplet Loss and Siamese Neural Networks can also be used to learn a good distance function for the dataset. Do you have any intuitions about the similarities and differences between using Triplet Loss + Siamese Neural Networks versus Triplet Loss + a 'normal' CNN?