
matrix multiplication order question #5

Open
tc64 opened this issue Apr 19, 2019 · 3 comments

Comments


tc64 commented Apr 19, 2019

Hi, thanks for sharing your code! I read your paper and am wondering about the matrix multiplication order for the backward loss correction approach.

The paper gives the backward-corrected loss as T^{-1} ℓ (the inverse of the noise matrix applied to the vector of losses).

In loss.robust, for backward, we have:

return -K.sum(K.dot(y_true, P_inv) * K.log(y_pred), axis=-1)

It looks to me like the order of matrix multiplication for P_inv and y_true should be switched. My guess is that I'm misunderstanding something, but would really appreciate if you could clarify.

Thanks!

giorgiop (Owner) commented May 2, 2019

Have you tried a simple example and checked what changes if you switch the order?
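
In case it helps later readers, here is one way to run that experiment in NumPy. The 3-class noise matrix and the labels are made up for illustration; the only assumption taken from the code is that y_true is stacked row-wise with shape (batch_size, num_classes):

```python
import numpy as np

# Hypothetical 3-class noise matrix (T in the paper) and its inverse.
P = np.array([[0.8, 0.1, 0.1],
              [0.2, 0.7, 0.1],
              [0.1, 0.2, 0.7]])
P_inv = np.linalg.inv(P)

# A batch of one-hot labels, shape (N, C) as in the Keras code.
y_true = np.array([[1.0, 0.0, 0.0],
                   [0.0, 1.0, 0.0]])

# As in the code: each row y is multiplied on the right, y @ P_inv.
row_form = y_true @ P_inv

# The switched order: P_inv applied to each label as a column vector.
col_form = (P_inv @ y_true.T).T  # equivalent to y_true @ P_inv.T

# The two orders differ unless P_inv happens to be symmetric.
print(np.allclose(row_form, col_form))  # False for this (non-symmetric) P
```

For a one-hot row y with a 1 in position i, y @ P_inv picks out the i-th row of P_inv, while the switched order picks out the i-th column, so the question reduces to whether rows or columns of P_inv are intended.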


rosefun commented Jun 10, 2019

Hi, I'm confused about the calculation of the forward loss. In your paper, the forward loss is given as follows:

[image: forward-corrected loss equation from the paper]

However, in the code it's calculated as:
return -K.sum(y_true * K.log(K.dot(y_pred, P)), axis=-1). Why not use P.T instead of P?


guixianjin commented Oct 21, 2019

Hi! @tc64 @rosefun @giorgiop
I think it may be because y_pred has shape N (batch_size) × C (class_num). That is, y_pred = [f(x1)^T; f(x2)^T; ...], where f(x1) (a column vector) is the classifier's prediction on example x1.
Thus (P.T f(x))^T = f(x)^T P, so the code uses K.dot(y_pred, P).

And presumably y_true also has shape N (batch_size) × C (class_num), so the backward loss uses K.dot(y_true, P_inv).
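
A quick NumPy check of that identity; the noise matrix and predictions below are made up for illustration:

```python
import numpy as np

# Hypothetical noise matrix P (T in the paper).
P = np.array([[0.8, 0.1, 0.1],
              [0.2, 0.7, 0.1],
              [0.1, 0.2, 0.7]])

# A batch of softmax predictions, row-stacked with shape (N, C).
y_pred = np.array([[0.6, 0.3, 0.1],
                   [0.2, 0.5, 0.3]])

# Paper form: apply P.T to each prediction f(x) as a column vector,
# then transpose back to a row.
paper_form = (P.T @ y_pred.T).T

# Code form: K.dot(y_pred, P) with row-stacked predictions.
code_form = y_pred @ P

print(np.allclose(paper_form, code_form))  # True: (P.T f)^T = f^T P
```

So with the row-major batch convention, K.dot(y_pred, P) already realizes the paper's P.T applied to column-vector predictions.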
