
Problem with the code: Spearman's and Kendall's coefficients on TVSum come out as 0.5849 and 0.6403 #5

Open
sunguoquan1005 opened this issue Jun 9, 2021 · 5 comments

Comments

@sunguoquan1005

I can't reproduce the results. When I run the code, the Spearman's and Kendall's coefficients on TVSum are 0.5849 and 0.6403 respectively, which are much higher than the reported results.

@Junaid112
Collaborator

Junaid112 commented Jun 15, 2021

Did you take the average over all k-folds, or are these numbers for only one 80-20 split? Averaging over all validation parts should bring the values close to the original numbers.

@sunguoquan1005
Author

I took the average over all folds, but I still get that result.

@mpalaourg

Hello, first of all thank you for your contribution to video summarization research and for making your work open-source.

I am also trying to compute the correlation coefficients myself, and I am stuck in a weird loop. @sunguoquan1005, I get the result reported in the paper if I first take the mean of the user summaries (so that there is only one user/ground-truth summary per video), then compute the coefficients (ρ and τ) for each video, take the mean over videos to get ρ and τ for each split, and finally take the mean over the splits.

I get your result if I skip that first step of averaging the user summaries (so I keep all N user/ground-truth summaries): compute the coefficients (ρ and τ) against each ground-truth summary, take the mean of those to get ρ and τ for each video, and so on.
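For concreteness, here is a minimal sketch of the two accumulation orders (hypothetical function names and array shapes, not the repository's actual evaluation code), assuming frame-level importance arrays:

```python
import numpy as np
from scipy import stats

def coeffs_mean_user_first(pred, user_scores):
    """Order that reproduces the paper's numbers (per my experiments above).
    pred: (num_frames,); user_scores: (num_users, num_frames)."""
    reference = user_scores.mean(axis=0)  # collapse annotators into one curve
    rho = stats.spearmanr(pred, reference).correlation
    tau = stats.kendalltau(pred, reference).correlation
    return rho, tau

def coeffs_per_user_then_mean(pred, user_scores):
    """Order that yields the inflated 0.5849/0.6403 numbers:
    correlate against each annotator separately, then average."""
    rhos = [stats.spearmanr(pred, u).correlation for u in user_scores]
    taus = [stats.kendalltau(pred, u).correlation for u in user_scores]
    return float(np.mean(rhos)), float(np.mean(taus))
```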

The weird thing is that the results are too good to be true! Reading the paper that introduced this evaluation protocol, the authors discuss how strongly the F1 value is tied to the use of the knapsack step. I think (and I would like your opinion on this) that the coefficients must not be computed on the (binary) user summaries produced by the knapsack, but rather on the (real-valued) user scores. That would mean this evaluation protocol is applicable only to TVSum and not to SumMe, which the original paper implicitly confirms by reporting the ρ and τ coefficients only for TVSum.
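As a purely synthetic toy example (random numbers, not the actual datasets) of why correlating against a tie-heavy binary summary behaves very differently from correlating against raw scores:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
raw = rng.random(200)                                  # stand-in for raw user scores
binary = (raw > np.quantile(raw, 0.85)).astype(float)  # knapsack-like ~15% summary
pred = raw + 0.3 * rng.standard_normal(200)            # noisy prediction

print(stats.kendalltau(pred, raw).correlation)     # tau against raw scores
print(stats.kendalltau(pred, binary).correlation)  # tau against binary summary (many ties)
```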

Sorry for the late response in this issue, but I only now found this discussion, and I would love to hear your opinion on the matter.

@xings19

xings19 commented Jan 26, 2022

I get that result, too. How should I modify the code in the project to achieve the results in the paper?

@Junaid112
Collaborator

Junaid112 commented Jan 27, 2022

> I get that result, too. How should I modify the code in the project to achieve the results in the paper?

In this paper, we follow the evaluation protocol of "Video Summarization with Long Short-term Memory" and the TVSum paper. There, the average is taken among users, and then we average over the k-folds. The weird part is that for SumMe, the maximum is taken among per-user scores before we average over the k-folds. If you use the averaging scheme I just described, and correlate the predictions with the original TVSum scores, you will get the reported results. I do not agree with the evaluation criterion where the max is taken for SumMe, but we can address this in future research by comparing both criteria.
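A minimal sketch of that accumulation for TVSum, with hypothetical function and variable names (the repository's real loaders and variables may differ):

```python
import numpy as np
from scipy import stats

def tvsum_video_coeffs(pred, user_scores):
    """pred: (num_frames,); user_scores: (num_users, num_frames),
    the raw TVSum annotations (not knapsack-binarized summaries)."""
    reference = user_scores.mean(axis=0)  # average among users first
    rho = stats.spearmanr(pred, reference).correlation
    tau = stats.kendalltau(pred, reference).correlation
    return rho, tau

def kfold_average(splits):
    """splits: one list per fold, each holding (rho, tau) per test video."""
    per_split = [np.mean(videos, axis=0) for videos in splits]  # mean within each fold
    return np.mean(per_split, axis=0)                           # then mean across the k folds
```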
