Hi, this is really cool work! Thanks for putting this together!
I have an LLM-based method that simply prompts an LLM zero-shot, and we show that it achieves strong forecasts (see Direct Prompt in https://arxiv.org/abs/2410.18959, method description on page 41). Specifically, we benchmarked on data beyond the LLM's cutoff date, so it is definitely not memorized.
Do you think adding this method to GIFT-Eval would be worthwhile? I'm on the fence because the test data in this benchmark comes from public sources, so there is no guarantee that LLMs haven't memorized it.
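For context, the idea is about as simple as it sounds; here is a minimal sketch of this kind of zero-shot direct prompting for forecasting. The prompt template, the parsing, and the `call_your_llm` helper are illustrative assumptions, not the exact setup from the paper:

```python
# Minimal sketch of zero-shot "direct prompt" forecasting with an LLM.
# Prompt format and parsing are illustrative, not the paper's exact template.

from typing import List

def build_forecast_prompt(history: List[float], horizon: int) -> str:
    """Serialize the observed history and ask the LLM to continue the series."""
    series = ", ".join(f"{x:.4f}" for x in history)
    return (
        "You are a time-series forecaster. Given the observed values below, "
        f"predict the next {horizon} values.\n"
        f"Observed: {series}\n"
        f"Answer with exactly {horizon} comma-separated numbers and nothing else."
    )

def parse_forecast(reply: str, horizon: int) -> List[float]:
    """Parse the model's comma-separated reply back into floats."""
    values = [float(tok) for tok in reply.replace("\n", " ").split(",") if tok.strip()]
    if len(values) != horizon:
        raise ValueError(f"expected {horizon} values, got {len(values)}")
    return values

# Usage (the LLM call itself is whatever chat API you have access to):
# prompt = build_forecast_prompt([112.0, 118.0, 132.0, 129.0], horizon=4)
# reply = call_your_llm(prompt)   # hypothetical helper wrapping any chat LLM
# forecast = parse_forecast(reply, horizon=4)
```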
Thanks for raising this. In my view, it is worthwhile to add a new LLM-based method. At the same time, I understand your concern about whether the data has been memorized by LLMs.
I am thinking that we could later add a column indicating whether the model may have been trained with data leakage (e.g., yes / no / potentially). What do you think about this?
Yes, I totally agree. So, for your model, could you please provide the all_results.csv file and the model config file, and open a PR here? I would love to help you get it on the leaderboard :)
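(For reference, the config file is just a small JSON blob describing the model; please check the repo's README for the exact required fields. The field names below are only a hypothetical illustration:)

```json
{
  "model": "DirectPrompt",
  "model_type": "LLM-based",
  "model_dtype": "float32"
}
```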