From 7ef282a2b288f1c3be57847fabeaadea47d65e06 Mon Sep 17 00:00:00 2001
From: Jun Tian
Date: Mon, 19 Feb 2024 22:28:15 +0800
Subject: [PATCH] add FAQ

---
 README.md | 17 +++++++++++++++--
 1 file changed, 15 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 85f3a10f..b99bac5c 100644
--- a/README.md
+++ b/README.md
@@ -80,8 +80,8 @@ Once finished, the results will be displayed. You may find more details under th
 ## Related Work
 
 - [nuprl/MultiPL-E](https://github.com/nuprl/MultiPL-E/blob/main/prompts/humaneval-jl-transform.jsonl) contains Julia version prompts transformed from the original Python version [HumanEval](https://github.com/openai/human-eval). However, based on my limited Julia programming experience, the prompts are not that accurate and conventional.
-- [Julia-LLM-Leaderboard](https://github.com/svilupp/Julia-LLM-Leaderboard), which focused on practicality and simplicity.
-- [EvalPlus Leaderboard](https://evalplus.github.io/leaderboard.html)
+- [Julia-LLM-Leaderboard](https://github.com/svilupp/Julia-LLM-Leaderboard), which focuses on practicality and simplicity.
+- [EvalPlus Leaderboard](https://evalplus.github.io/leaderboard.html)
 
 ## Future Work
 
@@ -91,6 +91,19 @@ Once finished, the results will be displayed. You may find more details under th
 
 We're hiring! If you're interested in working on code LLM at [01.ai](https://01.ai/), please contact [yi@01.ai](mailto:yi@01.ai).
 
+
+## FAQ
+
+- [What are the differences compared to the original Python version?](https://github.com/01-ai/HumanEval.jl/discussions/1)
+- [What are the limitations of this project?](https://github.com/01-ai/HumanEval.jl/discussions/2)
+- [How do LLMs perform compared to humans?](https://github.com/01-ai/HumanEval.jl/discussions/3)
+- [How difficult is each problem?](https://github.com/01-ai/HumanEval.jl/discussions/4)
+- [Is GPT4 good enough?](https://github.com/01-ai/HumanEval.jl/discussions/5)
+- [How can we make this evaluation higher quality?](https://github.com/01-ai/HumanEval.jl/discussions/6)
+- [How should we measure hallucinations?](https://github.com/01-ai/HumanEval.jl/discussions/7)
+- [Are there any other metrics we should care about beyond pass@k?](https://github.com/01-ai/HumanEval.jl/discussions/8)
+- [Why does Yi-34B-Chat perform so poorly?](https://github.com/01-ai/HumanEval.jl/discussions/9)
+
 ## Acknowledgement
 
 - This project heavily relies on many features provided by [ReTestItems.jl](https://github.com/JuliaTesting/ReTestItems.jl). Great thanks to [Nick Robinson](https://github.com/nickrobinson251)'s help during the development.
\ No newline at end of file