Support running multiple bash commands and compile in one query #736

DonggeLiu · 2024-11-30T02:22:03Z

No description provided.

DonggeLiu · 2024-11-30T02:22:16Z

/gcbrun exp -n dg -ag

DonggeLiu · 2024-12-02T22:03:29Z

Reports:
https://llm-exp.oss-fuzz.com/Result-reports/scheduled/2024-11-30-weekly-all-2/
https://llm-exp.oss-fuzz.com/Result-reports/scheduled/2024-12-02-weekly-all-3/

Part of the lower performance of all-3 is due to GKE timeouts.
As a result, 24 benchmarks still have the status 'Running', 12 of which have a build rate of 0.

oliverchang · 2024-12-05T00:18:31Z

agent/base_agent.py

@@ -63,6 +62,11 @@ def _parse_tag(self, response: str, tag: str) -> str:
    match = re.search(rf'<{tag}>(.*?)</{tag}>', response, re.DOTALL)
    return match.group(1).strip() if match else ''

+  def _parse_tags(self, response: str, tag: str) -> list[str]:
+    """Parses the XML-style tags from LLM response."""
+    matches = re.findall(rf'<{tag}>(.*?)</{tag}>', response, re.DOTALL)


For future work: Should we experiment with JSON formatting at some point? Let's create an issue if so.
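
Note on the hunk above: it is cut off after the `re.findall` call, so the return value of `_parse_tags` is not visible here. A minimal standalone sketch of how collecting every occurrence of a tag might work is shown below. It assumes the method strips each match the same way `_parse_tag` does. The function name `parse_tags` and the `bash` tag in the usage lines are illustrative and not taken from the PR.

```python
import re


def parse_tags(response: str, tag: str) -> list[str]:
  """Returns the stripped content of every <tag>...</tag> pair in response.

  Hypothetical standalone version of the method in the diff; the list
  comprehension below is an assumption, since the hunk is truncated.
  """
  # Non-greedy match so adjacent tag pairs are captured separately;
  # re.DOTALL lets a single command span multiple lines.
  matches = re.findall(rf'<{tag}>(.*?)</{tag}>', response, re.DOTALL)
  return [content.strip() for content in matches]


# Usage: one LLM response carrying two commands (tag name is an assumption).
response = '<bash>ls</bash> some text <bash>\npwd\n</bash>'
commands = parse_tags(response, 'bash')
```

With this shape, a response containing no matching tags yields an empty list rather than an empty string, which callers would need to handle.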

DonggeLiu · 2024-12-05T23:59:37Z

/gcbrun exp -n dg -ag

DonggeLiu · 2024-12-06T05:08:26Z

https://llm-exp.oss-fuzz.com/Result-reports/ofg-pr/2024-12-06-736-dg-comparison/index.html

DonggeLiu · 2024-12-13T03:39:49Z

/gcbrun exp -n dg -ag

DonggeLiu · 2024-12-13T03:41:29Z

/gcbrun exp -n dg -ag

DonggeLiu · 2024-12-15T23:17:29Z

The report looks good: 28/38
https://llm-exp.oss-fuzz.com/Result-reports/ofg-pr/2024-12-13-736-dg-comparison/index.html

I will remove the libdwarf benchmark set, which was used for testing only.

This reverts commit 33e22ff.

DonggeLiu · 2024-12-15T23:27:39Z

/gcbrun skip

DonggeLiu added 4 commits November 30, 2024 08:20

A longer delay for large exp

9a6527f

A new error to retry

391a114

Show the message has been truncated, lower input token upper bound

171e1a9

Multiple command and compile

18e8dc4

DonggeLiu requested review from oliverchang and mihaimaruseac December 4, 2024 00:14

mihaimaruseac approved these changes Dec 4, 2024


oliverchang approved these changes Dec 5, 2024


DonggeLiu mentioned this pull request Dec 5, 2024

Interact with LLM in JSON #743

Open

DonggeLiu added 2 commits December 6, 2024 10:30

Bug fix

827b0b9

Make line length consistent

4d4f11a

DonggeLiu added 2 commits December 13, 2024 14:19

Fix truncation logic

0f2667f

Testing the truncation logic on libdwarf

33e22ff

mihaimaruseac approved these changes Dec 13, 2024


Revert "Testing the truncation logic on libdwarf"

992729c

This reverts commit 33e22ff.

DonggeLiu merged commit f871736 into main Dec 15, 2024
5 checks passed

DonggeLiu deleted the multi-commands branch December 15, 2024 23:27
