From a737b64a302790ab2098dc3e12d645b769151534 Mon Sep 17 00:00:00 2001 From: Shaw Date: Thu, 14 Nov 2024 21:01:59 +0800 Subject: [PATCH] Update README.md --- VAB-WebArena-Lite/README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/VAB-WebArena-Lite/README.md b/VAB-WebArena-Lite/README.md index 32b4e1b..bc1ef91 100644 --- a/VAB-WebArena-Lite/README.md +++ b/VAB-WebArena-Lite/README.md @@ -231,7 +231,7 @@ tmux bash wa_parallel_run_webrl_chat.sh ``` -### 🚨 Important: Refresh all websites before re-run another round of testing! +## 🚨 Important: Refresh all websites before re-run another round of testing! Since tasks in WebArena may involve changing status and database of websites (e.g., posting comments on Reddit), if websites are not all refreshed before another round of evaluation, the results would be problematic. Please remember to run following command (assume you are hosting WebArena websites on your own) to restart and refresh all website dockers to avoid potential contamination.