trace_synthesis/analysis/strategy_for_task.json
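yuyr a84d51a101 1. Add code and output for generating synthesized strategies with r1;
2. Add tasks;
3. Add the analysis part: summarize and categorize the strategies, then evaluate them.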
2025-04-17 17:40:15 +08:00

[{
  "task_id": 4,
  "sites": ["shopping_admin"],
  "intent": "What are the top-3 best-selling product in Jan 2023",
  "strategies": ["Navigate to Bestsellers Report", "Configure Report Parameters for January 2023", "Generate and Review Bestsellers Report"],
  "universal_strategies": ["Navigate To Page/Section", "Configure Parameters/Settings", "Execute Action/Process"],
  "universal_strategy_ids": ["US1", "US3", "US4"],
  "reference_answer_raw_annotation": "Impulse Duffle, Overnight Duffle, Hawkeye Yoga Short-32-Blue"
}]
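
The record above stores `strategies`, `universal_strategies`, and `universal_strategy_ids` as three lists of equal length. A minimal Python sketch of how a consumer might pair them up follows. It assumes the lists are aligned position by position, which the file itself does not state, and the helper name `align_strategies` is hypothetical. The record is inlined as a string so the sketch runs on its own; in practice it would be read from `strategy_for_task.json`.

```python
import json

# Record copied from strategy_for_task.json so the sketch is self-contained.
RECORD_JSON = """
[{
  "task_id": 4,
  "sites": ["shopping_admin"],
  "intent": "What are the top-3 best-selling product in Jan 2023",
  "strategies": ["Navigate to Bestsellers Report", "Configure Report Parameters for January 2023", "Generate and Review Bestsellers Report"],
  "universal_strategies": ["Navigate To Page/Section", "Configure Parameters/Settings", "Execute Action/Process"],
  "universal_strategy_ids": ["US1", "US3", "US4"],
  "reference_answer_raw_annotation": "Impulse Duffle, Overnight Duffle, Hawkeye Yoga Short-32-Blue"
}]
"""


def align_strategies(record):
    """Pair each task-specific strategy with a universal strategy and its ID.

    Assumption (not confirmed by the file): the three lists are aligned
    by index, so entry i of each list describes the same step.
    """
    strategies = record["strategies"]
    universal = record["universal_strategies"]
    ids = record["universal_strategy_ids"]
    if not (len(strategies) == len(universal) == len(ids)):
        raise ValueError("strategy lists differ in length; cannot align by index")
    return list(zip(ids, universal, strategies))


records = json.loads(RECORD_JSON)
aligned = align_strategies(records[0])
for us_id, universal_name, task_strategy in aligned:
    print(f"{us_id}: {universal_name} <- {task_strategy}")
```

The length check is there because index alignment is only an assumption; a record whose lists differ in length would raise instead of being silently truncated by `zip`.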