| 5e632c53ac | temp save; temp hold on for rejection sample CoT; try direct RL with raw stepwise data first | 2025-06-30 10:29:00 +00:00 |
| 7f4fc8b05b | use --system for system prompt | 2025-06-26 22:46:09 +00:00 |
| ee08da12c0 | pass sample test web with custom orm | 2025-06-26 18:02:13 +00:00 |