Logo
Explore Help
Register Sign In
yuyr/swift_test
1
0
Fork 0
You've already forked swift_test
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
6 Commits 1 Branch 0 Tags
Commit Graph

6 Commits

This Branch
This Branch
All Branches
Author SHA1 Message Date
yuyr
9fc61ca82b configure for group < 10 trace grpo training 2025-07-01 09:50:29 +08:00
yuyr
4295f30f9a update direct stepwise train, pass trainning with mock one step trace duplicated 52 times; reward OK, curve improved as the step grows. 2025-06-30 15:06:38 +00:00
yuyr
5e632c53ac temp save; temp hold on for rejection sample CoT; try direct RL with raw stepwise data first 2025-06-30 10:29:00 +00:00
yuyr
7f4fc8b05b use --system for system prompt 2025-06-26 22:46:09 +00:00
yuyr
ee08da12c0 pass sample test web with custom orm 2025-06-26 18:02:13 +00:00
yuyr
8f168ecbef test swift for qwen3 math 2025-06-25 17:00:47 +08:00
Powered by Gitea Version: 1.23.8 Page: 34ms Template: 5ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API