您即将离开知乎,请注意您的账号和财产安全。
https://unsloth.ai/docs/models/gpt-oss-how-to-run-and-fine-tune/gpt-oss-reinforcement-learning