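With the release of DeepSeek R1, anyone who wants to replicate R1 or apply RFT (Reinforcement Fine-Tuning) in a specific domain can refer to the list I have compiled, which I will keep updating. I will also add the results of my own experiments.
Last updated: 2025.1.29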
A more comprehensive list: https://github.com/AlpacaACE/o1-imitator
https://github.com/WangRongsheng/awesome-LLM-resourses?tab=readme-ov-file#open-o1
https://colab.research.google.com/drive/1bfhs1FMLW3FGa8ydvkOZyBNxLYOu0Hev?usp=sharing
nice