Does anyone have enough compute to make a distillation dataset out of GLM5.2?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Same as title. Some lucky ppl among us have massive amounts of compute and can run even GLM 5.2. Can someone plss make a BIG distillation dataset (eg 700k-1M examples) so that we can train smaller models like Qwen3.5 properly on it and have better models?
It would be amazing for the community.
[link] [comments]
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.