Wow 😮<br>You're awesome 😎</p>\n","updatedAt":"2025-12-04T19:09:31.677Z","author":{"_id":"5f1ba750cb8f993fa01f4678","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5f1ba750cb8f993fa01f4678/4-dAcvedO-tIxYJm6aLTL.jpeg","fullname":"Behrooz Azarkhalili","name":"ermiaazarkhalili","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":30,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.8897186517715454},"editors":["ermiaazarkhalili"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/5f1ba750cb8f993fa01f4678/4-dAcvedO-tIxYJm6aLTL.jpeg"],"reactions":[{"reaction":"🤗","users":["burtenshaw","taesiri","AUsername111","real-jiakai"],"count":4}],"isReport":false}},{"id":"6931f37e7ca3caa55a72881d","author":{"_id":"6659fd841b9c4fb5cda9b161","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/6659fd841b9c4fb5cda9b161/PZ79m3q9jL1MLK0VYa96e.png","fullname":"Dean Williams","name":"dinoamino","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false},"createdAt":"2025-12-04T20:47:58.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Is this still usable without a Pro account? Will it be able to output everything up to \"Submit the job to Hugging Face Jobs\"?","html":"<p>Is this still usable without a Pro account? Will it be able to output everything up to \"Submit the job to Hugging Face Jobs\"?</p>\n","updatedAt":"2025-12-04T20:47:58.716Z","author":{"_id":"6659fd841b9c4fb5cda9b161","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/6659fd841b9c4fb5cda9b161/PZ79m3q9jL1MLK0VYa96e.png","fullname":"Dean Williams","name":"dinoamino","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.852893590927124},"editors":["dinoamino"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/6659fd841b9c4fb5cda9b161/PZ79m3q9jL1MLK0VYa96e.png"],"reactions":[{"reaction":"👀","users":["josephgitau","Muratt03","TheRealOKAI","herocouple","Rohitchaudhary2213","cmz1024"],"count":6}],"isReport":false}},{"id":"69321145bdad9fd465de5dc4","author":{"_id":"5e67bdd61009063689407479","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1583857146757-5e67bdd61009063689407479.jpeg","fullname":"Clem 🤗","name":"clem","type":"user","isPro":true,"isHf":true,"isHfAdmin":false,"isMod":false,"followerCount":2994,"isUserFollowing":false,"primaryOrg":{"avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1583856921041-5dd96eb166059660ed1ee413.png","fullname":"Hugging Face","name":"huggingface","type":"org","isHf":true,"details":"The AI community building the future.","plan":"team"}},"createdAt":"2025-12-04T22:55:01.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"So cool!","html":"<p>So cool!</p>\n","updatedAt":"2025-12-04T22:55:01.375Z","author":{"_id":"5e67bdd61009063689407479","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1583857146757-5e67bdd61009063689407479.jpeg","fullname":"Clem 🤗","name":"clem","type":"user","isPro":true,"isHf":true,"isHfAdmin":false,"isMod":false,"followerCount":2994,"isUserFollowing":false,"primaryOrg":{"avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1583856921041-5dd96eb166059660ed1ee413.png","fullname":"Hugging Face","name":"huggingface","type":"org","isHf":true,"details":"The AI community building the future.","plan":"team"}}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.6243783235549927},"editors":["clem"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/1583857146757-5e67bdd61009063689407479.jpeg"],"reactions":[{"reaction":"❤️","users":["evalstate"],"count":1}],"isReport":false}},{"id":"693271693a8b37d03cde5904","author":{"_id":"67c6a533c0b62d612c530e33","avatarUrl":"/avatars/82209727124385e34cc4eb72a902ccc8.svg","fullname":"Kyle Moore","name":"kylechristophermoore","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false},"createdAt":"2025-12-05T05:45:13.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Is there data privacy when doing this?\n\nIs it posted privately to a personal/team hub? \n\nCould this be done locally without the push to the repo?","html":"<p>Is there data privacy when doing this?</p>\n<p>Is it posted privately to a personal/team hub? </p>\n<p>Could this be done locally without the push to the repo?</p>\n","updatedAt":"2025-12-05T05:45:13.177Z","author":{"_id":"67c6a533c0b62d612c530e33","avatarUrl":"/avatars/82209727124385e34cc4eb72a902ccc8.svg","fullname":"Kyle Moore","name":"kylechristophermoore","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.9066386818885803},"editors":["kylechristophermoore"],"editorAvatarUrls":["/avatars/82209727124385e34cc4eb72a902ccc8.svg"],"reactions":[{"reaction":"👍","users":["Doctor-Chad-PhD","arpieb","merercalavera","TheRealOKAI","imace","Javadex"],"count":6}],"isReport":false}},{"id":"693287e3ccb25bf360f77989","author":{"_id":"63e979e9dd2c4effdd6a43ba","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/63e979e9dd2c4effdd6a43ba/UaB8UVPwGO9KLjCe0yZC0.png","fullname":"Yuki Arimo","name":"yukiarimo","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":91,"isUserFollowing":false},"createdAt":"2025-12-05T07:21:07.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Another agentic way of wasting tokens","html":"<p>Another agentic way of wasting tokens</p>\n","updatedAt":"2025-12-05T07:21:07.101Z","author":{"_id":"63e979e9dd2c4effdd6a43ba","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/63e979e9dd2c4effdd6a43ba/UaB8UVPwGO9KLjCe0yZC0.png","fullname":"Yuki Arimo","name":"yukiarimo","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":91,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.6542713046073914},"editors":["yukiarimo"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/63e979e9dd2c4effdd6a43ba/UaB8UVPwGO9KLjCe0yZC0.png"],"reactions":[{"reaction":"👍","users":["franco334578","TheRealOKAI","real-jiakai"],"count":3}],"isReport":false}},{"id":"6932b49aff4db1f36d8f9793","author":{"_id":"64f187a2cc1c03340ac30498","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64f187a2cc1c03340ac30498/dMTUFA5Ul35v595JPKCMw.jpeg","fullname":"Jun Zhang","name":"jzhang533","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":55,"isUserFollowing":false,"primaryOrg":{"avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64f187a2cc1c03340ac30498/TYYUxK8xD1AxExFMWqbZD.png","fullname":"BAIDU","name":"baidu","type":"org","isHf":false,"plan":"team"}},"createdAt":"2025-12-05T10:31:54.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"is it possible to use this inside vscode's copilot extension ?","html":"<p>is it possible to use this inside vscode's copilot extension ?</p>\n","updatedAt":"2025-12-05T10:31:54.157Z","author":{"_id":"64f187a2cc1c03340ac30498","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64f187a2cc1c03340ac30498/dMTUFA5Ul35v595JPKCMw.jpeg","fullname":"Jun Zhang","name":"jzhang533","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":55,"isUserFollowing":false,"primaryOrg":{"avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/64f187a2cc1c03340ac30498/TYYUxK8xD1AxExFMWqbZD.png","fullname":"BAIDU","name":"baidu","type":"org","isHf":false,"plan":"team"}}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.8038625717163086},"editors":["jzhang533"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/64f187a2cc1c03340ac30498/dMTUFA5Ul35v595JPKCMw.jpeg"],"reactions":[],"isReport":false}},{"id":"693320d1a96be1367dbb3b6d","author":{"_id":"67f00bf17530c3fccbb26c79","avatarUrl":"/avatars/f0d56f04b1def33dce872a8de71f560d.svg","fullname":"Anton Protopopov","name":"aprotopopov","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":2,"isUserFollowing":false},"createdAt":"2025-12-05T18:13:37.000Z","type":"comment","data":{"edited":true,"hidden":false,"latest":{"raw":"Skill documentation is not available at the provided link - https://github.com/huggingface/skills/blob/main/hf-llm-trainer/SKILL.md","html":"<p>Skill documentation is not available at the provided link - <a href=\"https://github.com/huggingface/skills/blob/main/hf-llm-trainer/SKILL.md\" rel=\"nofollow\">https://github.com/huggingface/skills/blob/main/hf-llm-trainer/SKILL.md</a></p>\n","updatedAt":"2025-12-05T18:14:22.243Z","author":{"_id":"67f00bf17530c3fccbb26c79","avatarUrl":"/avatars/f0d56f04b1def33dce872a8de71f560d.svg","fullname":"Anton Protopopov","name":"aprotopopov","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":2,"isUserFollowing":false}},"numEdits":1,"identifiedLanguage":{"language":"en","probability":0.6841225624084473},"editors":["aprotopopov"],"editorAvatarUrls":["/avatars/f0d56f04b1def33dce872a8de71f560d.svg"],"reactions":[],"isReport":false},"replies":[{"id":"69332fad7326616c82b07e07","author":{"_id":"6319b36409baf858241f0f89","avatarUrl":"/avatars/909635453bf62a2a7118a01dd51b811c.svg","fullname":"shaun smith","name":"evalstate","type":"user","isPro":true,"isHf":true,"isHfAdmin":false,"isMod":false,"followerCount":337,"isUserFollowing":false,"primaryOrg":{"avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1583856921041-5dd96eb166059660ed1ee413.png","fullname":"Hugging Face","name":"huggingface","type":"org","isHf":true,"details":"The AI community building the future.","plan":"team"}},"createdAt":"2025-12-05T19:17:01.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Ah, we moved a couple of bits around in the repo -- link for that is here: https://github.com/huggingface/skills/blob/main/hf-llm-trainer/skills/model-trainer/SKILL.md -- I'll update the article 👍.","html":"<p>Ah, we moved a couple of bits around in the repo -- link for that is here: <a href=\"https://github.com/huggingface/skills/blob/main/hf-llm-trainer/skills/model-trainer/SKILL.md\" rel=\"nofollow\">https://github.com/huggingface/skills/blob/main/hf-llm-trainer/skills/model-trainer/SKILL.md</a> -- I'll update the article 👍.</p>\n","updatedAt":"2025-12-05T19:17:01.696Z","author":{"_id":"6319b36409baf858241f0f89","avatarUrl":"/avatars/909635453bf62a2a7118a01dd51b811c.svg","fullname":"shaun smith","name":"evalstate","type":"user","isPro":true,"isHf":true,"isHfAdmin":false,"isMod":false,"followerCount":337,"isUserFollowing":false,"primaryOrg":{"avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1583856921041-5dd96eb166059660ed1ee413.png","fullname":"Hugging Face","name":"huggingface","type":"org","isHf":true,"details":"The AI community building the future.","plan":"team"}}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.8375345468521118},"editors":["evalstate"],"editorAvatarUrls":["/avatars/909635453bf62a2a7118a01dd51b811c.svg"],"reactions":[],"isReport":false,"parentCommentId":"693320d1a96be1367dbb3b6d"}}]},{"id":"6934801c7b4e69f34bd6c878","author":{"_id":"68092d1b2c91d31e3912264a","avatarUrl":"/avatars/3b09fdad9e2cbd7ad54fb276c94445cf.svg","fullname":"Mike Ehrmantraut","name":"AUsername111","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":12,"isUserFollowing":false},"createdAt":"2025-12-06T19:12:28.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"This is so cool. Many thanks.","html":"<p>This is so cool. Many thanks.</p>\n","updatedAt":"2025-12-06T19:12:28.579Z","author":{"_id":"68092d1b2c91d31e3912264a","avatarUrl":"/avatars/3b09fdad9e2cbd7ad54fb276c94445cf.svg","fullname":"Mike Ehrmantraut","name":"AUsername111","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":12,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.9620528221130371},"editors":["AUsername111"],"editorAvatarUrls":["/avatars/3b09fdad9e2cbd7ad54fb276c94445cf.svg"],"reactions":[{"reaction":"❤️","users":["evalstate"],"count":1}],"isReport":false}},{"id":"693661bb7b4e69f34bd6c8ae","author":{"_id":"6932862502baca9ccdd4665d","avatarUrl":"/avatars/ca176894b2f946a3f371252248224246.svg","fullname":"Roman Gardner","name":"Roman1902","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false},"createdAt":"2025-12-08T05:27:23.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"\"Really fascinating read! I found the explanation of Hugging Face’s “Skills Training” initiative — how it lets you use a coding‑agent (like Claude Code or other supported agents) to fine‑tune large language models, submit GPU jobs, monitor progress and push trained models to the Hub — particularly eye‑opening. The combination of high‑level instructions, hardware selection, monitoring, and automation makes the complex process of model training much more approachable, even for developers who may not be ML‑infrastructure experts. \n\nI also recently read a related guide: https://mobisoftinfotech.com/resources/blog/ai‑development/llm‑api‑pricing‑guide \n — which gives practical advice on LLM API usage, token‑based pricing, and how to plan costs when working with LLMs.\n\nPutting your article’s look into empowering accessible LLM fine‑tuning together with the cost‑management strategies from that guide gives a well‑rounded perspective: it helps developers understand not just what is possible now with modern tools, but also how to build and deploy responsibly, balancing capability and cost.\"","html":"<p>\"Really fascinating read! I found the explanation of Hugging Face’s “Skills Training” initiative — how it lets you use a coding‑agent (like Claude Code or other supported agents) to fine‑tune large language models, submit GPU jobs, monitor progress and push trained models to the Hub — particularly eye‑opening. The combination of high‑level instructions, hardware selection, monitoring, and automation makes the complex process of model training much more approachable, even for developers who may not be ML‑infrastructure experts. </p>\n<p>I also recently read a related guide: <a href=\"https://mobisoftinfotech.com/resources/blog/ai%E2%80%91development/llm%E2%80%91api%E2%80%91pricing%E2%80%91guide\" rel=\"nofollow\">https://mobisoftinfotech.com/resources/blog/ai‑development/llm‑api‑pricing‑guide</a><br> — which gives practical advice on LLM API usage, token‑based pricing, and how to plan costs when working with LLMs.</p>\n<p>Putting your article’s look into empowering accessible LLM fine‑tuning together with the cost‑management strategies from that guide gives a well‑rounded perspective: it helps developers understand not just what is possible now with modern tools, but also how to build and deploy responsibly, balancing capability and cost.\"</p>\n","updatedAt":"2025-12-08T05:27:23.785Z","author":{"_id":"6932862502baca9ccdd4665d","avatarUrl":"/avatars/ca176894b2f946a3f371252248224246.svg","fullname":"Roman Gardner","name":"Roman1902","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.8867176175117493},"editors":["Roman1902"],"editorAvatarUrls":["/avatars/ca176894b2f946a3f371252248224246.svg"],"reactions":[],"isReport":false},"replies":[{"id":"6937a65cb8f3ce7a697f0415","author":{"_id":"643fd365e44f30a723213d32","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/qsYMVidL2s7CfqN_3stHW.png","fullname":"Daniel Omusula","name":"DanteWu","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false},"createdAt":"2025-12-09T04:32:28.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Slop alert","html":"<p>Slop alert</p>\n","updatedAt":"2025-12-09T04:32:28.364Z","author":{"_id":"643fd365e44f30a723213d32","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/qsYMVidL2s7CfqN_3stHW.png","fullname":"Daniel Omusula","name":"DanteWu","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.37732750177383423},"editors":["DanteWu"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/qsYMVidL2s7CfqN_3stHW.png"],"reactions":[],"isReport":false,"parentCommentId":"693661bb7b4e69f34bd6c8ae"}}]},{"id":"69369152d78c2090cef4a862","author":{"_id":"67e4339361b84dee66bbf79f","avatarUrl":"/avatars/d48bbf1fef37b3b155f5e516c69bc827.svg","fullname":"Julien Jouganous","name":"julienjouganous","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false},"createdAt":"2025-12-08T08:50:26.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Great work and great article!\nRegarding the maximum models size we can train using this approach, at the beginning of the article it's mentioned \"models from 0.5B to 70B parameters\" but at the end you write that \"For large models (7B+), this HF skills job is not suitable\", which order of magnitude is correct?\nI suspect the max range is 7B, if it's the case, do you plan to support training of larger models?\nThanks!","html":"<p>Great work and great article!<br>Regarding the maximum models size we can train using this approach, at the beginning of the article it's mentioned \"models from 0.5B to 70B parameters\" but at the end you write that \"For large models (7B+), this HF skills job is not suitable\", which order of magnitude is correct?<br>I suspect the max range is 7B, if it's the case, do you plan to support training of larger models?<br>Thanks!</p>\n","updatedAt":"2025-12-08T08:50:26.027Z","author":{"_id":"67e4339361b84dee66bbf79f","avatarUrl":"/avatars/d48bbf1fef37b3b155f5e516c69bc827.svg","fullname":"Julien Jouganous","name":"julienjouganous","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.9466469287872314},"editors":["julienjouganous"],"editorAvatarUrls":["/avatars/d48bbf1fef37b3b155f5e516c69bc827.svg"],"reactions":[],"isReport":false}},{"id":"6937932f6290efe69fb7173e","author":{"_id":"638b66745d81d551ab44df52","avatarUrl":"/avatars/2e74d42f73fa197f2a79d39a8842b0cd.svg","fullname":"DAMIENELSON","name":"DAMIENE","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false},"createdAt":"2025-12-09T03:10:39.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"is the trained model now open source and / or available to the public?","html":"<p>is the trained model now open source and / or available to the public?</p>\n","updatedAt":"2025-12-09T03:10:39.665Z","author":{"_id":"638b66745d81d551ab44df52","avatarUrl":"/avatars/2e74d42f73fa197f2a79d39a8842b0cd.svg","fullname":"DAMIENELSON","name":"DAMIENE","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.9463943839073181},"editors":["DAMIENE"],"editorAvatarUrls":["/avatars/2e74d42f73fa197f2a79d39a8842b0cd.svg"],"reactions":[],"isReport":false}},{"id":"693a796c693e8158df69033e","author":{"_id":"64169a99bce2fed80ab86122","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1679202958868-noauth.jpeg","fullname":"Sigrid Jin","name":"sigridjineth","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":164,"isUserFollowing":false},"createdAt":"2025-12-11T07:57:32.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"https://huggingface.co/blog/sionic-ai/claude-code-skills-training\n\nNice work about the demo getting Claude Code to fine-tune an open LLM. But the researchers from Sionic AI already do most of their work with Claude Code. It writes training scripts, debugs CUDA errors, searches hyperparameters overnight. For the actual work of building models, Claude has become the default partner. But there was one thing it couldn't do - remember what the teammates learned last week.\n\nCheck how we do here :D","html":"<p><a href=\"https://huggingface.co/blog/sionic-ai/claude-code-skills-training\">https://huggingface.co/blog/sionic-ai/claude-code-skills-training</a></p>\n<p>Nice work about the demo getting Claude Code to fine-tune an open LLM. But the researchers from Sionic AI already do most of their work with Claude Code. It writes training scripts, debugs CUDA errors, searches hyperparameters overnight. For the actual work of building models, Claude has become the default partner. But there was one thing it couldn't do - remember what the teammates learned last week.</p>\n<p>Check how we do here :D</p>\n","updatedAt":"2025-12-11T07:57:32.040Z","author":{"_id":"64169a99bce2fed80ab86122","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/1679202958868-noauth.jpeg","fullname":"Sigrid Jin","name":"sigridjineth","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":164,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.9003725051879883},"editors":["sigridjineth"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/1679202958868-noauth.jpeg"],"reactions":[],"isReport":false},"replies":[{"id":"695e8e432d9cf1829bd7026b","author":{"_id":"695d3df489b85dc68f206309","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/kJYw9Ts14b1MrcNlX8cv3.png","fullname":"go go","name":"cveavy","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":14,"isUserFollowing":false,"primaryOrg":{"avatarUrl":"https://www.gravatar.com/avatar/c50c76459362c26ca625f024fb5c1950?d=retro&size=100","fullname":"Futurepath Solutions","name":"futurepathsolutions","type":"org","isHf":false,"details":"Passionate about advancing the frontiers of artificial intelligence through research in large language models, multi-modal architectures, and efficient training methodologies. Particularly interested in model alignment, reasoning capabilities, and the intersection of NLP with computer vision.\r\n","plan":"team"}},"createdAt":"2026-01-07T16:48:03.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Right","html":"<p>Right</p>\n","updatedAt":"2026-01-07T16:48:03.609Z","author":{"_id":"695d3df489b85dc68f206309","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/kJYw9Ts14b1MrcNlX8cv3.png","fullname":"go go","name":"cveavy","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":14,"isUserFollowing":false,"primaryOrg":{"avatarUrl":"https://www.gravatar.com/avatar/c50c76459362c26ca625f024fb5c1950?d=retro&size=100","fullname":"Futurepath Solutions","name":"futurepathsolutions","type":"org","isHf":false,"details":"Passionate about advancing the frontiers of artificial intelligence through research in large language models, multi-modal architectures, and efficient training methodologies. Particularly interested in model alignment, reasoning capabilities, and the intersection of NLP with computer vision.\r\n","plan":"team"}}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.36209210753440857},"editors":["cveavy"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/kJYw9Ts14b1MrcNlX8cv3.png"],"reactions":[],"isReport":false,"parentCommentId":"693a796c693e8158df69033e"}}]},{"id":"693b6d2b4db5ca8e59e9a716","author":{"_id":"68aba5d3d466d2506c935465","avatarUrl":"/avatars/45986a2f84b844e06250fe416681a52c.svg","fullname":"Deep","name":"illiliiiiil","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":1,"isUserFollowing":false},"createdAt":"2025-12-12T01:17:31.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"Is it possible to use it even in privately uploaded datasets?","html":"<p>Is it possible to use it even in privately uploaded datasets?</p>\n","updatedAt":"2025-12-12T01:17:31.985Z","author":{"_id":"68aba5d3d466d2506c935465","avatarUrl":"/avatars/45986a2f84b844e06250fe416681a52c.svg","fullname":"Deep","name":"illiliiiiil","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":1,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.9924927949905396},"editors":["illiliiiiil"],"editorAvatarUrls":["/avatars/45986a2f84b844e06250fe416681a52c.svg"],"reactions":[],"isReport":false}},{"id":"6944b178c6953b50365d3dec","author":{"_id":"66a18c2696a2ff2a7c4ba554","avatarUrl":"/avatars/ddc40046800db4fb8a9b780b0aec3b1e.svg","fullname":"Ed Dan","name":"Ed13210","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false},"createdAt":"2025-12-19T01:59:20.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"how many tokens will a session incur?","html":"<p>how many tokens will a session incur?</p>\n","updatedAt":"2025-12-19T01:59:20.972Z","author":{"_id":"66a18c2696a2ff2a7c4ba554","avatarUrl":"/avatars/ddc40046800db4fb8a9b780b0aec3b1e.svg","fullname":"Ed Dan","name":"Ed13210","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.7473177313804626},"editors":["Ed13210"],"editorAvatarUrls":["/avatars/ddc40046800db4fb8a9b780b0aec3b1e.svg"],"reactions":[],"isReport":false}},{"id":"695e8dda3543fcff39fac85b","author":{"_id":"695d3df489b85dc68f206309","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/kJYw9Ts14b1MrcNlX8cv3.png","fullname":"go go","name":"cveavy","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":14,"isUserFollowing":false,"primaryOrg":{"avatarUrl":"https://www.gravatar.com/avatar/c50c76459362c26ca625f024fb5c1950?d=retro&size=100","fullname":"Futurepath Solutions","name":"futurepathsolutions","type":"org","isHf":false,"details":"Passionate about advancing the frontiers of artificial intelligence through research in large language models, multi-modal architectures, and efficient training methodologies. Particularly interested in model alignment, reasoning capabilities, and the intersection of NLP with computer vision.\r\n","plan":"team"}},"createdAt":"2026-01-07T16:46:18.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"This is genuinely game-changing for AI teams working with limited MLOps resources. Having Claude automatically handle hardware selection, job orchestration, and monitoring removes so much friction from the fine-tuning process - I've seen too many projects stall because teams get bogged down in the infrastructure complexity rather than focusing on model performance. The business impact here is huge: instead of needing dedicated DevOps engineers to manage training pipelines, data scientists can now iterate much faster on custom models. The fact that it supports the full production stack (SFT, DPO, RLHF) means you're not just prototyping but actually building deployment-ready models. What really excites me is the cost optimization angle - automatic hardware matching means you're not overpaying for compute while still getting reasonable training times. The multi-stage pipeline support is particularly valuable for enterprise use cases where you need that SFT → DPO → RLHF workflow for safety and alignment. This could democratize custom model development for mid-market companies who previously couldn't justify the engineering overhead. Looking forward to testing this on some internal projects where we've been manually managing these workflows.","html":"<p>This is genuinely game-changing for AI teams working with limited MLOps resources. Having Claude automatically handle hardware selection, job orchestration, and monitoring removes so much friction from the fine-tuning process - I've seen too many projects stall because teams get bogged down in the infrastructure complexity rather than focusing on model performance. The business impact here is huge: instead of needing dedicated DevOps engineers to manage training pipelines, data scientists can now iterate much faster on custom models. The fact that it supports the full production stack (SFT, DPO, RLHF) means you're not just prototyping but actually building deployment-ready models. What really excites me is the cost optimization angle - automatic hardware matching means you're not overpaying for compute while still getting reasonable training times. The multi-stage pipeline support is particularly valuable for enterprise use cases where you need that SFT → DPO → RLHF workflow for safety and alignment. This could democratize custom model development for mid-market companies who previously couldn't justify the engineering overhead. Looking forward to testing this on some internal projects where we've been manually managing these workflows.</p>\n","updatedAt":"2026-01-07T16:46:18.224Z","author":{"_id":"695d3df489b85dc68f206309","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/kJYw9Ts14b1MrcNlX8cv3.png","fullname":"go go","name":"cveavy","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":14,"isUserFollowing":false,"primaryOrg":{"avatarUrl":"https://www.gravatar.com/avatar/c50c76459362c26ca625f024fb5c1950?d=retro&size=100","fullname":"Futurepath Solutions","name":"futurepathsolutions","type":"org","isHf":false,"details":"Passionate about advancing the frontiers of artificial intelligence through research in large language models, multi-modal architectures, and efficient training methodologies. Particularly interested in model alignment, reasoning capabilities, and the intersection of NLP with computer vision.\r\n","plan":"team"}}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.9416612386703491},"editors":["cveavy"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/kJYw9Ts14b1MrcNlX8cv3.png"],"reactions":[],"isReport":false}},{"id":"69622c8b5d4f5276ab3cef27","author":{"_id":"68823d50ca5db489fd00d58b","avatarUrl":"/avatars/822c4cf4f7f3a0b464924457f2e051c4.svg","fullname":"Akili","name":"akiliaiafrica","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false},"createdAt":"2026-01-10T10:40:11.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"I think this document needs to be updated. The skills name has changed based on what I see on the huggingface github repo. ","html":"<p>I think this document needs to be updated. The skills name has changed based on what I see on the huggingface github repo. </p>\n","updatedAt":"2026-01-10T10:40:11.295Z","author":{"_id":"68823d50ca5db489fd00d58b","avatarUrl":"/avatars/822c4cf4f7f3a0b464924457f2e051c4.svg","fullname":"Akili","name":"akiliaiafrica","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"isUserFollowing":false}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.9689856767654419},"editors":["akiliaiafrica"],"editorAvatarUrls":["/avatars/822c4cf4f7f3a0b464924457f2e051c4.svg"],"reactions":[],"isReport":false},"replies":[{"id":"6964d1ddaa865b63109b575c","author":{"_id":"695d3df489b85dc68f206309","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/kJYw9Ts14b1MrcNlX8cv3.png","fullname":"go go","name":"cveavy","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":14,"isUserFollowing":false,"primaryOrg":{"avatarUrl":"https://www.gravatar.com/avatar/c50c76459362c26ca625f024fb5c1950?d=retro&size=100","fullname":"Futurepath Solutions","name":"futurepathsolutions","type":"org","isHf":false,"details":"Passionate about advancing the frontiers of artificial intelligence through research in large language models, multi-modal architectures, and efficient training methodologies. Particularly interested in model alignment, reasoning capabilities, and the intersection of NLP with computer vision.\r\n","plan":"team"}},"createdAt":"2026-01-12T10:50:05.000Z","type":"comment","data":{"edited":false,"hidden":false,"latest":{"raw":"correct","html":"<p>correct</p>\n","updatedAt":"2026-01-12T10:50:05.552Z","author":{"_id":"695d3df489b85dc68f206309","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/kJYw9Ts14b1MrcNlX8cv3.png","fullname":"go go","name":"cveavy","type":"user","isPro":true,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":14,"isUserFollowing":false,"primaryOrg":{"avatarUrl":"https://www.gravatar.com/avatar/c50c76459362c26ca625f024fb5c1950?d=retro&size=100","fullname":"Futurepath Solutions","name":"futurepathsolutions","type":"org","isHf":false,"details":"Passionate about advancing the frontiers of artificial intelligence through research in large language models, multi-modal architectures, and efficient training methodologies. Particularly interested in model alignment, reasoning capabilities, and the intersection of NLP with computer vision.\r\n","plan":"team"}}},"numEdits":0,"identifiedLanguage":{"language":"en","probability":0.6915013790130615},"editors":["cveavy"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/kJYw9Ts14b1MrcNlX8cv3.png"],"reactions":[],"isReport":false,"parentCommentId":"69622c8b5d4f5276ab3cef27"}}]}],"status":"open","isReport":false,"pinned":false,"locked":false,"collection":"community_blogs"},"contextAuthors":["burtenshaw","evalstate"],"primaryEmailConfirmed":false,"discussionRole":0,"acceptLanguages":["en"],"withThread":true,"cardDisplay":false,"repoDiscussionsLocked":false}">
Is this still usable without a Pro account? Will it be able to output everything up to "Submit the job to Hugging Face Jobs"?
Is there data privacy when doing this?
Is it posted privately to a personal/team hub?
Could this be done locally without the push to the repo?
Another agentic way of wasting tokens
is it possible to use this inside vscode's copilot extension ?
This is so cool. Many thanks.
"Really fascinating read! I found the explanation of Hugging Face’s “Skills Training” initiative — how it lets you use a coding‑agent (like Claude Code or other supported agents) to fine‑tune large language models, submit GPU jobs, monitor progress and push trained models to the Hub — particularly eye‑opening. The combination of high‑level instructions, hardware selection, monitoring, and automation makes the complex process of model training much more approachable, even for developers who may not be ML‑infrastructure experts.
I also recently read a related guide: https://mobisoftinfotech.com/resources/blog/ai‑development/llm‑api‑pricing‑guide
— which gives practical advice on LLM API usage, token‑based pricing, and how to plan costs when working with LLMs.
Putting your article’s look into empowering accessible LLM fine‑tuning together with the cost‑management strategies from that guide gives a well‑rounded perspective: it helps developers understand not just what is possible now with modern tools, but also how to build and deploy responsibly, balancing capability and cost."
Great work and great article!
Regarding the maximum models size we can train using this approach, at the beginning of the article it's mentioned "models from 0.5B to 70B parameters" but at the end you write that "For large models (7B+), this HF skills job is not suitable", which order of magnitude is correct?
I suspect the max range is 7B, if it's the case, do you plan to support training of larger models?
Thanks!
is the trained model now open source and / or available to the public?
https://huggingface.co/blog/sionic-ai/claude-code-skills-training
Nice work about the demo getting Claude Code to fine-tune an open LLM. But the researchers from Sionic AI already do most of their work with Claude Code. It writes training scripts, debugs CUDA errors, searches hyperparameters overnight. For the actual work of building models, Claude has become the default partner. But there was one thing it couldn't do - remember what the teammates learned last week.
Check how we do here :D
Is it possible to use it even in privately uploaded datasets?
how many tokens will a session incur?
This is genuinely game-changing for AI teams working with limited MLOps resources. Having Claude automatically handle hardware selection, job orchestration, and monitoring removes so much friction from the fine-tuning process - I've seen too many projects stall because teams get bogged down in the infrastructure complexity rather than focusing on model performance. The business impact here is huge: instead of needing dedicated DevOps engineers to manage training pipelines, data scientists can now iterate much faster on custom models. The fact that it supports the full production stack (SFT, DPO, RLHF) means you're not just prototyping but actually building deployment-ready models. What really excites me is the cost optimization angle - automatic hardware matching means you're not overpaying for compute while still getting reasonable training times. The multi-stage pipeline support is particularly valuable for enterprise use cases where you need that SFT → DPO → RLHF workflow for safety and alignment. This could democratize custom model development for mid-market companies who previously couldn't justify the engineering overhead. Looking forward to testing this on some internal projects where we've been manually managing these workflows.
I think this document needs to be updated. The skills name has changed based on what I see on the huggingface github repo.
Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.
Tap or paste here to upload images
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.