Source code for LLMs. [D]
Mirrored from r/MachineLearning for archival readability. Support the source by reading on the original site.
I was digging through Hugging Face’s Transformers repo and found
https://github.com/huggingface/transformers/blob/main/src/transformers/models/gpt_oss/modeling_gpt_oss.py
From what I can tell, this isn’t just boilerplate; it looks like a full implementation.
Is it actually the full code that gpt_oss is built on?
Or is it just a skeleton for experimentation?
Similarly, there are many other models under
https://github.com/huggingface/transformers/blob/main/src/transformers/models
Are they really the true open-source implementations?
If not, can we actually find the real ones publicly?
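For anyone who wants to look at the other models the same way, here is a minimal sketch that builds the link to a model's modeling file. It assumes the other model directories follow the same `<name>/modeling_<name>.py` layout as the gpt_oss link above, which I have not checked for every model. The helper name `modeling_file_url` is made up for illustration.

```python
# Base URL copied from the models directory link in the post.
BASE = "https://github.com/huggingface/transformers/blob/main/src/transformers/models"


def modeling_file_url(model_name: str) -> str:
    """Build the URL of a model's modeling file.

    Assumption: the file lives at <model_name>/modeling_<model_name>.py,
    as it does for gpt_oss. Some directories may not follow this pattern.
    """
    return f"{BASE}/{model_name}/modeling_{model_name}.py"


if __name__ == "__main__":
    print(modeling_file_url("gpt_oss"))
```

For `gpt_oss` this reproduces the link from the post; any other name is only a guess at where that model's file would be.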