r/MachineLearning

Source code for LLMs. [D]


I was digging through Hugging Face’s Transformers repo and found
https://github.com/huggingface/transformers/blob/main/src/transformers/models/gpt_oss/modeling_gpt_oss.py

From what I can tell, this isn’t just boilerplate; it looks like a full implementation.
Is it actually the full code that gpt_oss is built on,
or is it a skeleton for experimentation?

Similarly, there are many other models under
https://github.com/huggingface/transformers/blob/main/src/transformers/models
Are they really the true open-source implementations?

If not, are the real implementations publicly available anywhere?

submitted by /u/PravalPattam12945RPG
