Training a vision model from scratch on iPod touch 4 images
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
I trained a DCGAN from scratch on pics taken with an iPod touch 4. I understand the scale needed to train a vision model from scratch, so I'm starting with just one object to take pics of. I took around 350 pics of a red Solo cup against different backgrounds, in different lighting conditions, etc. The pictures the model generates remind me of OpenAI's DALL-E from back in 2022. I'm gonna try to take around 5000 total; I wanna see if the model can pick up on specific sensor artifacts from the iPod's camera.
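For anyone wondering how a DCGAN gets from a noise vector to an image: the generator is a stack of transposed convolutions that roughly doubles the side length at each step. Here's a minimal sketch of that size arithmetic. The layer settings (kernel 4, stride 2, padding 1, ending at 64x64) are the commonly used DCGAN reference layout and are an assumption here, since the post doesn't say which settings were used. `conv_transpose_out`, `generator_sizes`, and `GENERATOR_LAYERS` are hypothetical names made up for illustration.

```python
# Sketch only: layer settings follow the commonly used DCGAN layout and are
# assumed, not taken from the post.

def conv_transpose_out(size, kernel, stride, padding):
    """Output side length of a transposed convolution
    (dilation 1, no output padding)."""
    return (size - 1) * stride - 2 * padding + kernel

# (kernel, stride, padding) per generator layer; assumed values.
GENERATOR_LAYERS = [
    (4, 1, 0),  # projects the 1x1 latent vector to a 4x4 feature map
    (4, 2, 1),  # each remaining layer doubles the side length
    (4, 2, 1),
    (4, 2, 1),
    (4, 2, 1),
]

def generator_sizes(layers, start=1):
    """Return the side length before the first layer and after every layer."""
    sizes = [start]
    for kernel, stride, padding in layers:
        sizes.append(conv_transpose_out(sizes[-1], kernel, stride, padding))
    return sizes

if __name__ == "__main__":
    print(generator_sizes(GENERATOR_LAYERS))
```

With these assumed settings the side length goes 1, 4, 8, 16, 32, 64. If the photos get downscaled to 64x64 for training, fine sensor-level detail may not survive the resize, so going bigger would probably mean adding more doubling layers.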