r/LocalLLaMA · · 1 min read

Training a vision model from scratch on iPod touch 4 images

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

Training a vision model from scratch on iPod touch 4 images

I trained a DCGAN model from scratch on iPod touch 4 pics. I understand the scale needed to train a vision model from scratch so I’m starting with just 1 case/object to take pics of. I took around 350 pics of a red solo cup in different backgrounds, lighting conditions, etc. The pictures that the model generates reminds me of Open AI’s DALL E from back in 2022. I’m gonna try to take around 5000 total, I wanna see if the model can pick up on specific sensor artifacts from the iPods camera.

submitted by /u/Remarkable-Trick-177
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA