Embedding space [D]
Mirrored from r/MachineLearning for archival readability. Support the source by reading on the original site.
Hello everyone,
I’m relatively new to this area of machine learning and currently experimenting with Variational Autoencoders (VAEs) to build an embedding space for an image dataset with images have different spatial dimensions, I cannot easily standardize them to a fixed size. My current approach uses adaptive pooling in the encoder to produce a fixed-dimensional latent representation, so the model can in principle handle variable input sizes.
However, now the results are quite poor so far, and the learned embedding does not seem meaningful or well-structured. I would really appreciate any advice, suggestions, or pointers on what might be going wrong or how to improve this setup.
[link] [comments]
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.