CLIP better than SigLIP?

by kexul - opened



@kexul It's actually not about that. The output activations are a bit suppressed due to the nature of the SigLIP model. They need to be normalized for a fair comparison, but I wanted to show them as is here.
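
To illustrate one possible reading of this: CLIP-style scores are usually produced with a softmax over the candidate labels, so they sum to 1, while SigLIP scores each image-text pair independently with a sigmoid, so the raw values can all be small. The sketch below uses made-up logits (not outputs from either model) and a simple sum-to-one rescaling, which is only an assumption about what "normalized" means here.

```python
import math

def softmax(logits):
    # Subtract the max for numerical stability, then normalize to sum to 1
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def sigmoid(logits):
    # Each score is computed independently; the scores need not sum to 1
    return [1.0 / (1.0 + math.exp(-x)) for x in logits]

# Hypothetical logits for three candidate labels (illustrative numbers only)
clip_style_logits = [24.0, 20.0, 18.0]
siglip_style_logits = [-1.0, -5.0, -7.0]

clip_probs = softmax(clip_style_logits)
siglip_probs = sigmoid(siglip_style_logits)

# Assumed normalization: rescale the sigmoid scores so they sum to 1
siglip_total = sum(siglip_probs)
siglip_normalized = [p / siglip_total for p in siglip_probs]

print("softmax scores:   ", [round(p, 4) for p in clip_probs])
print("sigmoid scores:   ", [round(p, 4) for p in siglip_probs])
print("sigmoid, rescaled:", [round(p, 4) for p in siglip_normalized])
```

The ranking of the labels is unchanged by the rescaling; only the scale of the scores becomes comparable to the softmax output.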

Thanks for your detailed explanation!

kexul changed discussion status to closed
