Like CLIP, trained to produce joint embeddings of text and images.
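As a rough illustration of what a joint embedding space allows, the sketch below matches a text vector against image vectors by cosine similarity. The vectors are made-up toy values, not outputs of any real model, and the helper names (`normalize`, `cosine_similarity`, `best_match`) are hypothetical.

```python
import math

def normalize(vec):
    # Scale a vector to unit length so dot products become cosine similarities.
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]

def cosine_similarity(a, b):
    # Cosine similarity between two vectors living in the same embedding space.
    a_n, b_n = normalize(a), normalize(b)
    return sum(x * y for x, y in zip(a_n, b_n))

def best_match(query, candidates):
    # Index of the candidate embedding most similar to the query embedding.
    scores = [cosine_similarity(query, c) for c in candidates]
    return max(range(len(scores)), key=scores.__getitem__)

# Assumed toy embeddings: in practice these would come from the model's
# text encoder and image encoder, which map into one shared space.
text_embeddings = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
image_embeddings = [[0.1, 0.9, 0.0], [0.8, 0.1, 0.1]]

matches = [best_match(t, image_embeddings) for t in text_embeddings]
```

Because both modalities share one space, the same comparison works in the other direction (image query, text candidates), which is what makes zero-shot retrieval and classification possible with this kind of model.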