Mr Ɲicolas MOUART<p>Gemma 3 4B it -QAT- q4 by Google is a pretty good multimodal model given its size and level of quantization. Even better it runs on CPU and is relatively fast. The description of the picture below was generated by it. </p><p>Great to learn new vocabulary!</p><p><a href="https://mastodon.social/tags/accessibility" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>accessibility</span></a> <a href="https://mastodon.social/tags/photography" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>photography</span></a> <a href="https://mastodon.social/tags/multimodal" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>multimodal</span></a> <a href="https://mastodon.social/tags/alttext" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>alttext</span></a> <a href="https://mastodon.social/tags/localLLM" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>localLLM</span></a> <a href="https://mastodon.social/tags/visualimpairment" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>visualimpairment</span></a></p>