Multimodal Visual Question Answering with BLIP-2 and Jina
Summary
On their own, LLMs aren't great at working with anything beyond text. But now you can serve BLIP-2 with Jina and DocArray, enhancing LLMs with visual understanding.