How many tokens can the model take as text input? And what's the maximum image size it can process?