Running
217
π¦
Track, rank and evaluate open LLMs and chatbots
Efficient quantized retrieval over Wikipedia
VLMEvalKit Evaluation Results Collection
Video captioning/tracking
In-browser speech recognition w/ word-level timestamps
Need to analyze data? Let a Llama-3.1 agent do it for you!
VLMEvalKit Eval Results in video understanding benchmark
remove background from any image