Jaekoo Kang
jkang
AI & ML interests
Anything fun and interesting
Organizations
Collections
5
-
MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning
Paper β’ 2309.07915 β’ Published β’ 4 -
Skywork: A More Open Bilingual Foundation Model
Paper β’ 2310.19341 β’ Published β’ 5 -
Multimodal ChatGPT for Medical Applications: an Experimental Study of GPT-4V
Paper β’ 2310.19061 β’ Published β’ 8 -
Lumiere: A Space-Time Diffusion Model for Video Generation
Paper β’ 2401.12945 β’ Published β’ 86
spaces
7
Runtime error
3
π©
ESPNet2 ASR Librispeech word vs bpe tokens
Runtime error
π¨
ESPNet2 ASR Librispeech Conformer (100h)
Runtime error
10
π¨π¨π»βπ¨
Artist Classifier
Runtime error
4
π
Demo Painttransformer
Runtime error
4
π
Demo GradCAM Imagenet
Runtime error
3
π
Demo Image Pyxelate
models
8
jkang/espnet2_an4_transformer
Automatic Speech Recognition
β’
Updated
β’
4
jkang/espnet2_librispeech_100_conformer_char
Automatic Speech Recognition
β’
Updated
β’
4
jkang/espnet2_librispeech_100_conformer_word
Automatic Speech Recognition
β’
Updated
β’
3
jkang/espnet2_librispeech_100_conformer
Automatic Speech Recognition
β’
Updated
β’
13
jkang/espnet2_mini_librispeech_diar
Updated
β’
6
jkang/espnet2_an4_asr
Automatic Speech Recognition
β’
Updated
β’
7
jkang/drawing-artistic-trend-classifier
Updated
β’
22
jkang/drawing-artist-classifier
Updated
β’
13
β’
1
datasets
None public yet