Change use_cache to True, which significantly speeds up inference 721f3de TheBloke committed on May 5, 2023
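
For readers who want to apply the same change to a local copy of a model, the following is a minimal sketch rather than the actual diff from this commit. It assumes the flag lives as a top-level "use_cache" key in a JSON config file (commonly config.json in Hugging Face model repos); the helper name enable_use_cache and the file path are hypothetical.

```python
import json
from pathlib import Path


def enable_use_cache(config_path):
    """Set "use_cache" to true in a JSON config file and return the updated dict.

    Assumption: the config is a flat JSON object with a top-level
    "use_cache" key, as in a typical Hugging Face config.json.
    """
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8"))

    # Flip the flag; every other key is left untouched.
    config["use_cache"] = True

    path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config


if __name__ == "__main__":
    # Hypothetical path; point this at the model directory's config file.
    updated = enable_use_cache("config.json")
    print("use_cache =", updated["use_cache"])
```

With the cache enabled, generation can reuse the attention key/value tensors computed for earlier tokens instead of recomputing them at every decoding step, which is the likely source of the speedup the commit message describes.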