Spaces: ysharma / Explore_llamav2_with_TGI

Will the inference source code be available?

#43

by jackkwok - opened Jul 20, 2023

Discussion

jackkwok

Jul 20, 2023

I checked the files under the "Files" tab. The code is simple and just calls a secret API endpoint. I assume this 70B model is split across multiple GPUs in a cluster. I am curious how it is done behind the scenes. Is that code going to be available? If not, are there blog posts on how it is done?

alexx1

Jul 23, 2023

I'm interested in this too, as this Space sometimes produces honestly disturbing answers that I haven't been able to reproduce elsewhere. I'm curious what exactly is going on here.

wfcp

Jul 28, 2023

@ysharma could you please share details about the inference API deployment?

warewe

Aug 9, 2023

I would be interested to know the answer too.
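
For readers wondering what a call to such an endpoint might look like: the thread does not reveal the Space's actual backend, so the following is only a minimal sketch, assuming the endpoint is a Text Generation Inference (TGI) server exposing its standard `/generate` route. The URL, the helper names `build_payload` and `generate`, and the parameter values are hypothetical. TGI's launcher does have a `--num-shard` option for splitting a model across GPUs, but whether and how this Space uses it is not stated here.

```python
import json
import urllib.request

# Hypothetical URL: the Space's real endpoint is secret and not known here.
ENDPOINT_URL = "http://localhost:8080/generate"


def build_payload(prompt, max_new_tokens=256, temperature=0.7):
    """Build the JSON body for a TGI /generate request."""
    return {
        "inputs": prompt,
        "parameters": {
            "max_new_tokens": max_new_tokens,
            "temperature": temperature,
        },
    }


def generate(prompt, url=ENDPOINT_URL, timeout=60):
    """POST the prompt to the server and return the generated text."""
    body = json.dumps(build_payload(prompt)).encode("utf-8")
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read())["generated_text"]
```

With a TGI server running at the assumed address, `generate("Hello")` would return the completion as a string. Any multi-GPU sharding happens on the server side, so a client like this one looks the same regardless of how the model is deployed.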
