Only the first token is received, not the entire response
#84 by venkteshm - opened
@ysharma
The following code returns only the first token of the response.
Am I missing something basic here? Can you please look into it?
from gradio_client import Client

client = Client("https://ysharma-explore-llamav2-with-tgi.hf.space/")
result = client.predict(
    "Capital of India",  # str in 'Message' Textbox component
    api_name="/chat"
)
print(result)
Result returned: ' The'
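A possible explanation, offered as a sketch rather than a confirmed fix: the `/chat` endpoint appears to stream its answer, and with a streaming (generator) endpoint some gradio_client versions return only the first yielded value from `predict`. The gradio_client documentation describes `submit`, which returns a `Job` whose `outputs()` method lists every value yielded so far. The helper names `final_output` and `ask`, the polling interval, and the timeout below are hypothetical, and the code assumes each streamed update contains the full text generated so far, so the last update is the complete answer.

```python
import time


def final_output(outputs):
    """Return the last streamed update, or None if nothing was received."""
    return outputs[-1] if outputs else None


def ask(message, space_url="https://ysharma-explore-llamav2-with-tgi.hf.space/",
        timeout_s=120.0):
    # Imported here so final_output() stays usable without gradio_client installed.
    from gradio_client import Client

    client = Client(space_url)
    # submit() returns a Job immediately instead of blocking for one value.
    job = client.submit(message, api_name="/chat")

    # Poll until the job finishes or the (hypothetical) timeout expires.
    deadline = time.monotonic() + timeout_s
    while not job.done() and time.monotonic() < deadline:
        time.sleep(0.1)

    # outputs() returns every value the endpoint has yielded so far.
    return final_output(job.outputs())


if __name__ == "__main__":
    print(ask("Capital of India"))
```

If the updates turn out to be individual tokens rather than cumulative text, joining `job.outputs()` with `"".join(...)` would be the adjustment to try instead of taking the last element.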