# litellm-proxy

A local, fast, and lightweight **OpenAI-compatible server** to call 100+ LLM APIs.
## usage

```shell
$ pip install litellm
```

```shell
$ litellm --model ollama/codellama
# INFO: Ollama running on http://0.0.0.0:8000
```
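Once the proxy is up, you can verify it answers plain HTTP before wiring up a client. A minimal sketch, assuming the proxy is listening on port 8000 and exposes the OpenAI-style `/chat/completions` route:

```python
# quick smoke test against the proxy's OpenAI-compatible endpoint
# assumes the proxy started above is listening on http://0.0.0.0:8000
import requests

resp = requests.post(
    "http://0.0.0.0:8000/chat/completions",
    json={
        "model": "gpt-3.5-turbo",  # routed to whatever `litellm --model` was set to
        "messages": [{"role": "user", "content": "ping"}],
    },
)
print(resp.json())
```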
## replace openai base

```python
import openai  # openai v1.0.0+

# point the client at the litellm proxy instead of api.openai.com
client = openai.OpenAI(api_key="anything", base_url="http://0.0.0.0:8000")

# the request is routed to the model the proxy was started with (`litellm --model`)
response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[
        {
            "role": "user",
            "content": "this is a test request, write a short poem",
        }
    ],
)
print(response)
```
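Since the proxy speaks the OpenAI wire protocol, the client's standard `stream=True` flag should also pass through unchanged; a short sketch under that assumption:

```python
import openai

client = openai.OpenAI(api_key="anything", base_url="http://0.0.0.0:8000")

# stream the completion token-by-token instead of waiting for the full response
stream = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "this is a test request, write a short poem"}],
    stream=True,
)
for chunk in stream:
    # each chunk carries an incremental delta; content can be None on the final chunk
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="")
print()
```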
[**See how to call Huggingface, Bedrock, TogetherAI, Anthropic, etc.**](https://docs.litellm.ai/docs/simple_proxy)