# litellm-proxy

A local, fast, and lightweight **OpenAI-compatible server** to call 100+ LLM APIs.

## usage

```shell
$ pip install litellm
```

```shell
$ litellm --model ollama/codellama

#INFO: Ollama running on http://0.0.0.0:8000
```

## replace openai base

```python
import openai  # openai v1.0.0+

client = openai.OpenAI(api_key="anything", base_url="http://0.0.0.0:8000")  # set proxy to base_url

# request sent to model set on litellm proxy, `litellm --model`
response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[
        {
            "role": "user",
            "content": "this is a test request, write a short poem"
        }
    ],
)

print(response)
```

[**See how to call Huggingface, Bedrock, TogetherAI, Anthropic, etc.**](https://docs.litellm.ai/docs/simple_proxy)
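
## stream responses

Because the proxy is OpenAI-compatible, streaming works the same way as against the OpenAI API. A minimal sketch, assuming the proxy started above is still listening on `http://0.0.0.0:8000` (the prompt and model alias here are just placeholders, as in the example above):

```python
import openai  # openai v1.0.0+

client = openai.OpenAI(api_key="anything", base_url="http://0.0.0.0:8000")

# stream=True yields chunks as the proxied model generates tokens
stream = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "this is a test request, write a short poem"}],
    stream=True,
)

for chunk in stream:
    # each chunk carries a delta; content may be None on the final chunk
    print(chunk.choices[0].delta.content or "", end="")
```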