langchain
Does it integrate well with langchain? Are there examples?
Since we host this model using the Hugging Face text-generation-inference server, to the best of my knowledge you can follow https://python.langchain.com/docs/integrations/llms/huggingface_textgen_inference for an example of how to use it with LangChain. Cheers!
Would love a tutorial on how to set this up for hosting and creating an API endpoint for querying via HTTP on AWS, or make it an inference endpoint deployable on Hugging Face (ideally also in EU AWS data centers, for GDPR (DSGVO) and DPA approval). Cheers!
FYI: @dm-mschubert https://github.com/awslabs/extending-the-context-length-of-open-source-llms/pull/3 We are working on it now :)
Hi @dm-mschubert, we have a notebook to deploy FalconLite onto a SageMaker endpoint running on AWS: https://github.com/awslabs/extending-the-context-length-of-open-source-llms/blob/main/custom-tgi-ecr/deploy.ipynb
Feel free to give it a try and let us know if you run into any issues. Thanks!
Which SageMaker Image and Kernel should we use for https://github.com/awslabs/extending-the-context-length-of-open-source-llms/blob/main/custom-tgi-ecr/deploy.ipynb?