
Add asynchronous generate interface #30001

Merged · 7 commits · Feb 28, 2025
Conversation

@TheSongg (Contributor) commented Feb 26, 2025

  • PR title: [langchain_community.llms.xinference]: Add asynchronous generate interface

  • PR message: The asynchronous generate interface supports both streaming and non-streaming data.

      chain = prompt | llm
      async for chunk in chain.astream(input=user_input):
          yield chunk
    
  • Add tests and docs:

     from langchain_community.llms import Xinference
     from langchain.prompts import PromptTemplate

     llm = Xinference(
         server_url="http://0.0.0.0:9997",  # replace with your xinference server url
         model_uid=model_uid,  # replace with the model UID returned from launching the model
         stream=True,
     )
     prompt = PromptTemplate(
         input_variables=["country"],
         template="Q: where can we visit in the capital of {country}? A:",
     )
     chain = prompt | llm
     async for chunk in chain.astream(input=user_input):
         yield chunk
    
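Note that the `async for ... yield` pattern above must itself run inside an async generator, or be driven by an event loop. A minimal self-contained sketch of driving such a stream, where the hypothetical `fake_astream` stands in for `chain.astream` so no Xinference server is needed:

```python
import asyncio


async def fake_astream(text: str):
    # Stand-in for chain.astream(...): yields chunks asynchronously.
    for token in text.split():
        await asyncio.sleep(0)  # yield control to the event loop
        yield token


async def collect() -> list[str]:
    # Consume the async stream the same way the PR example does.
    chunks = []
    async for chunk in fake_astream("Paris has many museums"):
        chunks.append(chunk)
    return chunks


print(asyncio.run(collect()))
```

Running it collects the chunks in order, just as `chain.astream` would deliver model output chunk by chunk.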

@dosubot dosubot bot added the size:M This PR changes 30-99 lines, ignoring generated files. label Feb 26, 2025

vercel bot commented Feb 26, 2025

Name: langchain · Status: ✅ Ready · Updated (UTC): Feb 27, 2025 2:46am

@dosubot dosubot bot added the community Related to langchain-community label Feb 26, 2025
@ccurme (Collaborator) left a comment


How is authentication handled here?

@ccurme (Collaborator) commented Feb 26, 2025

Does their client not support async calls?

@ccurme ccurme self-assigned this Feb 26, 2025
@TheSongg (Contributor, Author) commented
Does their client not support async calls?

No, the xinference client only supports synchronous calls. In addition, langchain_community.chat_models doesn't support the xinference chat client.
https://github.com/xorbitsai/inference/blob/main/xinference/client/restful/restful_client.py
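Since the restful client is synchronous only, a common way to expose an async generate interface on top of it is to push the blocking iteration onto a worker thread with `run_in_executor`. A rough sketch of that technique under stated assumptions: `sync_generate` below is a hypothetical stand-in for the client's blocking streaming call, not the PR's actual implementation.

```python
import asyncio
from typing import AsyncIterator, Iterator


def sync_generate(prompt: str) -> Iterator[str]:
    # Stand-in for the synchronous client's streaming response.
    for token in ["Hello", " ", "world"]:
        yield token


async def agenerate(prompt: str) -> AsyncIterator[str]:
    # Pull each item from the blocking iterator on a worker thread,
    # so the event loop is never blocked between tokens.
    loop = asyncio.get_running_loop()
    it = iter(sync_generate(prompt))
    sentinel = object()
    while True:
        token = await loop.run_in_executor(None, next, it, sentinel)
        if token is sentinel:
            break
        yield token


async def main() -> list[str]:
    out = []
    async for token in agenerate("hi"):
        out.append(token)
    return out


print(asyncio.run(main()))
```

The sentinel passed as the default to `next` signals exhaustion without raising `StopIteration` across the executor boundary.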

@TheSongg (Contributor, Author) commented

How is authentication handled here?

In fact, there is currently no authentication during initialization.
https://github.com/langchain-ai/langchain/blob/master/libs/community/langchain_community/llms/xinference.py

@dosubot dosubot bot added the lgtm PR looks good. Use to confirm that a PR is ready for merging. label Feb 28, 2025
@ccurme ccurme merged commit 86b364d into langchain-ai:master Feb 28, 2025
18 checks passed