Everything you need to know about LLM inference
A high-performance, QUIC-based protocol for streaming raw UTF-8 markdown from LLMs with minimal CPU overhead. Optimized for internal inference infrastructure.
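A minimal consumer-side sketch in TypeScript, assuming a runtime with WebTransport support (which runs over QUIC via HTTP/3) and assuming the server pushes each response on one incoming unidirectional stream. The endpoint URL, the `streamMarkdown` helper, and the one-stream-per-response layout are illustrative assumptions, not part of any published spec. The CPU-saving detail is incremental UTF-8 decoding with `TextDecoder` in streaming mode, so a multi-byte code point split across chunk boundaries is buffered rather than re-scanned.

```typescript
// Sketch: read streamed UTF-8 markdown over a QUIC (WebTransport)
// connection and hand decoded text to a callback as it arrives.
// The URL and stream layout below are assumptions for illustration.
async function streamMarkdown(
  url: string,
  onChunk: (text: string) => void,
): Promise<void> {
  const transport = new WebTransport(url);
  await transport.ready;

  // Assumption: the server sends the response on one unidirectional stream.
  const streams = transport.incomingUnidirectionalStreams.getReader();
  const { value: stream } = await streams.read();
  if (!stream) throw new Error("server closed without sending a stream");

  // A streaming TextDecoder holds back the bytes of a multi-byte code
  // point that straddles a chunk boundary until the next read completes it.
  const decoder = new TextDecoder("utf-8");
  const bytes = stream.getReader();
  for (;;) {
    const { value, done } = await bytes.read();
    if (done) break;
    onChunk(decoder.decode(value, { stream: true }));
  }
  onChunk(decoder.decode()); // flush any buffered trailing bytes

  transport.close();
}

// Usage (hypothetical internal endpoint): print markdown as it streams in.
streamMarkdown("https://inference.internal:4433/generate", (text) => {
  console.log(text);
}).catch(console.error);
```

QUIC is a plausible fit here because its streams avoid TCP head-of-line blocking, so one slow response does not stall others multiplexed on the same connection.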