# llm-deployment

Here are 13 public repositories matching this topic...

🧠 A comprehensive toolkit for benchmarking, optimizing, and deploying local Large Language Models. Includes performance testing tools, optimized configurations for CPU/GPU/hybrid setups, and detailed guides to maximize LLM performance on your hardware.

  • Updated Mar 27, 2025
  • Shell
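
The description above mentions performance testing for local models. As a rough, hedged sketch of the kind of throughput measurement such a toolkit performs (not taken from this repository's actual scripts), the following assumes a local server exposing an OpenAI-compatible `/v1/completions` endpoint; the endpoint URL, model name, and prompt are hypothetical:

```python
import time

import requests

# Hypothetical local endpoint and model name; adjust to your own setup.
ENDPOINT = "http://localhost:8080/v1/completions"
MODEL = "llama-3-8b-instruct"
PROMPT = "Explain the difference between CPU and GPU inference in one paragraph."


def measure_throughput(n_runs: int = 3) -> float:
    """Time a few completions and return average generated tokens per second."""
    rates = []
    for _ in range(n_runs):
        start = time.perf_counter()
        resp = requests.post(
            ENDPOINT,
            json={"model": MODEL, "prompt": PROMPT, "max_tokens": 128},
            timeout=120,
        )
        resp.raise_for_status()
        elapsed = time.perf_counter() - start
        # Many OpenAI-compatible servers report token counts in a `usage` block.
        tokens = resp.json().get("usage", {}).get("completion_tokens", 0)
        rates.append(tokens / elapsed)
    return sum(rates) / len(rates)


if __name__ == "__main__":
    print(f"average throughput: {measure_throughput():.1f} tokens/s")
```

Comparing this number across CPU-only, GPU, and hybrid configurations is the basic loop behind the kind of benchmarking the toolkit describes.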

ModelSpec is an open, declarative specification for describing how AI models, especially LLMs, are deployed, served, and operated in production. It captures execution, serving, and orchestration intent to enable validation, reasoning, and automation across modern AI infrastructure.

  • Updated Jan 16, 2026
  • Python
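
ModelSpec's real schema is not reproduced here; as a loose illustration of what a declarative deployment spec can capture (execution, serving, and orchestration intent that can be validated before handing off to infrastructure), here is a minimal Python sketch in which every type and field name is an assumption:

```python
from dataclasses import dataclass, field

# Hypothetical field names for illustration only; not ModelSpec's actual schema.


@dataclass
class ServingSpec:
    engine: str           # e.g. "vllm" or "llama.cpp"
    max_batch_size: int
    max_context_len: int


@dataclass
class ExecutionSpec:
    accelerator: str      # e.g. "gpu", "cpu", "hybrid"
    replicas: int
    quantization: str | None = None


@dataclass
class ModelDeployment:
    model_id: str
    execution: ExecutionSpec
    serving: ServingSpec
    labels: dict[str, str] = field(default_factory=dict)

    def validate(self) -> None:
        """Check deployment intent before an orchestrator acts on it."""
        if self.execution.replicas < 1:
            raise ValueError("replicas must be >= 1")
        if self.serving.max_batch_size < 1:
            raise ValueError("max_batch_size must be >= 1")


spec = ModelDeployment(
    model_id="meta-llama/Llama-3-8B-Instruct",
    execution=ExecutionSpec(accelerator="gpu", replicas=2, quantization="int8"),
    serving=ServingSpec(engine="vllm", max_batch_size=32, max_context_len=8192),
)
spec.validate()
```

The point of keeping such intent declarative is that the same document can drive validation, reasoning about capacity, and automation, rather than being buried in ad-hoc launch scripts.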
