Skip to content

Convert Flask LLM Serve Code to FastAPI for Asynchronous Support #27

@Himanshu-Yadav-0

Description

@Himanshu-Yadav-0

Currently, the codebase uses Python Flask for serving the LLM. I propose migrating this implementation to FastAPI, which is a more advanced library and provides better asynchronous support. This change will improve scalability, performance, and modernize the API architecture.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions