Convert Flask LLM Serving Code to FastAPI for Asynchronous Support

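Currently, the codebase serves the LLM with Flask. I propose migrating the serving code to **FastAPI**, an ASGI framework with native **asynchronous** (`async`/`await`) request handlers. LLM requests tend to be long-running, so async handlers should let the server keep accepting requests while earlier ones are still in flight. This change should improve _scalability_ and _performance_ under concurrent load, and modernize the _API architecture_.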