This project is a modularized version of the ChromaDB Retrieval-Augmented Generation (RAG) chat application based on the Hugging Face cookbook tutorial: Semantic Cache with Chroma Vector Database. The application combines semantic caching and language model-based text generation to provide relevant and contextual responses to user queries.
The ChromaDB RAG Chat Application utilizes the following components:
- Dataset: The application loads the `keivalya/MedQuad-MedicalQnADataset` dataset using the `datasets` library and prepares it for further processing.
- Vector Database: The loaded dataset is stored in a ChromaDB collection, which serves as a vector database for efficient retrieval of relevant documents based on user queries.
- Semantic Cache: The application implements a semantic cache (`SemanticCache`) that stores previously asked questions, their embeddings, answers, and response texts. The cache uses the FAISS library for efficient similarity search and the Sentence Transformers library for encoding questions into embeddings.
- Language Model: The application utilizes the `mistralai/Mistral-7B-Instruct-v0.1` language model for generating responses based on the retrieved context from the vector database or the semantic cache.
- Modularized Structure: The application follows a modularized structure, separating different functionalities into individual files. This modular approach enhances code organization, reusability, and maintainability.
- Semantic Caching: The application employs a semantic cache that stores previous user queries, their embeddings, and corresponding responses. When a new query is asked, the cache is searched for similar questions using the FAISS library. If a similar question is found, the cached response is returned, reducing the need for database retrieval and language model inference.
- Vector Database: The application uses ChromaDB, a vector database, to store and retrieve relevant documents based on user queries. ChromaDB enables efficient similarity search, allowing the application to find the most relevant context for generating responses.
- Language Model Integration: The application integrates the `mistralai/Mistral-7B-Instruct-v0.1` language model through the `LLMModule` class. The language model is used to generate responses based on the retrieved context from the vector database or the semantic cache.
- Install the required dependencies: `datasets`, `chromadb`, `faiss`, `sentence_transformers`, `transformers`.
- Run the `main.py` script to start the chat application.
- Enter user queries in the chat interface. The application will retrieve relevant context from the semantic cache or the ChromaDB vector database and generate responses using the language model.
- To exit the chat, type `quit`.
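The usage flow above can be sketched as a minimal chat loop. The `cache`, `collection`, and `llm` objects and their `lookup`/`store`, `query`, and `generate` methods are assumptions about the project's interfaces, not its actual `main.py`.

```python
# Hedged sketch of the chat loop: try the semantic cache first, and fall back
# to ChromaDB retrieval plus LLM generation. Interfaces are assumptions.
def chat_loop(cache, collection, llm):
    while True:
        query = input("You: ")
        if query.strip().lower() == "quit":
            break
        answer = cache.lookup(query)  # semantic-cache hit?
        if answer is None:
            # Cache miss: retrieve context from ChromaDB, then ask the LLM.
            results = collection.query(query_texts=[query], n_results=3)
            context = " ".join(results["documents"][0])
            answer = llm.generate(query, context)
            cache.store(query, answer)  # remember for similar future queries
        print("Bot:", answer)
```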
- `dataset.py`: Contains functions for loading and preparing the dataset.
- `vectordb.py`: Defines functions for creating and interacting with the ChromaDB vector database.
- `semantic_cache.py`: Implements the semantic cache functionality using FAISS and Sentence Transformers.
- `llm_module.py`: Defines the `LLMModule` class for loading and utilizing the language model.
- `main.py`: The main script that orchestrates the chat application by combining the dataset, vector database, semantic cache, and language model.
