Spaces:
Running
Running
File size: 3,242 Bytes
de0b5e5 b4e24d3 7d88dd0 de0b5e5 4096e4b |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 |
# MedRAG
**MedRAG** is a Retrieval-Augmented Generation (RAG) system designed to make drug-related information more accessible to
patients. Many patients struggle with the patient information leaflets that accompany medications, which are often
packed with crucial information in small print, leading to confusion about dosages, side effects, and other essential
drug details. MedRAG aims to bridge this gap by providing a digital platform where users can easily ask questions and
receive clear, understandable answers.
## Features
- **Bot and Website Development**: A user-friendly interface where patients can input questions and receive responses
derived from drug guides and patient leaflets.
- **Input and Output Guardrails**: Safeguards to ensure the reliability, appropriateness, and safety of the information
provided, preventing potential misuse.
- **DSPy Prompting**: Utilizes Dynamic Scripting in Python (DSPy) to create reliable and adaptable prompts for the
question-answering process.
- **Retrieval-Augmented Generation (RAG) with Semantic Caching**: Enhances data retrieval and response time by caching
semantically similar queries, ensuring both speed and accuracy.
- **Vector Database**: Leverages a vector database for efficient retrieval of drug information, enabling quick and
precise responses to user queries.
- **Observability**: Integrated monitoring tools to track system health, retrieval performance, and ensure consistent,
high-quality user experiences.
## High-Level Architecture
![Architecture](doc/images/MediRAg.drawio.png)
1. **Question-Answering Bot and Website**:
- Users can interact with a bot on the website to ask drug-related questions.
- The bot retrieves information from drug guides and patient information leaflets to provide clear and concise
answers.
2. **Input and Output Guardrails**:
- Implemented to filter inappropriate or potentially harmful queries.
- Ensures the bot's responses are accurate and aligned with medical guidelines.
3. **DSPy Prompting**:
- Uses DSPy to dynamically generate prompts that guide the retrieval process.
- Helps in crafting responses that are both contextually relevant and easy to understand.
4. **Retrieval-Augmented Generation (RAG) with Semantic Caching**:
- Utilizes a RAG model to combine real-time retrieval with language generation.
- Semantic caching improves the response time by reusing answers to similar questions.
5. **Vector Database**:
- Employs a vector database for fast and effective retrieval of information.
- Enhances the bot's ability to search and retrieve relevant content from large datasets.
6. **Observability**:
- Includes tools to monitor and log the system’s performance.
- Helps in maintaining system integrity and ensuring reliable operation.
## Getting Started
To get started with MedRAG:
1. Clone the repository:
```bash
git clone https://github.com/alvinhenrick/medirag.git
```
2. Install the required dependencies:
```bash
cd medirag
poetry install
```
3. Run the bot:
```bash
poetry run python app.py
```
## License
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details. |