CodebaseRAG

A high-performance backend for codebase retrieval augmented generation (RAG) that allows you to ask natural language questions about your codebase and get accurate, contextual answers.

English | 简体中文

🚀 Features

Superior Performance: Outperforms existing open-source alternatives with faster retrieval and more accurate results
Semantic Code Understanding: Leverages advanced embeddings to understand code semantics
Cross-Reference Awareness: Maintains awareness of relationships between files and functions
Contextual Answers: Provides answers with relevant code snippets and references
Efficient Caching: Smart caching system minimizes redundant processing

📋 Requirements

Python 3.10+
Git

🔧 Installation

Clone the repository:

git clone https://github.com/yourusername/codebase-rag.git
cd codebase-rag

Install dependencies:

pip install -r requirements.txt

💻 Usage

Basic Usage

python src/main.py local

Serve as backend

python src/main.py webserver

Refresh vector cache

python src/main.py local --refresh_cache

Specify the config file

python src/main.py local --config /path/to/config

Advanced Configuration

Edit the src/config.yaml file to customize:

Codebase path
Webserver parameters
LLM parameters

🏗️ Architecture

CodebaseRAG consists of several key components:

Code Parser (codeparser.py): Analyzes and extracts structured information from source code
Embeddings Engine (embeddings.py): Creates semantic representations of code
Vector Store (vector_store.py): Efficiently indexes and retrieves relevant code snippets
LLM Interface (llm.py): Generates human-readable answers from retrieved context
Caching Layer (cache.py): Optimizes performance through intelligent caching

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Fork the repository
Create your feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add some amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
src		src
.gitignore		.gitignore
README.md		README.md
README_zh.md		README_zh.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CodebaseRAG

🚀 Features

📋 Requirements

🔧 Installation

💻 Usage

Basic Usage

Serve as backend

Refresh vector cache

Specify the config file

Advanced Configuration

🏗️ Architecture

🤝 Contributing

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CodebaseRAG

🚀 Features

📋 Requirements

🔧 Installation

💻 Usage

Basic Usage

Serve as backend

Refresh vector cache

Specify the config file

Advanced Configuration

🏗️ Architecture

🤝 Contributing

📄 License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages