📂 PDF-Extraction-Tool

DocuMind AI is a professional-grade Retrieval-Augmented Generation (RAG) application that allows you to have natural conversations with your PDF documents. By combining semantic search with the power of Gemini AI, it provides accurate, grounded answers to your most complex document queries.

🚀 [Live Demo: DocuMind AI](https://huggingface.co/spaces/ghost4488/pdfExtraction)

🌟 Quick Overview

Engine: Google Gemini 2.0 Flash & ChromaDB.
Capability: Instant indexing, automated topic tagging, and cited answers.
UI: Fully Responsive Dark/Light mode dashboard with mobile auto-indexing.
Trust: Grounded responses based strictly on document context to provide maximum factual precision.

✨ Key Features

📱 Mobile Optimized: A fully responsive interface designed for seamless use on smartphones, tablets, and desktops.
🚀 Instant Indexing: Upload any PDF and start chatting in seconds.
🎯 Context-Aware Answers: Our AI understands the full context of your document to provide precise insights.
🏷️ Automated Topic Tagging: Uses AI to categorize your documents with smart, descriptive tags.
🔒 Secure & Private: Built with privacy in mind—your documents are processed securely.
🗑️ Reset Capability: Built-in "Danger Zone" allows you to wipe all uploaded data and start fresh with one click.

🛠️ How It Works (RAG Architecture)

The application uses Retrieval-Augmented Generation (RAG) to ensure that every answer the AI provides is backed by the actual text in your documents:

Ingestion: The PDF is parsed using OCR and split into semantic chunks.
Vectorization: Each chunk is converted into a vector representation and stored in ChromaDB.
Retrieval: When you ask a question, the system performs a semantic search to find the most relevant chunks.
Augmentation: These relevant chunks are provided to the Gemini model as ground-truth context.
Generation: The AI generates an answer based only on that context.

🚀 Tech Stack

Frontend: HTML5, CSS3 (Inter & JetBrains Mono fonts), JavaScript.
Backend: Python / Flask.
AI Model: Google Gemini 2.5 Flash.
Vector Database: ChromaDB.
OCR Engine: Tesseract OCR (via PyMuPDF4LLM).
Deployment: Docker on Hugging Face Spaces.

📥 Getting Started

Prerequisites

Python 3.11+
Google Gemini API Key
Tesseract OCR (for local development)

Installation

Clone the repository:

git clone [https://github.com/ghost4488/pdfExtraction.git](https://github.com/ghost4488/pdfExtraction.git)
cd pdfExtraction

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.github		.github
static		static
templates		templates
.dockerignore		.dockerignore
.gitattributes		.gitattributes
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
app.py		app.py
rag_master.py		rag_master.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📂 PDF-Extraction-Tool

🚀 [Live Demo: DocuMind AI](https://huggingface.co/spaces/ghost4488/pdfExtraction)

🌟 Quick Overview

✨ Key Features

🛠️ How It Works (RAG Architecture)

🚀 Tech Stack

📥 Getting Started

Prerequisites

Installation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

📂 PDF-Extraction-Tool

🚀 [Live Demo: DocuMind AI](https://huggingface.co/spaces/ghost4488/pdfExtraction)

🌟 Quick Overview

✨ Key Features

🛠️ How It Works (RAG Architecture)

🚀 Tech Stack

📥 Getting Started

Prerequisites

Installation

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages