A lightweight PDF question-answering app built with the DeepSeek LLM and LangChain, with a Streamlit frontend.
This project allows users to upload a PDF document and ask natural-language questions about its content. The app parses the PDF, splits the content into chunks, embeds the chunks and stores them in a FAISS vector database, then retrieves the most relevant chunks and passes them to the DeepSeek LLM to generate answers.
- Upload any PDF
- Chunk and embed content using LangChain
- Semantic search using FAISS
- LLM-based question answering using DeepSeek
- Streamlit-based intuitive UI
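The chunking step above can be illustrated with a minimal, library-free sketch. This is not the project's implementation (which uses LangChain's `RecursiveCharacterTextSplitter`); it is a fixed-size character splitter with overlap, and the `chunk_size`/`overlap` values are illustrative assumptions, not the ones this repo uses.

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into overlapping character windows.

    Overlap keeps context that falls on a chunk boundary visible in
    both neighboring chunks, which helps retrieval quality.
    """
    chunks = []
    step = chunk_size - overlap  # advance less than chunk_size to create overlap
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
    return chunks

# A 1200-character document with chunk_size=500 and overlap=50
# yields windows starting at 0, 450, and 900.
print(len(chunk_text("A" * 1200)))  # → 3
```

LangChain's real splitter additionally tries to break on paragraph, sentence, and word boundaries before falling back to raw character positions.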
RAG_WITH_DEEPSEEK/
│
├── deepseek/
│ ├── vector_databases.py # Loads, chunks, embeds PDF content
│ ├── Rag_pipeline.py # Retrieval + QA logic
│ ├── frontend.py # Streamlit app
│ └── requirements.txt
│
├── pdfs/ # PDF upload folder
├── vectorstore/ # FAISS vector database folder
├── data.pdf # Sample PDF
├── .env # Environment variables
└── README.md
- Upload a PDF using the Streamlit UI.
- PDF is parsed using `PDFPlumberLoader`.
- Content is chunked via `RecursiveCharacterTextSplitter`.
- Chunks are embedded using DeepSeek embeddings via `OllamaEmbeddings`.
- Chunks are stored in a FAISS vector store.
- On a user query, relevant chunks are retrieved and passed to DeepSeek LLM for answering.
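The retrieval step can be sketched without any libraries. In the real app, FAISS searches dense embedding vectors; here, as a stand-in for illustration only, chunks are represented as bag-of-words counts and ranked by cosine similarity, which shows the same retrieve-top-k idea.

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    """Toy 'embedding': a sparse bag-of-words vector (stand-in for real embeddings)."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, chunks: list[str], k: int = 1) -> list[str]:
    """Return the k chunks most similar to the query."""
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]

docs = [
    "FAISS stores dense vectors",
    "Streamlit renders the UI",
    "The LLM generates the final answer",
]
print(retrieve("stores dense vectors", docs))  # → ['FAISS stores dense vectors']
```

In the app itself, the retrieved chunks are then inserted into the prompt sent to the DeepSeek LLM, which grounds its answer in the document rather than its training data.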
git clone https://github.com/your-username/pdf-rag-deepseek.git
cd pdf-rag-deepseek
python -m venv deepseek
source deepseek/bin/activate # or .\deepseek\Scripts\activate on Windows
pip install -r requirements.txt
streamlit run frontend.py

Check requirements.txt for the full dependency list.
RAG, LangChain, DeepSeek, Streamlit, Vector Database, LLM, PDF, FAISS
MIT License