Enterprise RAG, on-prem
Built an enterprise RAG system from scratch, unifying knowledge across Confluence, Jira, and GitHub. Deployed fully offline on-premises for a regulated finance environment.
- Hybrid retrieval (BM25 + pgvector) with reciprocal rank fusion
- Cross-encoder reranking for precision
- Instructor-XL embeddings and Qwen 32B running locally on NVIDIA GPU hardware
- REST API, web UI, and Microsoft Teams integration
- 67% reduction in time-to-information across previously siloed sources
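
For readers unfamiliar with the fusion step in the hybrid retrieval bullet, here is a minimal sketch of reciprocal rank fusion in Python. It is an illustration under stated assumptions, not the project's actual code: the function name, the document IDs, and the two input lists are hypothetical, and `k=60` is the commonly used default constant rather than a value confirmed for this system.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into a single ranking.

    Each document scores 1 / (k + rank) for every list it appears in
    (rank is 1-based); the scores are summed across lists.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document ID for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from the two retrievers (best match first).
bm25_hits = ["doc_a", "doc_b", "doc_c"]    # keyword search, e.g. BM25
vector_hits = ["doc_b", "doc_d", "doc_a"]  # embedding search, e.g. pgvector

fused = reciprocal_rank_fusion([bm25_hits, vector_hits])
print(fused)  # ['doc_b', 'doc_a', 'doc_d', 'doc_c']
```

Because the fusion uses only rank positions, the BM25 and vector similarity scores never need to be normalized onto a common scale. The fused list would then typically be passed to the cross-encoder reranker.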