Deployable On-Premises RAG

I’m excited to introduce Minima, an open-source Retrieval-Augmented Generation (RAG) solution designed to work seamlessly on-premises or with integrations like ChatGPT and the Model Context Protocol (MCP). Whether you’re looking for a fully local RAG setup or prefer to integrate with external LLMs, Minima has you covered.

Minima is a containerized RAG solution that prioritizes security, flexibility, and simplicity. You can run it fully locally or integrate it with external AI services, depending on your needs.

Key Features

Minima currently supports three modes of operation:

Isolated Installation

• Fully on-premises operation with no external dependencies (e.g., ChatGPT or Claude).

• All neural networks—LLM, reranker, and embedding—run on your cloud or local PC.

• Ensures your data stays secure and private.

Custom GPT

• Query your local documents directly through the ChatGPT app or web interface via custom GPTs.

• The indexer runs on your local PC or cloud, while ChatGPT serves as the primary LLM.

Anthropic Claude

• Use the Claude app to query your local documents.

• The indexer operates on your local PC, with Anthropic Claude as the primary LLM.

With Minima, you can enjoy a flexible RAG solution that adapts to your infrastructure and security preferences.

Would love to hear your feedback, thoughts, or ideas! Check it out, and let me know what you think.

Cheers!

https://github.com/dmayboroda/minima

原文链接：Deployable On-Premises RAG

文章版权声明 1、本网站名称：拾光赋
2、本站永久网址：https://www.blogs.ink
3、本网站的文章部分内容可能来源于网络，仅供大家学习与参考，如有侵权，请联系站长QQ：805375623进行删除处理。
4、本站一切资源不代表本站立场，并不代表本站赞同其观点和对其真实性负责。
5、本站一律禁止以任何方式发布或转载任何违法的相关信息，访客发现请向站长举报
6、本站资源大多存储在云盘，如发现链接失效，请联系我们我们会第一时间更新。

THE END