Retrieval-augmented generation framework that utilizes small, specialized models instead of large language models, significantly reducing computational and financial costs while offering cost-effective RAG solutions that can run on standard hardware.