Hi all, at work I get a lot of questions about the state of the art in open source language models, and how to build chatbots on top of your own data.
I made a 100% open source knowledge-grounded chatbot that allows you to ask questions and chat with the Transformers docs. Powered by Flan-UL2 (which I've anecdotally found to be the most performative commercially licensed open source instruction tuned LLM), Langchain, Instructor Embeddings (STOTA in vector embeddings), and FAISS.
You can clone the space and play around with your own data, clone the repo locally, and take every line of code for your own projects.