RC Data Science
Researcher Training
Local LLM example with ollama

Future Work

cloud deployment
improve chunking strategy

chunking by individual sections is probably too fine-grained right now
try full pages
try a hierarchical retrieval strategy

use full pages as initial pass
then use sections only from within that page as second pass
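
The two-pass idea above can be sketched as follows. This is only an illustration under stated assumptions: `embed` is a toy bag-of-words stand-in for a real embedding model, and the `docs` layout (page title mapped to a list of section strings) and the function name `hierarchical_retrieve` are hypothetical, not taken from the repository.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for a real embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def hierarchical_retrieve(query, pages, top_pages=1, top_sections=2):
    """Two-pass retrieval: rank full pages, then sections within the best pages.

    `pages` maps a page title to a list of section strings (assumed layout).
    """
    q = embed(query)
    # First pass: score each page on the text of all its sections joined together.
    ranked_pages = sorted(
        pages,
        key=lambda title: cosine(q, embed(" ".join(pages[title]))),
        reverse=True,
    )[:top_pages]
    # Second pass: score only the sections that belong to the selected pages.
    candidates = [(title, s) for title in ranked_pages for s in pages[title]]
    candidates.sort(key=lambda pair: cosine(q, embed(pair[1])), reverse=True)
    return candidates[:top_sections]


# Hypothetical documentation pages, for illustration only.
docs = {
    "Storage": [
        "Scratch storage is purged after 30 days.",
        "Project storage quotas can be increased on request.",
    ],
    "Jobs": [
        "Submit batch jobs with the scheduler.",
        "Interactive jobs have a time limit.",
    ],
}
results = hierarchical_retrieve("can project storage quotas be increased", docs)
```

Restricting the second pass to sections of the selected pages keeps a stray section from an unrelated page out of the context, at the cost of missing an answer if the first pass picks the wrong page; raising `top_pages` trades one risk against the other.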


"BS" mitigation strategies?
improve embedding db persistence strategy

CI/CD triggered by docs changes


mitigate prompt injection attacks


Rebuff (https://github.com/protectai/rebuff) is not yet fully local
word count limits on queries (start at 1k maybe?)
check if response is similar to system prompt, if so, emit message
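
A minimal sketch of the word count limit and the system-prompt similarity check listed above, using only the standard library. The 0.8 threshold, the refusal message, and all function names are assumptions for illustration. `difflib.SequenceMatcher` is a simple stand-in for whatever similarity measure ends up being used.

```python
from difflib import SequenceMatcher

MAX_QUERY_WORDS = 1000  # starting point from the notes ("1k maybe?")
LEAK_THRESHOLD = 0.8    # assumed cutoff, would need tuning
REFUSAL = "Sorry, I can't share that."  # hypothetical message


def query_within_limit(query, max_words=MAX_QUERY_WORDS):
    """Return True if the query has at most `max_words` whitespace-separated words."""
    return len(query.split()) <= max_words


def resembles_system_prompt(response, system_prompt, threshold=LEAK_THRESHOLD):
    """Return True if the response is textually close to the system prompt."""
    ratio = SequenceMatcher(None, response.lower(), system_prompt.lower()).ratio()
    return ratio >= threshold


def guard_response(response, system_prompt):
    """Replace a response that looks like a leaked system prompt with a message."""
    if resembles_system_prompt(response, system_prompt):
        return REFUSAL
    return response
```

Comparing whole strings only catches a response that is mostly the system prompt. A prompt embedded in a much longer response would score low, so a sliding-window or embedding-based comparison may be needed in practice.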


server-client model

client should be a page sending queries to a server which runs the backend code
client should be very thin and light-weight
streamlit could be a starting point: https://docs.streamlit.io/develop/api-reference/chat
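
The split described above can be sketched with the standard library alone: a server that owns the backend logic and a thin client that only posts a query and reads the answer. This does not use streamlit. The JSON shape (`query` in, `answer` out), the function names, and the echo backend in `answer_query` are all assumptions for illustration.

```python
import json
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.request import Request, urlopen


def answer_query(query):
    """Placeholder for the backend (retrieval plus LLM call); hypothetical."""
    return f"echo: {query}"


class QueryHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Read the JSON request body and pass the query to the backend.
        length = int(self.headers.get("Content-Length", 0))
        payload = json.loads(self.rfile.read(length))
        body = json.dumps({"answer": answer_query(payload["query"])}).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # silence per-request logging


def start_server(host="127.0.0.1", port=0):
    """Start the server in a background thread; port 0 picks a free port."""
    server = HTTPServer((host, port), QueryHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server


def ask(base_url, query):
    """Thin client: send the query to the server and return its answer."""
    data = json.dumps({"query": query}).encode("utf-8")
    request = Request(base_url, data=data, headers={"Content-Type": "application/json"})
    with urlopen(request, timeout=10) as response:
        return json.loads(response.read())["answer"]
```

A chat page would call something like `ask` once per user message and render the result, so the model, the embedding database, and the prompt handling all stay on the server.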