README
Future Work
- cloud deployment
- improve chunking strategy
- probably too fine-grained right now using individual sections
- try full pages
- try a hierarchical retrieval strategy
- use full pages as initial pass
- then use sections only from within that page as second pass
- "BS" mitigation strategies?
- improve embedding db persistence strategy
- CI/CD triggered by docs changes
- mitigate prompt injection attacks
- https://github.com/protectai/rebuff not yet fully local
- word counts limits (start at 1k maybe?)
- check if response is similar to system prompt, if so, emit message
- server-client model
- client should be a page sending queries to a server which runs the backend code
- client should be very thin and light-weight
- streamlit could be a starting point: https://docs.streamlit.io/develop/api-reference/chat