Open Source
Projects we maintain
Semiosis ↗
Unit testing for your knowledge base. A Python framework that uses LLMs to evaluate the semantic quality of documentation and context systems.
Sniffbench ↗
Custom eval cases for coding agent harnesses. Test new skills, configs, and tools against your actual GitHub issues or tasks you design.
Inference Economics ↗
Compare LLM inference costs across local hardware, cloud GPU rentals, and API providers. Make decisions using real benchmark data, not vibes.
Website ↗
This website is open source. Built with Astro, Tailwind, and a custom CRT-inspired design system.
Contribute
Want to contribute or collaborate? Reach out to discuss ideas, report issues, or get involved with our projects.