Research-grade tooling for local inference

We build open-source software alongside research in language models, diffusion, and systems ML—with an emphasis on measurable performance, honest benchmarks, and sustainable engineering.

View organization on GitHub · Explore projects

What we work on

Focus areas

The Foundry brings together systems-minded contributors who care about what happens on real hardware—not only on slides.

Efficient inference

Low-overhead runtimes, careful memory lifecycle, and reproducible latency and throughput measurements.

Language & generative models

Tools and experiments grounded in current LLM and diffusion research, shared as open code and docs.

Open collaboration

Contributions are welcome where evidence and scope are clear. See each repository for its contribution guidelines.

Repositories

Projects

Active and upcoming initiatives under the Inference Foundry GitHub organization.

super-ollama

Terminal-native local LLM engine

Runs llama.cpp in-process via CGo, with no HTTP layer in the main user experience. The focus is low overhead and clean teardown.

Repo · Roadmap (wiki)

Crucible

Research journal & lab notes

Open research journal and experimental log.

Repo

BitForge

Planned

Quantization theory, methods, and reproducible experiments across bit-widths and runtimes.

Repo TBD — org doc

Lexicon

Planned

Open catalog of fine-tuned prompts, with versioning, licensing, and analysis to support reuse.

Repo TBD — org doc

Argus

Planned

Algorithms to detect AI-generated images using JEPA-based representations.

Repo TBD — org doc

Leadership

Founders

Reach out directly to the founders or use the community Discord for contributor and project discussions.

Kritarth Dandapat

Founder · Project lead (Argus, Crucible)

GitHub
LinkedIn
Discord: kritarth2006

Atshal Ahmed Khan

Founder · Team lead (super-ollama)

GitHub
LinkedIn
Discord: atshal123

How we work

Principles

These expectations apply across our public spaces and reviews.

Evidence over opinion. Performance claims belong with methodology: hardware, versions, and raw artifacts when possible.
Safety and respect. Follow the organization Code of Conduct; debate the technical content, not the person.
Scope discipline. Large features start with discussion in the relevant repo so maintainers and contributors share context early.
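
As an illustration of "Evidence over opinion", the sketch below shows one way a performance claim could carry its methodology with it: host details, versions, and the raw per-run samples alongside the summary. The record layout and the names BenchmarkRecord, describe_host, run_benchmark, and to_json are hypothetical and chosen for this example; they are not a schema or API defined by the Foundry or any of its repositories.

```python
import json
import platform
import statistics
import sys
import time
from dataclasses import asdict, dataclass, field


@dataclass
class BenchmarkRecord:
    """Hypothetical record layout (an assumption, not a Foundry schema)."""
    name: str
    hardware: dict
    versions: dict
    samples_ms: list = field(default_factory=list)  # raw per-run latencies

    def summary(self) -> dict:
        # Summary statistics are derived from the raw samples, never stored alone.
        return {
            "runs": len(self.samples_ms),
            "median_ms": statistics.median(self.samples_ms),
            "min_ms": min(self.samples_ms),
            "max_ms": max(self.samples_ms),
        }


def describe_host() -> dict:
    """Collect basic host details from the standard library."""
    return {
        "machine": platform.machine(),
        "processor": platform.processor(),
        "system": platform.system(),
        "release": platform.release(),
    }


def run_benchmark(name, fn, runs=5, versions=None) -> BenchmarkRecord:
    """Time fn() `runs` times and keep every sample with its context."""
    record = BenchmarkRecord(
        name=name,
        hardware=describe_host(),
        versions={"python": sys.version.split()[0], **(versions or {})},
    )
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        record.samples_ms.append((time.perf_counter() - start) * 1000.0)
    return record


def to_json(record: BenchmarkRecord) -> str:
    """Serialize the full record plus its summary as a shareable artifact."""
    return json.dumps({**asdict(record), "summary": record.summary()}, indent=2)


if __name__ == "__main__":
    # Placeholder workload; a real run would call the code under test.
    rec = run_benchmark("sum-range", lambda: sum(range(100_000)), runs=5)
    print(to_json(rec))
```

Keeping the raw samples next to the summary lets a reviewer recompute the statistics or spot outliers instead of taking a single headline number on trust.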

Contact and collaborate

Contributors and collaborators should start on Discord. For direct contact, message the founders through GitHub or LinkedIn.

Join Discord