working on ollama, the easiest way to run llms on your computer
my work at ollama revolves around ai agents, building model capabilties, function/tool calling, structured outputs, the python sdk and general systems engineering
ollama run parthsareen/me
previously ran a startup called extensible ai where I worked on ai agent reliability, extensitrace (a thread-safe tracing library for agents), DAGent (agents as directed acyclic graphs), and also online tool use for agents
used to work on distributed systems at tesla and autodesk with scala, go, and python. built on-device ml pipelines in c++ at apple. did some pm too at some point.
regularly make latte art, sometimes do muay thai, and like to get good at new things (i'm very competitive)
Writings
- Sampling and structured outputs in LLMs 2025-09-10
- Streaming responses with tool calling 2025-05-28
- Structured outputs in ollama 2024-12-06
- Functions as tools in ollama 2024-11-25
- Learnings from building a Graph AI Agent Framework 2024-10-22