I’m Antonie Chirilus, R&D engineer. I share my machine learning and LLMOps learnings here.
Role
R&D · AI Engineer
Focus
LLM reliability, agents
Based
Bucharest, RO
Stack
Python · llama.cpp · MCP
§ 01 · Essays & field notes
Field notes from production LLM work — what breaks, what holds, and the math that decides which. New essays roughly every other week.
qwen3-coder:30b should fit comfortably on a single 5090, but it was silently spilling 14 GB to the CPU for a 256k context window I never used. One Modelfile parameter took inference from 12s to 4s per turn.
Read essay →I spent a week chasing latency on a local LLM setup. The bottleneck wasn't the GPU, the model, or the quantization — it was the prefill stage doing the same 40K tokens of work on every request.
Read essay →Why I'm starting this, what to expect, and the kind of writing I want to do here.
Read essay →§ 02 · Selected work
Persistent memory for LLM agents, modeled on human cognition. Seven typed stores — episodic, semantic, procedural, entity, working, summary, buffer — each with its own update strategy. 44 tests, live Streamlit demo.
Drop-in MCP middleware that catches silent tool-output drift — when Slack reorders threads, an API renames a field, a DB schema shifts — before it cascades through your agent chain. SBERT embeddings + PSI on the projected distribution.
A five-agent pipeline that turns a one-line requirement into a working CrewAI repo on GitHub. Architect, codegen, test-writer, reviewer with self-correction, deployer.
An autonomous code-audit swarm. Navigator + Analyst agents on AutoGen 0.4, tool access through MCP, walking a remote repository and producing a structured improvements report.
§ 03 · Experience
R&D Engineer · AI
Keysight Technologies · Bucharest
Building a local AI assistant that lets users configure a networking platform in plain natural language. On-device inference with llama.cpp.
AI Engineer · Intern
Keysight Technologies
Production RAG over Keysight’s audit corpus. Function-calling agents on Azure OpenAI, retrieval through Azure Cognitive Search, FastAPI.
ML Engineer · Contract
Roglia SRL · Remote
Custom AI chatbot for public institutions. CrewAI, LangChain, Voiceflow, Groq + OpenAI inference, Grafana for observability.
B.Sc Computer Science
University of Bucharest
Finishing this year. Prior: several years of mathematical olympiad at the national stage.
§ 04 · Correspondence
If a post here was useful, wrong, or worth arguing about — email me. I’m also open to AI engineering roles, full-time or contract, hybrid in Bucharest or remote on European hours. The shorter the email, the faster the reply.