EvalFlow
Evaluation workflow system for prompt, model, and agent runs with experiment comparison, trace visibility, and feedback-driven iteration.
Architecting intelligent systems from chaos. Specializing in machine learning, with a deep focus on MLOps.
Autonomous AI code review agent for pull requests, focused on bug detection, regression checks, and security-oriented reasoning.
Agentic video generation system for prompt-driven scene planning, retrieval, orchestration, and automated production pipelines.
End-to-end NLP and MLOps pipeline with experiment tracking, monitoring, artifact versioning, and production telemetry for model visibility.
Rebuilt the core transformer architecture to study attention mechanics, encoder-decoder flow, and training behavior from scratch.
Implemented the original convolutional pipeline to understand early vision architectures, parameter sharing, and classifier construction.