Varun Kotte

Varun Kotte — Staff ML Engineer, Adobe

I build retrieval and LLM systems that stay reliable in production.

Reliability of RAG, retrieval, and LLM serving: calibration, conformal prediction, and knowing when a model should say it doesn't know.

6Adobe products on the platform
~7×Lightroom search click-through
50+citations, lead paper

Highlights

Selected work

LLM-serving & retrieval platform (ILUP)

A shared platform that seven product teams adopted instead of building their own. It backs AI and search features across six Adobe products.

Read more →

Lightroom semantic search

Named-entity recognition and resolution behind intent-based photo search.

Read more →

Reliable, cost-aware inference

Calibrated abstention, model cascades, and routing under load.

Read more →

Selected research

Retrieval-Augmented Generation for Domain-Specific Question Answering

AAAI · SDU
Adobe's production RAG method (co-author). Cited 50+ times by independent groups.

EVICT: evidence-sufficiency verification for visually-grounded QA

CVPR · GRAIL-V
Sole-authored. A training-free probe that catches answers not grounded in the evidence.

PromptPort: a reliability layer for cross-model structured extraction

preprint
Sole-authored. Keeps structured output valid when the same prompt behaves differently across models.
See all research →