Multimodal AI at Work: Document Processing Has Scaled, Video and Vision Still Piloting

6 min readDocument AI and audio transcription are in production at scale. Video understanding and open-ended visual reasoning are still in pilot. A modality-by-modality breakdown of where multimodal AI has earned its place in enterprise workflows — and where reliability gaps are keeping CFOs from removing human review.

The Definitional Gap: Why the Industry Can’t Agree on What “Production” Means for AI Agents

4 min readThe AI industry has no consensus definition of ‘production’ for AI agents. Most enterprise deployments labelled ‘production’ are narrow automations with human oversight, not autonomous multi-step systems. The definitional gap creates real risk: procurement decisions, SLA commitments, and compliance claims are built on undefined terms.