The technical goal of their project was clear: achieve accurate transcription of menu photos into structured menu data while keeping latency and cost low enough for production at scale.
Thanks to the DoorDash team for sharing their experience with the community, and thank you for expanding on it for us. Two questions came to mind after reading:
1- How much does the ratio of human supervision decrease once the pipeline is in production? And what has been the business outcome of developing this pipeline?
2- I wondered whether the two models could be connected in tandem (i.e. Model 2 -> Model 1) to mitigate each other's errors, rather than working in parallel. Would this improve the overall number of transcriptions validated by the guardrail model?
An API call to a Vision Language Model would do the task.
Am I missing something?
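To make the tandem question concrete, here is a minimal sketch of one reading of it: run each transcription through the guardrail check and retry the transcription model only on rejected results, instead of running the two models independently in parallel. All function names below are hypothetical stand-ins, not DoorDash's actual APIs.

```python
def transcribe(image: str) -> str:
    """Hypothetical stand-in for the transcription model
    (e.g. an API call to a Vision Language Model)."""
    return f"structured menu data from {image}"


def guardrail_ok(transcription: str) -> bool:
    """Hypothetical stand-in for the guardrail model's validation."""
    return "menu" in transcription


def tandem_pipeline(images: list[str]) -> list[str]:
    """Chain the models (guardrail feeding back into transcription):
    only transcriptions that fail validation get a second pass."""
    validated = []
    for image in images:
        text = transcribe(image)
        if not guardrail_ok(text):
            # Retry once; in practice the retry could be conditioned
            # on the guardrail's rejection reason.
            text = transcribe(image)
        if guardrail_ok(text):
            validated.append(text)
    return validated


print(tandem_pipeline(["photo1.jpg", "photo2.jpg"]))
```

The trade-off, of course, is latency: the tandem retry adds a sequential round trip for rejected items, whereas the parallel setup keeps latency flat.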
Thank you!