Discussion about this post

Mitchell Kosowski

The "fusion is the hard part" takeaway lands, and it holds no matter where you go. The specialist models are increasingly commoditized: anyone can stand up character recognition, scene classification, and ASR. Stitching their disparate outputs into a shared temporal index is where the real systems work still lives.
