Discussion about this post

User's avatar
Neural Foundry's avatar

Fantastic writeup on Lyft's inference stack. The per-team microservice approach really nails the ownership boundaries issue. I'm curious how they handle cross-model feature sharing tho, like if multiple teams need the same preprocessd embeddings. In my expereince, duplicating that work across services kills efficiency fast, but centralized feature stores add coordination overhead.

No posts

Ready for more?