You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It should have model optimization techniques and model compression techniques to deploy a neural network on production to reduce latency and deployment costs at the cost of some accuracy.
The text was updated successfully, but these errors were encountered:
The AI Engineer roadmap doesn't really cover a lot of the basic topics you need to understand how to "deploy" a "neural network" in "production", there are various topics involved such as:
evaluation both offline and in production, that is partially covered by MLOps (monitoring and observability)
some backend development knowledge is also involved when you need to build production-ready stuff, and that is covered in API Design and Backend Developer
Given that the AI Engineer roadmap has a more top-to-bottom approach I think that either it gets redesigned to include the required knowledge in the first place, or just a few topics are added under a Operationalize (or something similar) card; the most important topics I can think of are more related to LLM evaluation/monitoring rather than optimization (you should know what to optimize first).
Roadmap URL
https://roadmap.sh/ai-engineer
Suggestions
It should have model optimization techniques and model compression techniques to deploy a neural network on production to reduce latency and deployment costs at the cost of some accuracy.
The text was updated successfully, but these errors were encountered: