I thought GPT-5 and the other models used in ChatGPT were decoder-only models, but I see the diagram has an encoder/decoder. Also, during inference, tokenization, prefill, and decoding are the three main phases. It would have been great if the blog had covered those concepts. I got the impression that the steps described here may not be exactly what ChatGPT does when a request is sent to it, but I would be happy to be corrected if my observation is wrong.
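To make the three phases I mean concrete, here is a minimal toy sketch in Python. Everything in it is an assumption made up for illustration: the six-word vocabulary, the whitespace tokenizer, and the `toy_next_token` rule are hypothetical stand-ins, not how ChatGPT or any real model works. It only shows the control flow of tokenize, then prefill, then decode.

```python
from typing import List, Tuple

# Hypothetical toy vocabulary; real systems use subword tokenizers (e.g. BPE).
VOCAB = ["<eos>", "hello", "world", "how", "are", "you"]
TOKEN_TO_ID = {tok: i for i, tok in enumerate(VOCAB)}
EOS_ID = TOKEN_TO_ID["<eos>"]


def tokenize(text: str) -> List[int]:
    """Phase 1: turn text into token ids (whitespace split, toy only)."""
    return [TOKEN_TO_ID[word] for word in text.lower().split()]


def toy_next_token(cache: List[int]) -> int:
    """Stand-in for a decoder-only model: next id = (last id + 1) mod vocab size."""
    return (cache[-1] + 1) % len(VOCAB)


def prefill(prompt_ids: List[int]) -> Tuple[List[int], int]:
    """Phase 2: process the whole prompt in one pass.

    The list copy stands in for the key/value cache a real model would build.
    Returns the cache and the first generated token.
    """
    cache = list(prompt_ids)
    return cache, toy_next_token(cache)


def decode(cache: List[int], first_token: int, max_new_tokens: int) -> List[int]:
    """Phase 3: emit one token per step, appending each to the cache."""
    output: List[int] = []
    token = first_token
    while len(output) < max_new_tokens and token != EOS_ID:
        output.append(token)
        cache.append(token)
        token = toy_next_token(cache)
    return output


def generate(text: str, max_new_tokens: int = 8) -> str:
    """Run all three phases and map the new token ids back to words."""
    prompt_ids = tokenize(text)
    cache, first_token = prefill(prompt_ids)
    new_ids = decode(cache, first_token, max_new_tokens)
    return " ".join(VOCAB[i] for i in new_ids)


if __name__ == "__main__":
    print(generate("hello world"))
```

The point of the split is that prefill handles all prompt tokens at once, while decode runs step by step and reuses the cache instead of reprocessing the prompt each time.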
excellent!!
thanks