Single Sign-On (SSO) is an authentication scheme. It allows a user to log in to different systems using a single ID.
> To make the raw model safe and helpful, it undergoes supervised fine-tuning and reinforcement learning.
This does not belong to the inference path. Supervised fine-tuning and reinforcement learning are two post-training approaches.
> To make the raw model safe and helpful, it undergoes supervised fine-tuning and reinforcement learning.
This does not belong to the inference path. Supervised fine-tuning and reinforcement learning are two post-training approaches.