This is the official PyTorch implementation of the paper "From Outcome to Process Supervision: Generative Trajectory Reasoning for Sequential Recommendation".
# Stage 1: Supervised learning
bash run_sft.sh
# Stage 2: Reinforcement learning
bash run_rl.sh
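
The two stages can also be chained so that a failure in one stops the run. The sketch below assumes that Stage 2 should start only after Stage 1 finishes successfully, which the stage numbering suggests but this README does not state. `run_stages` is a hypothetical helper, not part of the repository.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): run the given scripts
# in order and stop at the first one that exits with a non-zero status.
run_stages() {
    local script
    for script in "$@"; do
        echo "Running ${script}"
        bash "${script}" || { echo "Stage failed: ${script}" >&2; return 1; }
    done
}

# Example invocation from the repository root (assumes both scripts are there):
# run_stages run_sft.sh run_rl.sh
```

With this wrapper, `run_rl.sh` is not started if `run_sft.sh` fails, so the reinforcement learning stage does not begin after a failed supervised stage.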