GenTR

This is the official Pytorch implementation for the paper: "From Outcome to Process Supervision : Generative Trajectory Reasoning for Sequential Recommendation"

# Stage 1: Supervised learning
bash run_sft.sh

# Stage 2: Reinforcement learning
bash run_rl.sh

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
__pycache__		__pycache__
config		config
models		models
README.md		README.md
__init__.py		__init__.py
collator.py		collator.py
data_utils.py		data_utils.py
dataset.py		dataset.py
ensemble_results.py		ensemble_results.py
evaluator.py		evaluator.py
grad_utils.py		grad_utils.py
load_best.py		load_best.py
model.py		model.py
rl_trainer.py		rl_trainer.py
run_rl.sh		run_rl.sh
run_sft.sh		run_sft.sh
save_stf_emb.py		save_stf_emb.py
start_rl.py		start_rl.py
start_sft.py		start_sft.py
tokenizer.py		tokenizer.py
trainer.py		trainer.py
utils.py		utils.py