Hi, thanks for your great work. I'm trying to reproduce your model, but somehow my model performance always deteriorates around 1 point of HR@10. The code is actually not that complex, where my preprocessing logic and model architecture follows your paper exactly. My env is tf2.0, any idea why this happens?