Mitigating Safety Issues in Pre-trained Language Models: A Model-Centric Approach Leveraging Interpretation Methods