AI models do their best to represent reality. When you give it custom instructions to make it lie, it has to construct new reality to make everything else it knows not contradictory. In order for the model to still give answers, it needs to ignore the parts that are contradictions, and the downstream false positives from said contradictions: leaving you with a less intelligent model