Your implementation uses Batch Normalization, but the paper says the network uses Local Response Normalization.