Pad00fix
This fix makes it possible to do padding=(0,0) with conv2d of size 3x3, 3x1 and 1x3 both for grouped and non-groped convolutions.
The fix passes all the tests in check.sh. Six more test vectors have also been added for 3x3, 3x1 and 1x3 both grouped and non-grouped.