Speedup of conv2d_1x1_s_dis for ch=112, 176
conv2d_1x1_s_dis selected generic function when the number of channels equals 112 or 176, even though divisible by 16. This fix gives approximately 5% DecT speedup for JVET-AJ0054.
Fix passes all the test vectors tried when running ./check.sh.