Optimization Beyond the Convolution: Generalizing Spatial Relationswith End-to-End Metric Learning