The U-Net architecture, used primarily for image segmentation, builds its convolutional layers around one characteristic filter shape: 3x3. Here's a breakdown:

Core Concepts:

Typical U-Net Filter Shape:

Why 3x3?

Variations:

In summary: The standard U-Net architecture predominantly uses 3x3 convolutional filters, which strike a balance between capturing local features and keeping computation cheap.
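
To make the efficiency side of that balance concrete, here is a minimal sketch in plain Python that compares two stacked 3x3 convolutions with a single 5x5 convolution. The helper names and the channel width of 64 are assumptions chosen for illustration, not values taken from the linked gist. Biases are ignored and stride 1 is assumed throughout.

```python
def conv_weight_count(kernel_size: int, in_channels: int, out_channels: int) -> int:
    """Weights in one square 2D convolution, ignoring biases."""
    return kernel_size * kernel_size * in_channels * out_channels


def stacked_receptive_field(kernel_size: int, num_layers: int) -> int:
    """Receptive field width of num_layers stride-1 convolutions applied in sequence."""
    return 1 + num_layers * (kernel_size - 1)


channels = 64  # hypothetical channel width, used only for this comparison

# Two stacked 3x3 convolutions cover a 5x5 neighbourhood of the input.
two_3x3_field = stacked_receptive_field(3, 2)
two_3x3_weights = 2 * conv_weight_count(3, channels, channels)

# A single 5x5 convolution covers the same neighbourhood with more weights.
one_5x5_field = stacked_receptive_field(5, 1)
one_5x5_weights = conv_weight_count(5, channels, channels)

print("receptive field:", two_3x3_field, "vs", one_5x5_field)
print("weights:", two_3x3_weights, "vs", one_5x5_weights)
```

Under these assumptions both options see the same 5x5 window, but the stacked 3x3 pair needs 18 weights per input/output channel pair instead of 25, and it also leaves room for a nonlinearity between the two layers.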

🧠3x3 filter shape

https://gist.github.com/viadean/82f9a8c3f6a9350248e1b4a66a663ff2

Explanation: