MetNet 3

Paper | Code

  • Temporal resolution: 2 minutes
  • Spatial resolution: 1 km

The network is a U-Net in conjunction with a MaxVit architecture

  • Topographical embedding: automatically embeds time-indipendent variables (4km tokens) for 20 parameters
  • U-Net: based on a fully convolutional neural network whose architecture was modified and extended to work with fewer training images and to yield more precise segmentation
  • MaxVit: hybrid (CNN + ViT) image classification models.

Uses parameter oriented training for lead time (0 - 24 hours). Masks out with 25% probability a block of data.