random-forest gpt Attention-UNet
Random-forest-based ONNX implementation for dataset policy.
- Input
  - 3025-dim embedding
- Encoder
  - 124 x Attention-UNet with 58 heads
- Output
  - mAP projection
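
The layout above can be written down as a small typed record. This is only a sketch: the class name `ModelSpec`, its field names, and the `leftover_dims` helper are hypothetical and not part of this project, and reading "58 heads" as heads per block is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    """Hypothetical container for the layout listed above."""

    embedding_dim: int = 3025        # Input: 3025-dim embedding
    num_blocks: int = 124            # Encoder: 124 x Attention-UNet
    num_heads: int = 58              # heads (assumed to be per block)
    output: str = "mAP projection"   # Output

    def leftover_dims(self) -> int:
        """Dimensions left over if embedding_dim were split evenly across heads."""
        return self.embedding_dim % self.num_heads


spec = ModelSpec()
print(spec.leftover_dims())
```

With the listed values `leftover_dims()` is nonzero, so an even per-head split of the raw 3025-dim embedding would not work. Whether the blocks first project to a different width is not stated here.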
Training config
optimizer=SGD, lr=0.645, scheduler=exponential, warmup=1260
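
A minimal sketch of how the schedule in the config line might behave, in plain Python. Several things here are assumptions: that `warmup=1260` counts optimizer steps, that warmup is linear, and the decay factor `GAMMA`, which the config does not give. The function name `lr_at` is hypothetical.

```python
BASE_LR = 0.645   # lr from the config line
WARMUP = 1260     # warmup from the config line; unit assumed to be steps
GAMMA = 0.999     # hypothetical decay factor, not given in the config


def lr_at(step: int, base_lr: float = BASE_LR,
          warmup: int = WARMUP, gamma: float = GAMMA) -> float:
    """Learning rate at a 0-indexed step: linear warmup, then exponential decay."""
    if step < 0:
        raise ValueError("step must be non-negative")
    if step < warmup:
        # Assumed linear ramp that reaches base_lr on the last warmup step.
        return base_lr * (step + 1) / warmup
    # Exponential decay, counted from the end of warmup.
    return base_lr * gamma ** (step - warmup)


for s in (0, 629, 1259, 1260, 2260):
    print(s, lr_at(s))
```

The ramp and the decay meet at `base_lr`, so the schedule has no jump at step 1260. Swap in the real decay factor and warmup unit once they are known.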