CIFAR-10
NLL and FID results on CIFAR-10 dataset with different numbers of Euler steps. The flow matching model here, which achieves BPD 3.12 as the NLL with 1024 steps and FID 2.60 with 200 steps, is also the teacher model we use in our Shortcut-Distill. For NLL, the closer to the teacher result (3.12 BPD) the better, and for FID, the lower the better. We denote the best results in bold, the second best with underlines, the overall best results in boxes and invalid predictions in gray color.
| Method | 8 Steps | 4 Steps | 2 Steps | 1 Step | ||||
|---|---|---|---|---|---|---|---|---|
| NLL | FID | NLL | FID | NLL | FID | NLL | FID | |
| Flow Matching | -9.93 | 20.63 | -24.01 | 64.27 | -52.85 | 146.24 | -111.19 | 313.54 |
| Shortcut Model | -12.07 | 7.10 | -28.03 | 9.63 | -60.01 | 16.04 | -124.15 | 27.28 |
| Shortcut-Distill (Ours) | -11.42 | 5.01 | -26.82 | 5.41 | -57.72 | 7.13 | -119.42 | 12.75 |
| MeanFlow | -9.00 | 4.34 | -21.26 | 5.14 | -46.63 | 2.84 | -97.59 | 2.80 |
| Shortcut-F2D2 (Ours) | 3.07 | 8.78 | 3.26 | 10.21 | 2.73 | 15.58 | 0.20 | 27.35 |
| Shortcut-Distill-F2D2 (Ours) | 3.12 | 5.68 | 2.87 | 5.96 | 2.38 | 7.35 | 1.62 | 13.76 |
| MeanFlow-F2D2 (Ours) | 2.38 | 3.78 | 1.34 | 4.37 | 1.63 | 2.59 | 3.51 | 3.02 |
ImageNet 64×64
Negative log-likelihood (NLL) measured in BPD and FID results on ImageNet 64×64 dataset with different numbers of Euler steps. The flow matching model here, which achieves BPD 3.34 as the NLL with 1024 steps and FID 13.09 with 200 steps, is also the teacher model we use in our Shortcut-Distill. For NLL, the closer to the teacher result (3.34 BPD) the better, and for FID, the lower the better. We denote the best results in bold and invalid predictions in gray color.
| Method | 8 Steps | 4 Steps | 2 Steps | 1 Step | ||||
|---|---|---|---|---|---|---|---|---|
| NLL | FID | NLL | FID | NLL | FID | NLL | FID | |
| Flow Matching | -6.41 | 31.60 | -15.87 | 68.55 | -35.23 | 170.00 | -74.54 | 363.39 |
| Shortcut-Distill (Ours) | -9.03 | 19.47 | -22.30 | 21.73 | -49.01 | 28.12 | -102.07 | 42.72 |
| Shortcut-Distill-F2D2 (Ours) | 3.51 | 21.91 | 3.94 | 24.05 | 3.97 | 29.83 | 1.54 | 44.02 |
CelebA-64
Negative log-likelihood (NLL) measured in BPD and FID results on CelebA-64 dataset with different numbers of Euler steps. The flow matching model here achieves BPD 1.75 in 1024 steps and FID 2.48 in 200 steps. For NLL, closer to the flow matching estimate is better; for FID, lower is better. Best in bold; invalid in gray.
| Method | 8 Steps | 4 Steps | 2 Steps | 1 Step | ||||
|---|---|---|---|---|---|---|---|---|
| NLL | FID | NLL | FID | NLL | FID | NLL | FID | |
| Flow Matching | -6.88 | 30.60 | -16.39 | 58.14 | -36.46 | 120.65 | -77.51 | 181.23 |
| LSD | -6.78 | 3.33 | -14.89 | 4.04 | -32.72 | 6.32 | -69.83 | 12.96 |
| LSD-F2D2 (Ours) | 1.64 | 2.41 | 1.75 | 2.75 | 1.73 | 3.86 | 1.64 | 6.94 |