8.1 Experiments on Image Benchmark datasets
For the PR-Reducing flows, the final scale ratio between preserved vs. shrunken dimensions for finite integration times is dependent on the quantity \( e^{\rho(g_* -g_i)T} \). Therefore, for fixed end integration time \( T \) and rate \( \rho \), this scaling is dictated by \( g_* - g_i \), which we call the “inflation gap” (IG, Appendix B.2). As this inflation gap increases, compressed dimensions are shrunken to a greater extent, and the denoising networks are required to amortize score estimation over wider noise scales, a harder learning problem. Therefore, for our proposed model, compression should be understood both in terms of the number of dimensions being preserved and the size of this inflation gap.
To assess how these two factors affect model performance, we performed two sets of experiments on two benchmark image datasets (CIFAR-10 Krizhevsky, 2009 and AFHQv2 Choi et al., 2020; Appendix B.4.2). In the first set of experiments, we fixed \( T \), \( \rho \), and the inflation gap (\( \text{IG} = 1.02 \)) while varying only the number of preserved dimensions \( d \) between \( d=1 \) (compression to \( \approx 0.03\% \)) and \( d=3072 \) (no compression) for both datasets (see Tables 1, 2 below - values represent mean \( \pm 2 \sigma \) over 3 sets of seeds, each with either 50K samples for FID scores, or 10K samples, for round-trip MSE).
1 | 12.65 ± 0.07 | 1.47 ± 0.07 |
2 | 11.95± 0.06 | 1.55± 0.21 |
30 | 13.64± 0.02 | 3.79± 0.13 |
62 | 14.05± 0.18 | 5.32± 0.18 |
307 | 15.64± 0.10 | 3.33± 0.13 |
615 | 14.63± 0.07 | 2.42± 0.18 |
1536 | 13.36± 0.12 | 0.14± 0.03 |
3041 | 13.97± 0.13 | 0.28± 0.06 |
3072 | 11.90± 0.08 | 0.38± 0.04 |
1 | 20.76 ± 0.09 | 1.07 ± 0.10 |
2 | 21.29± 0.04 | 0.81± 0.11 |
30 | 23.36± 0.14 | 2.21± 0.08 |
62 | 23.30± 0.19 | 2.27± 0.24 |
307 | 28.07± 0.13 | 0.71± 0.02 |
615 | 24.49± 0.27 | 0.29± 0.03 |
1536 | 17.44± 0.16 | 0.16± 0.06 |
3041 | 16.60± 0.05 | 0.30± 0.02 |
3072 | 17.01± 0.10 | 0.22± 0.03 |
For the second set of experiments, we worked with the AFHQv2 dataset and fixed \( T \), \( \rho \), and \( d=2 \), while varying the inflation gap ( \( \text{IG} = 1.10, 1.25, 1.35, 1.50 \), see Table 3 below, same set up as before).
2 | 1.02 | 11.95 ± 0.06 | 1.55 ± 0.21 |
2 | 1.10 | 13.98± 0.13 | 1.35± 0.08 |
2 | 1.25 | 17.84± 0.15 | 1.65± 0.09 |
2 | 1.35 | 34.68± 0.37 | 1.19± 0.18 |
2 | 1.50 | 107.64± 0.43 | 0.11± 0.02 |
Finally, we also compared our inflationary flows (IFs) model generative performance on CIFAR-10 against three existing injective flow model baselines (Appendix B.5.2) — M-Flows (Brehmer & Cranmer, 2020), Rectangular Flows (RFs, Caterini et al., 2021), and Canonical Manifold Flows (CMFs, Flouris & Konukoglu, 2023) — for different numbers of preserved dimensions (\( d=30, 40, 62 \)). Table 4 below showcases best FID scores (out of 3 independently generated sets of images, each with 10K samples) for each such experiment. For these comparison experiments, we fixed \( \text{IG}=1.02 \) when training our networks for the different \( d \) values.
30 | 23.3 | 541.2 | 544.0 | 532.6 |
40 | 24.3 | 535.7 | 481.3 | 444.6 |
62 | 23.2 | 280.9 | 280.8 | 287.9 |
As a general trend, increasing the number of preserved dimensions at a constant inflation gap led to improvements in generative quality (lower FID scores) and reduced MSE (Tables 1,2). However, some schedules we assessed are not entirely consistent with this trend. We hypothesize this is at least partially due to variance arising from different network initializations for each schedule, as well as differences between the two datasets explored here. As expected, increasing inflation gap while maintaining the number of preserved dimensions leads to worsened generative performance (higher FID scores, Table 3). Finally, in terms of predictive calibration, our model provides substantial gains when compared to existing injective flow model baselines (Table 4).
8.2 Animations for Image Dataset Experiments
To see animations for sample generation (FID) and round-trip (MSE) experiments under select schedules, please check the links below!