Auto Seed Vl2 Today
: (1) Performance on highly structured tasks (e.g., VQA with relational reasoning) drops by 6% compared to exemplar replay. (2) The generator’s meta-update requires 5% of training data as a validation set – not always available. (3) Seed interpretability: unlike real images, seeds are opaque vectors. 8. Conclusion We presented Auto-Seed VL2, a framework for autonomous seed generation in vision-language continual learning. By synthesizing compact, cross-modal aligned seeds conditioned on task gradients, Auto-Seed VL2 eliminates the need for storing real data while achieving superior performance over replay-based methods. Our results demonstrate that synthetic embedding replay is a viable and often superior alternative to exemplar storage. Future work includes extending to online (single-pass) continual learning and exploring seed decomposition for compositional tasks. Acknowledgments [Redacted for blind review] References [1] Radford, A., et al. (2021). Learning transferable visual models from natural language supervision. ICML.
[4] Thengane, V., et al. (2023). Continual-CLIP: Fine-tuning CLIP for continual learning. CVPR Workshop. auto seed vl2
| Configuration | Avg Acc | Drop | |----------------------------------------|---------|------| | Full Auto-Seed VL2 | 82.2 | — | | w/o consistency loss (( \mathcalL \textconsist )) | 75.4 | -6.8 | | w/o gradient-conditioned generation (random seeds) | 68.9 | -13.3 | | w/o meta-update of ( G \phi ) | 74.1 | -8.1 | | w/o seed pruning (full memory) | 82.0 | -0.2 (ns) | : (1) Performance on highly structured tasks (e