profile.jpg
Arnas Uselis email github twitter scholar

About me

I’m a PhD student at the University of Tübingen and IMPRS-IS, advised by Seong Joon Oh at the STAI group. I study vision-language models, with a focus on compositional generalization across model families (CLIP, diffusion models, VLMs). More recently, I’ve been working on autoregressive VLMs and procedural visual simulators for exposing and fixing reasoning failures. Always open to collaborations, feel free to reach out.

Research

A lot of my work has been about looking for what enables or prevents compositional understanding in vision models, and discovering why simple recipes don’t work. Intermediate-layer classifiers showed that current networks have more information than they eventually use [8], but this alone doesn’t lead to generalization. Scaling up data helps, but only by covering an exponentially large space of combinations [7]. Diffusion models perform at similar levels to CLIP-style encoders on compositional tasks [6]. On the theoretical side, I’ve studied what geometric properties representations must have for compositional reasoning [2], and on the practical side, I analyze where failures arise in representation spaces [3] [5].

More recently, I’ve moved into multi-object understanding, where models must bind properties to the correct objects, understand spatial relationships, and count [4]. We found that embedding models encode bound objects as complex units that can’t easily be decomposed, which explains why generalization to unseen combinations is so hard [].

These days, most of my effort goes into procedural visual simulators: building compositionally diverse synthetic environments that expose intermediate reasoning failures in autoregressive VLMs and provide per-step training signal you can’t get from web-scale data alone.


Publications

  1. uselis2026binding.png
    Arnas Uselis*, Darina Koishigarina*, and Seong Joon Oh
    In International Conference on Machine Learning (ICML), 2026
  2. uselis2025conditions
    Arnas Uselis, Andrea Dittadi, and Seong Joon Oh
    In International Conference on Machine Learning (ICML), 2026
  3. jeong2025diffusion
    Yujin Jeong*, Arnas Uselis*, Seong Joon Oh, and Anna Rohrbach
    In Neural Information Processing Systems (NeurIPS) - Datasets and Benchmarks Track, 2025
  4. uselis2025datascaling
    Arnas Uselis, Andrea Dittadi, and Seong Joon Oh
    In International Conference on Machine Learning (ICML), 2025
  5. uselis2025intermediatelayer
    Arnas Uselis and Seong Joon Oh
    In International Conference on Learning Representations (ICLR), 2025