Dana Arad

prof_pic.jpg

I’m a CS PhD candidate at the Technion, part of the Technion CS NLP lab, advised by Yonatan Belinkov. My research aims to improve our understanding of the internal mechanisms of language and vision-language models, focusing on information flow and factuality. I am a fellow of the Ariane de Rothschild Women Doctoral Program. Previously, I interned at Amazon and eBay.

I am passionate about advancing women in STEM and research. I volunteer with QueenB, where I founded the Academy Month, a program aimed at encouraging undergraduate students and early-career graduates to pursue research. I’m also part of She-S, the CS faculty women’s organization. Additionally, I lead NLP-IL’s vision-language club.

If you find any of the above interesting, feel free to reach out!

News

Aug 04, 2025 I’ll be spending the rest of the summer visiting David Bau’s lab at Northeastern University’s Khoury College of Computer Sciences. If you’re in the area and want to chat about interpretability, feel free to reach out!

Selected Publications

  1. saes_steering.png
    SAEs Are Good for Steering - If You Select the Right Features
    Dana Arad, Aaron Mueller, and Yonatan Belinkov
    arXiv, 2025
  2. vlm_different.png
    Same Task, Different Circuits: Disentangling Modality-Specific Mechanisms in VLMs
    Yaniv Nikankin, Dana Arad, Yossi Gandelsman, and 1 more author
    arXiv, 2025
  3. ICML
    mib.png
    MIB: A Mechanistic Interpretability Benchmark
    Aaron Mueller, Atticus Geiger, Sarah Wiegreffe, and 20 more authors
    ICML, 2025
  4. ACL
    diffusion_lens.jpg
    Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines
    Michael Toker, Hadas Orgad, Mor Ventura, and 2 more authors
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2024, Bangkok, Thailand, August 11-16, 2024, 2024
  5. NAACL
    refact.jpg
    ReFACT: Updating Text-to-Image Models by Editing the Text Encoder
    Dana Arad, Hadas Orgad, and Yonatan Belinkov
    In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), NAACL 2024, Mexico City, Mexico, June 16-21, 2024, 2024