Amir Feder

I'm an assistant professor of Computer Science at the Hebrew University and a research scientist at Google.

I work on language models and causal inference, often for applications in computational social science. My research develops methods that integrate causality into language models, and facilitate scientific inquiry with text data. I am also interested in understanding human behavior encoded in language models.

I recently moved from Columbia where I was a postdoctoral fellow in their Data Science Institute and Digital Futures Initiative, working with David Blei. I received my PhD from the Technion, where I worked with Roi Reichart and Uri Shalit. Previously, I was an history, economics and statistics student at Tel Aviv University, the Hebrew University and Northwestern University. I was a co-organizer of the First workshop on NLP and Causal Inference (CI+NLP) at EMNLP 2021, and the tutorial on Causality for NLP at EMNLP 2022.

email: afeder at cs dot huji dot ac dot il

[google scholar] [semantic scholar] [dblp] [github]

Students

Itamar Trainin (Hebrew U, co-advised by Omri Abend)

Zach Bamberger (Technion, co-advised by Ofra Amir)

Shimon Nowik (Hebrew U, co advised by Yaron Lipman)

Alumni

Yuval Shalev (Hebrew U, co-advised by Ariel Goldstein)

Selected Publications

(*=equal contribution)

Evaluating the Moral Beliefs Encoded in LLMs

Nino Scherrer*, Claudia Shi*, Amir Feder, David Blei

Neural information processing systems (NeurIPS) 2023 (Spotlight) [arxiv] [press]

Data Augmentations for Improved (Large) Language Model Generalization

Amir Feder*, Yoav Wald*, Claudia Shi, Suchi Saria, David Blei

Neural information processing systems (NeurIPS) 2023 [arxiv] [pdf]

On Calibration and Out-of-domain Generalization

Yoav Wald*, Amir Feder*, Daniel Greenfeld, Uri Shalit

Neural information processing systems (NeurIPS) 2021 [arxiv] [pdf]

CausaLM: Causal model explanation through counterfactual language models

Amir Feder, Nadav Oved, Uri Shalit, Roi Reichart

Computational Linguistics (CL), 2021 [arxiv] [code] [data] [pdf]

Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond

Amir Feder*, Katherine A Keith*, Emaad Manzoor*, Reid Pryzant*, Dhanya Sridhar*, Zach Wood-Doughty*, Jacob Eisenstein*, Justin Grimmer*, Roi Reichart*, Margaret E Roberts*, Brandon M Stewart*, Victor Veitch*, Diyi Yang*

Transactions of the Association for Computational Linguistics (TACL), 2022 [arxiv] [reading list]

Tutorial on Causal Inference for NLP

Amir Feder*, Zhijing Jin*, Kun Zhang*

Empirical Methods in Natural Language Processing (EMNLP) 2022 [youtube] [slides (part 1), slides (part 2)]

Here's a Venn diagram describing my research (publication #s below):

All Publications

(*=equal contribution, LM=Language Models, CI=Causal Inference, CSS= Computational Social Science)

[CSS] The impact of DDoS and other security shocks on Bitcoin currency exchanges: Evidence from Mt. Gox

Amir Feder, Neil Gandal, JT Hamrick, Tyler Moore

Journal of Cybersecurity, 2018 [pdf]

[CSS] The rise and fall of cryptocurrencies

Amir Feder, Neil Gandal, JT Hamrick, Tyler Moore, Marie Vasek

Workshop on the Economics of Information Security (WEIS) 2018 [pdf]

[CSS] An examination of the cryptocurrency pump-and-dump ecosystem

JT Hamrick, Farhang Rouhi, Arghya Mukherjee, Amir Feder, Neil Gandal, Tyler Moore, Marie Vasek

Information Processing & Management, 2019 [pdf]

[LM] Active deep learning to detect demographic traits in free-form clinical notes

Amir Feder, Danny Vainstein, Roni Rosenfeld, Tzvika Hartman, Avinatan Hassidim, Yossi Matias

Journal of Biomedical Informatics (JBI), 2020 [data] [pdf]

[CSS, LM] Predicting In-game Actions from Interviews of NBA Players

Amir Feder*, Nadav Oved*, Roi Reichart

Computational Linguistics (CL), 2020 [arxiv] [code] [pdf]

[LM, CI] Are VQA systems RAD? measuring robustness to augmented data with focused interventions

Daniel Rosenberg, Itai Gat, Amir Feder, Roi Reichart

Association for Computational Linguistics (ACL) 2021 [arxiv] [code] [pdf]

[LM, CI] CausaLM: Causal model explanation through counterfactual language models

Amir Feder, Nadav Oved, Uri Shalit, Roi Reichart

Computational Linguistics (CL), 2021 [arxiv] [code] [data] [pdf]

[LM] Learning and evaluating a differentially private pre-trained language model

Shlomo Hoory, Amir Feder, Avichai Tendler, Sofia Erell, Alon Peled-Cohen, Itay Laish, Hootan Nakhost, Uri Stemmer, Ayelet Benjamini, Avinatan Hassidim, Yossi Matias

Findings of the Association for Computational Linguistics: EMNLP 2021 [pdf]

[LM, CI] Model compression for domain adaptation through causal effect estimation

Guy Rotman*, Amir Feder*, Roi Reichart

Transactions of the Association for Computational Linguistics (TACL), 2021 [arxiv] [code] [pdf]

[LM, CI] On Calibration and Out-of-domain Generalization

Yoav Wald*, Amir Feder*, Daniel Greenfeld, Uri Shalit

Neural information processing systems (NeurIPS) 2021 [arxiv] [pdf]

[LM] Structured Understanding of Assessment and Plans in Clinical Documentation

Doron Stupp, Ronnie Barequet, I-Ching Lee, Eyal Oren, Amir Feder, Ayelet Benjamini, Avinatan Hassidim, Yossi Matias, Eran Ofek, Alvin Rajkomar

Machine Learning for Healthcare (ML4H) 2022 [arxiv]

[CSS, LM] Shared computational principles for language processing in humans and deep language models

Ariel Goldstein, Zaid Zada*, Eliav Buchnik*, Mariano Schain*, Amy Price*, Bobbi Aubrey*, Samuel A Nastase*, Amir Feder*, Dotan Emanuel*, Alon Cohen*, Aren Jansen, Harshvardhan Gazula, Gina Choe, Aditi Rao, Catherine Kim, Colton Casto, Lora Fanda, Werner Doyle, Daniel Friedman, Patricia Dugan, Lucia Melloni, Roi Reichart, Sasha Devore, Adeen Flinker, Liat Hasenfratz, Omer Levy, Avinatan Hassidim, Michael Brenner, Yossi Matias, Kenneth A Norman, Orrin Devinsky, Uri Hasson

Nature Neuroscience 2022 [arxiv] [pdf]

[LM, CI] DoCoGen: Domain Counterfactual Generation for Low Resource Domain Adaptation

Nitay Calderon, Eyal Ben-David, Amir Feder, Roi Reichart

Association for Computational Linguistics (ACL) 2022 [arxiv] [code] [pdf]

[LM, CSS, CI] Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond

Transactions of the Association for Computational Linguistics (TACL), 2022 [arxiv] [reading list]

[LM] Useful Confidence Measures: Beyond the Max Score

Gal Yona, Amir Feder, Itay Laish

NeurIPS 2022 Workshop on Distribution Shifts: Connecting Methods and Applications [arxiv]

[LM] Section Classification in Clinical Notes with Multi-task Transformers

Fan Zhang, Itay Laish, Ayelet Benjamini, Amir Feder

Proceedings of the 13th International Conference on Health Text Mining and Information Analysis (LOUHI), 2022 [pdf]

[LM] Building a Clinically-Focused Problem List From Medical Notes

Amir Feder, Itay Laish, Shashank Agarwal, Uri Lerner, Aviel Atias, Cathy Cheung, Peter Clardy, Alon Peled-Cohen, Rachana Fellinger, Hengrui Liu, Lan Huong Nguyen, Birju Patel, Natan Potikha, Amir Taubenfeld, Liwen Xu, Seung Doo Yang, Ayelet Benjamini, Avinatan Hassidim

Proceedings of the 13th International Conference on Health Text Mining and Information Analysis (LOUHI), 2022 [pdf]

[LM, CI] CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior

Eldar David Abraham*, Karel D'Oosterlinck*, Amir Feder*, Yair Ori Gat*, Atticus Geiger*, Christopher Potts*, Roi Reichart*, Zhengxuan Wu*

Neural information processing systems (NeurIPS) 2022 [arxiv] [data]v

[CSS, LM, CI] In the Eye of the Beholder: Robust Prediction with Causal User Modeling

Amir Feder, Guy Horowitz, Yoav Wald, Roi Reichart, and Nir Rosenfeld

Neural information processing systems (NeurIPS) 2022 [arxiv]

[LM, CI] An Invariant Learning Characterization of Controlled Text Generation

Claudia Shi*, Carolina Zheng*, Keyon Vafa, Amir Feder, David Blei

Association for Computational Linguistics (ACL) 2023 [Spotlight at NeurIPS 2022 Workshop on Robustness in Sequence Modeling]

[CSS, LM] Evaluating the Moral Beliefs Encoded in LLMs

Nino Scherrer*, Claudia Shi*, Amir Feder, David Blei

Neural information processing systems (NeurIPS) 2023 (Spotlight) [arxiv] [press]

[LM, CI] Data Augmentations for Improved (Large) Language Model Generalization

Amir Feder*, Yoav Wald*, Claudia Shi, Suchi Saria, David Blei

Neural information processing systems (NeurIPS) 2023 [arxiv]

[LM] LLMs Accelerate Annotation for Medical Information Extraction

Akshay Goel*, Almog Gueta*, Omry Gilon, Sofia Erell, Chang Liu, Lan Huong Nguyen, Xiaohong Hao, Bolous Jaber, Shashir Reddy, Jean Steiner, Itay Laish, Amir Feder

Proceedings of the Machine Learning for Health Symposium (ML4H), 2023

[LM, CI] Faithful Explanations of Black-box NLP Models Using LLM-generated Counterfactuals

Yair Gat*, Nitay Calderon*, Amir Feder, Alexander Chapanin, Amit Sharma, Roi Reichart

Proceedings of the Twelfth International Conference on Learning Representations (ICLR), 2024 [arxiv] [pdf]

[CSS, LM] Alignment of brain embeddings and artificial contextual embeddings in natural language points to common geometric patterns

Ariel Goldstein, Avigail Dabush, Bobbi Aubrey, Mariano Schain, Samuel A. Nastase, Zaid Zada, Eric Ham, Zhuoqiao Hong, Amir Feder, Harshvardhan Gazula, Eliav Buchnik, Werner Doyle, Sasha Devore, Patricia Dugan, Daniel Friedman, Michael Brenner, Avinatan Hassidim, Orrin Devinsky, Adeen Flinker, Uri Hasson

Nature Communications, 2024 [arxiv] [pdf]

[LM] Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?

Zorik Gekhman, Gal Yona, Roee Aharoni, Matan Eyal, Amir Feder, Roi Reichart, Jonathan Herzig

Empirical Methods in Natural Language Processing (EMNLP) 2024 [arxiv]

[LM] Exploring the Learning Capabilities of Language Models using LEVERWORLDS
Eitan Wagner, Amir Feder, Omri Abend
Empirical Methods in Natural Language Processing (EMNLP) 2024
[CSS, LM] Can LLMs Learn Macroeconomic Narratives from Social Media?

Almog Gueta, Amir Feder, Zorik Gekhman, Ariel Goldstein, Roi Reichart

Findings of the Association for Computational Linguistics: NAACL 2025 [arxiv] [pdf]

[LM] Confidence Improves Self-Consistency in LLMs

Amir Taubenfeld, Tom Sheffer, Eran Ofek, Amir Feder, Ariel Goldstein, Zorik Gekhman, Gal Yona

Findings of the Association for Computational Linguistics: ACL 2025 [arxiv]

Preprints

Distributional reasoning in LLMs: Parallel reasoning processes in multi-hop reasoning

Yuval Shalev, Amir Feder, Ariel Goldstein [arxiv]

Explaining Classifiers with Causal Concept Effect (CaCE)

Yash Goyal, Amir Feder, Uri Shalit, Been Kim

2019 [arxiv]

Teaching

Lecturer

Technion - Israel Institute of Technology
- [097215] Natural Language Processing (2021-2022)

Teaching Assistant
- Technion - Israel Institute of Technology
  - [236756] Foundations of Machine Learning (2020-2021)
- Tel Aviv University
  - Econometrics and Causal Analysis (2015-2016)
  - Monetary Economics (2015-2016)
  - The Macroeconomic Environment, Kellogg-Recanati International MBA (2015-2016)

Tutorials

Tutorial on Causal Inference for NLP, EMNLP 2022

With Kun Zhang, Zhijing Jin [youtube] [slides (part 1), slides (part 2)]

Workshops

First Workshop on Causal Inference & NLP (CI+NLP), EMNLP 2021

With Katherine Keith, Emaad Manzoor, Reid Pryzant, Dhanya Sridhar, Zach Wood-Doughty,
Jacob Eisenstein, Justin Grimmer, Roi Reichart, Molly Roberts, Uri Shalit, Brandon Stewart, Victor Veitch, Diyi Yang [url]

Spurious Correlations, Invariance and Stability (SCIS), ICML 2023

With Yoav Wald, Claudia Shi, Aahlad Puli, Limor Gultchin, Mark Goldstein, Maggie Makar, Victor Veitch, Uri Shalit [url]

NLP for Science (NLP4S), EMNLP 2024

With Nitay Calderon, Alex Chapanin, Rotem Dror, Ariel Goldstein, Anna Korhonen, Shir Lissak, Yaakov Ophir, Roi Reichart, Ilanit Sobol, Refael Tikochinski, Mor Ventura

Page updated

Google Sites

Report abuse