Alex Tamkin

Email: atamkin_cs_stanford_edu | Research Updates: @alextamkin

I am a researcher at Anthropic. I'm interested in how we can understand and improve the societal impacts of AI systems. Some recent work in this vein:

Tracking the economic impacts of AI systems (Anthropic Economic Index)
New tools for understanding AI's broader societal impacts (Clio)
New interfaces for human-AI collaboration (Claude Artifacts)
New interpretability methods to understand and steer models (Codebook Features)
New ways for AI systems to understand what people want (Eliciting human preferences with language models)

Previously, I completed my PhD in Computer Science at Stanford, where I was advised by Noah Goodman and part of the Stanford AI Lab and Stanford NLP Group.

Selected Work

Anthropic Education Report: How University Students Use Claude

Kunal Handa*, Drew Bent*, Alex Tamkin, Miles McCain, Esin Durmus, Michael Stern, Mike Schiraldi, Saffron Huang, Stuart Ritchie, Steven Syverud, Kamya Jagadish, Margaret Vo, Matt Bell, Deep Ganguli

Which Economic Tasks are Performed with AI? Evidence from Millions of Claude Conversations [📝blogpost] [📝second report]

Kunal Handa*, Alex Tamkin*, Miles McCain, Saffron Huang, Esin Durmus, Sarah Heck, Jared Mueller, Jerry Hong, Stuart Ritchie, Tim Belonax, Kevin K. Troy, Dario Amodei, Jared Kaplan, Jack Clark, Deep Ganguli
Preprint
Press: [Washington Post] [Axios] [Platformer] [Boston Globe] [Forbes] [VentureBeat] [The Verge] [The Register]
Public Talks: [American Enterprise Institute] [SXSW] [UChicago] [IMF & World Bank]

Clio: Privacy-preserving Insights into Real-world AI Use [📝blogpost]

Alex Tamkin*, Miles McCain*, Kunal Handa, Esin Durmus, Liane Lovitt, Ankur Rathi, Saffron Huang, Alfred Mountfield, Jerry Hong, Stuart Ritchie, Michael Stern, Brian Clarke, Landon Goldberg, Theodore R. Sumers, Jared Mueller, William McEachen, Wes Mitchell, Shan Carter, Jack Clark, Jared Kaplan, Deep Ganguli
Preprint
Press: [Platformer] [Axios] [TechCrunch]

Collective Constitutional AI: Aligning a Language Model with Public Input [📝blogpost]

Saffron Huang, Divya Siddarth, Liane Lovitt, Thomas I. Liao, Esin Durmus, Alex Tamkin, Deep Ganguli
FAccT 2024
Press: [New York Times] [Time Magazine] [Business Insider]

Evaluating and Mitigating Discrimination in Language Model Decisions [🐦thread]

Alex Tamkin, Amanda Askell, Liane Lovitt, Esin Durmus, Nicholas Joseph, Shauna Kravec, Karina Nguyen, Jared Kaplan, Deep Ganguli
NeurIPS 2024 Workshop on Algorithmic Fairness through the Lens of Metrics and Evaluation - Spotlight Talk
Press: [VentureBeat] [TechCrunch]

Eliciting Human Preferences with Language Models [🐦thread]

Belinda Z. Li*, Alex Tamkin*, Noah D. Goodman, Jacob Andreas
ArXiv Preprint
Press: [VentureBeat]

Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet [📝blogpost]

Adly Templeton*, Tom Conerly*, Jonathan Marcus, Jack Lindsey, Trenton Bricken, Brian Chen, Adam Pearce, Craig Citro, Emmanuel Ameisen, Andy Jones, Hoagy Cunningham, Nicholas L Turner, Callum McDougall, Monte MacDiarmid, Alex Tamkin, Esin Durmus, Tristan Hume, Francesco Mosconi, C. Daniel Freeman, Theodore R. Sumers, Edward Rees, Joshua Batson, Adam Jermyn, Shan Carter, Chris Olah, Tom Henighan
Press: [New York Times] [WIRED] [TIME]

Codebook Features: Sparse and Discrete Interpretability for Neural Networks [🐦thread] [📝blogpost]

Alex Tamkin, Mohammad Taufeeque, Noah D. Goodman
ICML 2024

Towards Measuring the Representation of Subjective Global Opinions in Language Models

Esin Durmus, Karina Nguyen, Thomas I. Liao, Nicholas Schiefer, Amanda Askell, Anton Bakhtin, Carol Chen, Zac Hatfield-Dodds, Danny Hernandez, Nicholas Joseph, Liane Lovitt, Sam McCandlish, Orowa Sikder, Alex Tamkin, Janel Thamkul, Jared Kaplan, Jack Clark, Deep Ganguli
COLM 2024

Towards Monosemanticity: Decomposing Language Models With Dictionary Learning

Trenton Bricken*, Adly Templeton*, Joshua Batson*, Brian Chen*, Adam Jermyn*, Tom Conerly, Nicholas L Turner, Cem Anil, Carson Denison, Amanda Askell, Robert Lasenby, Yifan Wu, Shauna Kravec, Nicholas Schiefer, Tim Maxwell, Nicholas Joseph, Alex Tamkin, Karina Nguyen, Brayden McLean, Josiah E Burke, Tristan Hume, Shan Carter, Tom Henighan, Chris Olah
Preprint

Feature Dropout: Revisiting the Role of Augmentations in Contrastive Learning

Alex Tamkin, Margalit Glasgow, Xiluo He, Noah Goodman
NeurIPS 2023

Task Ambiguity in Humans and Language Models

Alex Tamkin*, Kunal Handa*, Avash Shrestha, Noah Goodman
ICLR 2023

Oolong: Investigating What Makes Crosslingual Transfer Hard with Controlled Studies [🐦thread]

Zhengxuan Wu*, Isabel Papadimitriou*, Alex Tamkin*
EMNLP 2023

DABS 2.0: Improved Datasets and Algorithms for Universal Self-Supervision [🐦thread]

Alex Tamkin, Gaurab Banerjee, Mohamed Owda, Vincent Liu, Shashank Rammoorthy, Noah Goodman
NeurIPS 2022

Active Learning Helps Pretrained Models Learn the Intended Task [🐦thread]

Alex Tamkin*, Dat Nguyen*, Salil Deshpande*, Jesse Mu, Noah Goodman
NeurIPS 2022

DABS: A Domain-Agnostic Benchmark for Self-Supervised Learning [🌐site] [🐦thread]

Alex Tamkin, Vincent Liu, Rongfei Lu, Daniel Fein, Colin Schultz, Noah Goodman
NeurIPS 2021
Press: [Redshift Magazine] [AIM Magazine] [Stanford HAI]

C5T5: Controllable Generation of Organic Molecules with Transformers

Daniel Rothchild, Alex Tamkin, Julie Yu, Ujval Misra, Joseph Gonzalez
ArXiv Preprint

On the Opportunities and Risks of Foundation Models

Center for Research on Foundation Models (full list of authors)
– Section 4.2: Training and Self-Supervision, Alex Tamkin
– Section 4.9: AI Safety and Alignment, Alex Tamkin, Geoff Keeling, Jack Ryan, Sydney von Arx
– Coauthor: Sections §2.2: Vision, §3.3: Education, §4.1: Modeling, §5.6: Ethics of Scale
Press: [Forbes] [The Economist] [VentureBeat]

Viewmaker Networks: Learning Views for Unsupervised Representation Learning [📝blogpost] [🐦thread]

Alex Tamkin, Mike Wu, Noah Goodman
ICLR 2021

Understanding the Capabilities, Limitations, and Societal Impact of Large Language Models [📝blogpost]

Alex Tamkin*, Miles Brundage*, Jack Clark, Deep Ganguli
ArXiv Preprint
Press: [WIRED] [VentureBeat] [Datanami] [Slator]

Language Through a Prism: A Spectral Approach for Multiscale Language Representations [🐦thread] [📝blogpost]

Alex Tamkin, Dan Jurafsky, Noah Goodman
NeurIPS 2020

Investigating Transferability in Pretrained Language Models [🐦thread]

Alex Tamkin, Trisha Singh, Davide Giovanardi, Noah Goodman
Findings of EMNLP 2020; Presented at CoNLL 2020

Distributionally-Aware Exploration for CVaR Bandits

Alex Tamkin, Ramtin Keramati, Christoph Dann, Emma Brunskill
NeurIPS 2019 Workshop on Safety and Robustness in Decision Making; RLDM 2019

Personal

I also like making art, especially ceramics and photography!
