Portrait image
Portrait image

Hi! 👋

I'm Alexander Wan, an undergraduate at UC Berkeley majoring in Computer Science. I'm broadly interested in machine learning and NLP, particularly in building better model evaluations. I'm currently doing research at Stanford CRFM. I formerly worked on LLM robustness & security at the Berkeley NLP group, and interned at the MSU Heterogeneous Learning and Reasoning lab.

See my: LinkedIn / Github / Google Scholar / Twitter


News

Nov 2023

I gave a talk at USC ISI's Natural Language seminar on the manipulation of LLMs through data.

Apr 2023

Our paper on poisoning instruction-tuned models was accepted to ICML.


Publications

The 2025 Foundation Model Transparency Index

Alexander Wan*, Kevin Klyman*, Sayash Kapoor*, Nestor Maslej, Shayne Longpre, Betty Xiong, Percy Liang, Rishi Bommasani*

Website / arXiv

The California Report on Frontier AI Policy

Rishi Bommasani, Scott R. Singer, ..., Alexander Wan, ..., Li Fei-Fei

Website / arXiv / SB53

Stanford Response to the US AI Safety Institute Request for Comment on Misuse of Dual-Use Foundation Models

Rishi Bommasani, Alexander Wan, Yifan Mai, Percy Liang, Daniel E. Ho

Website

What Evidence Do Language Models Find Convincing?

Alexander Wan, Eric Wallace, Dan Klein

ACL 2024 (Main)

arXiv

Poisoning Language Models During Instruction Tuning

Alexander Wan*, Eric Wallace*, Sheng Shen, Dan Klein

ICML 2023

arXiv

GLUECons: A Generic Benchmark for Learning Under Constraints

Hossein Rajaby Faghihi, Aliakbar Nafar, Chen Zheng, Roshanak Mirzaee, Yue Zhang, Andrzej Uszok, Alexander Wan, Tanawan Premsri, Dan Roth, Parisa Kordjamshidi

AAAI 2023

arXiv


Projects

DIY infini-gram

  • Prior to the official code-release, built an open-source implementation of infini-gram, a method for analyzing and integrating LLMs with extremely large text corpora using suffix arrays. (May 2024)
  • Built upon this work by replacing the suffix array with an FM-index and wavelet trees to reduce disk-usage by 7.5x. (June 2024)

Github (reproduction) / Github (with FM-Index)


Miscellaneous


Contact

Email: first 4 letters of first name + last name [at] berkeley [dot] edu