Sijun Tan (谭嗣俊)

I am a third-year CS PhD student at UC Berkeley, advised by Raluca Ada Popa. I am affiliated to Berkeley’s Sky Computing Lab.

Previously, I graduated from University of Virginia with a Bachelor’s degree in Computer Science and Mathematics, where I was advised by David Wu and Yuan Tian. I also worked with Professors Haifeng Xu and Xiaohui Bei. I interned at Facebook AI Research (FAIR) and worked at Ant Group as a senior algorithm engineer.

My current research focuses on LLM post-training and Agentic AI. I am the project lead at Agentica, and we are on a mission to democratize reinforcement learning for training language models and agents.

We are looking for highly motivated students to work with! If you are interested in research opportunities, please send me an email.

news

Apr 10, 2025	We released DeepCoder, a 14B model reaching O3 mini level on coding and math.
Feb 11, 2025	We released DeepScaleR, a 1.5B model that surpasses o1-preview by scaling RL🔥. Check out our code.
Oct 16, 2024	We are excited to release JudgeBench: a challenging benchmark to evaluate LLM-based judges. Check out our leaderboard and code.

selected publications

Notion

DeepCoder: A Fully Open-Source 14B Coder at O3-mini Level

Michael Luo*, Sijun Tan*, Roy Huang*, Ameen Patel, Alpay Ariyak, Qingyang Wu, Xiaoxiang Shi, Rachel Xin, Colin Cai, Maurice Weber, Ce Zhang, Li Erran Li, Raluca Ada Popa, and Ion Stoica

2025

Blog Code
Notion

DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL

Michael Luo*, Sijun Tan*, Justin Wong, Xiaoxiang Shi, William Y. Tang, Manan Roongta, Colin Cai, Jeffrey Luo, Tianjun Zhang*, Li Erran Li, Raluca Ada Popa, and Ion Stoica

2025

Blog Code Website
ICLR

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Sijun Tan*, Siyuan Zhuang*, Kyle Montgomery*, Willian Y. Tang, Alejandro Cuadron, Chenguang Wang, Raluca Ada Popa, and Ion Stoica

International Conference on Learning Representations (ICLR), 2025

Paper Code
EMNLP

LLoCO: Learning Long Contexts Offline

Sijun Tan*, Xiuyu Li*, Shishir Patil, Ziyang Wu, Tianjun Zhang, Kurt Keutzer, Joseph E. Gonzalez, and Raluca Ada Popa

Empirical Methods in Natural Language Processing (EMNLP), 2024

Paper Code
OSDI

Flock: A Framework for Deploying On-Demand Distributed Trust

Darya Kaviani*, Sijun Tan*, Pravein Govindan Kannan, and Raluca Ada Popa

Operating Systems Design and Implementation (OSDI), 2024

Paper Code
IEEE S&P

MPCAuth: Multi-factor Authentication for Distributed-trust Systems

Sijun Tan, Weikeng Chen, Ryan Deng, and Raluca Ada Popa

IEEE Symposium on Security and Privacy (Oakland), 2023

Paper TL;DR

Systems with distributed trust have attracted growing research attention and seen increasing industry adoptions. In these systems, critical secrets are distributed across N servers, and computations are performed privately using secure multi-party computation (SMPC). Authentication for these distributed-trust systems faces two challenges. The first challenge is ease-of-use. Namely, how can an authentication protocol maintain its user experience without sacrificing security? To avoid a central point of attack, a client needs to authenticate to each server separately. However, this would require the client to authenticate N times for each authentication factor, which greatly hampers usability. The second challenge is privacy, as the client’s sensitive profiles are now exposed to all N servers under different trust domains, which creates N times the attack surface for the profile data. To address both challenges, we present MPCAuth, a multi-factor authentication system for distributed-trust applications. Our system enables a client to authenticate to N servers independently with the work of only one authentication. In addition, our system is profile hiding, meaning that the client’s authentication profiles such as her email username, phone number, passwords, and biometric features are not revealed unless all servers are compromised. We propose secure and practical protocols for an array of widely adopted authentication factors, including email passcodes, SMS messages, U2F, security questions/passwords, and biometrics. Our system finds practical applications in the space of cryptocurrency custody and collaborative machine learning, and benefits future adoptions of distributed-trust applications.
IEEE S&P

CryptGPU: Fast Privacy Preserving Machine Learning on the GPU

Sijun Tan, Brian Knott, Yuan Tian, and David J Wu

IEEE Symposium on Security and Privacy (Oakland), 2021

Paper TL;DR Code

We introduce CryptGPU, a system for privacy-preserving machine learning that implements all operations on the GPU (graphics processing unit). Just as GPUs played a pivotal role in the success of modern deep learning, they are also essential for realizing scalable privacy-preserving deep learning. In this work, we start by introducing a new interface to losslessly embed cryptographic operations over secret-shared values (in a discrete domain) into floating-point operations that can be processed by highly-optimized CUDA kernels for linear algebra. We then identify a sequence of "GPU-friendly" cryptographic protocols to enable privacy-preserving evaluation of both linear and non-linear operations on the GPU. Our microbenchmarks indicate that our private GPU-based convolution protocol is over 150× faster than the analogous CPU-based protocol; for non-linear operations like the ReLU activation function, our GPU-based protocol is around 10× faster than its CPU analog. With CryptGPU, we support private inference and training on convolutional neural networks with over 60 million parameters as well as handle large datasets like ImageNet. Compared to the previous state-of-the-art, our protocols achieve a 2× to 8× improvement in private inference for large networks and datasets. For private training, we achieve a 6× to 36× improvement over prior state-of-the-art. Our work not only showcases the viability of performing secure multiparty computation (MPC) entirely on the GPU to newly enable fast privacy-preserving machine learning, but also highlights the importance of designing new MPC primitives that can take full advantage of the GPU’s computing capabilities.