Mechanistic interpretability
Software Engineering Undergraduate
I study how modern AI systems make decisions, where those decisions become fragile, and how interventions can turn mechanistic clues into reliable understanding.
Research Directions
Localizing sparse model components, causal features, and internal routing paths that explain downstream behavior (see the sketch after this list).
Studying how persuasive evidence can redirect model choices, and how compact interventions can monitor or block that shift.
Building on competitive programming experience to design careful experiments, efficient tooling, and robust evaluation pipelines.
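Concretely, "localizing causal components" usually means interventions in the style of activation patching: cache a component's activation on a clean run, splice it into a corrupted run, and measure how much of the original behavior recovers. Below is a minimal, model-agnostic sketch using PyTorch forward hooks; the toy network and the "block0" key are illustrative stand-ins, not code from any specific project.

```python
import torch

def caching_hook(cache: dict, key: str):
    """Record a module's output during the clean forward pass."""
    def hook(module, inputs, output):
        cache[key] = output.detach()
    return hook

def patching_hook(cache: dict, key: str):
    """Replace the module's output with the cached clean activation.
    Returning a value from a forward hook overrides the output."""
    def hook(module, inputs, output):
        return cache[key]
    return hook

if __name__ == "__main__":
    net = torch.nn.Sequential(
        torch.nn.Linear(4, 8), torch.nn.ReLU(), torch.nn.Linear(8, 2)
    )
    cache = {}

    # Clean run: cache the first block's activation.
    handle = net[0].register_forward_hook(caching_hook(cache, "block0"))
    clean_out = net(torch.randn(1, 4))
    handle.remove()

    # Corrupted run with the clean activation patched in. Because the
    # patched block fully determines the rest of this toy network, the
    # output recovers the clean behavior exactly.
    handle = net[0].register_forward_hook(patching_hook(cache, "block0"))
    patched_out = net(torch.randn(1, 4))
    handle.remove()

    print(torch.allclose(clean_out, patched_out))  # True
```

In a real model the same two hooks are attached per attention head or MLP block, and the recovery metric ranks which components carry the behavior.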
Publication
The paper identifies a compact causal mechanism behind persuasion-induced factual errors: a small group of mid-layer attention heads routes answer options through a low-dimensional choice geometry. Persuasion redirects attention toward a target option, and an intervention on a rank-one evidence-routing feature can steer or block the effect.
The mechanism appears across open-source LLMs and realistic poisoning settings such as Generative Engine Optimization, suggesting persuasion can be framed as a narrow, monitorable circuit rather than a diffuse loss of belief.
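As a rough illustration of what a rank-one intervention looks like in practice, the sketch below projects a chosen direction out of a module's output with a PyTorch forward hook. The direction and the hooked module are hypothetical stand-ins; this is the generic pattern, not the paper's released code.

```python
import torch

def make_ablation_hook(direction: torch.Tensor):
    """Return a forward hook that removes the component of the hidden
    state along `direction`: h <- h - (h . d) d for unit vector d."""
    d = direction / direction.norm()

    def hook(module, inputs, output):
        hidden = output[0] if isinstance(output, tuple) else output
        coeff = hidden @ d                        # (batch, seq)
        hidden = hidden - coeff.unsqueeze(-1) * d
        return (hidden, *output[1:]) if isinstance(output, tuple) else hidden

    return hook

if __name__ == "__main__":
    # Toy stand-in for a transformer block; a real use would hook a
    # decoder layer (e.g. model.model.layers[i] in a HuggingFace-style
    # model, path hypothetical) with an estimated routing direction.
    block = torch.nn.Linear(16, 16)
    direction = torch.randn(16)
    block.register_forward_hook(make_ablation_hook(direction))
    out = block(torch.randn(2, 5, 16))
    # The ablated output has (numerically) no component along `direction`.
    print((out @ (direction / direction.norm())).abs().max())  # ~0
```

Scaling the projected component instead of zeroing it would steer rather than block, which is the same knob the paper's steer/block experiments turn.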
Honors
ICPC Regional Contest (Shanghai), Silver Medal
CCPC Invitational Contest (Northeast), Gold Medal
RoboCom Developer Competition, National First Prize
Toolkit